<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Pattern for Re-engineering a Term-based Thesaurus, Which Follows the Record-based Model, to a Lightweight Ontology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Boris Villaz´on-Terrazas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mari Carmen Sua´rez-Figueroa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Asuncio´n Go´mez-P´erez</string-name>
          <email>asun@fi.upm.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ontology Engineering Group, Departamento de Inteligencia Artificial, Facultad de Inform ́atica, Universidad Polit ́ecnica de Madrid</institution>
          ,
          <addr-line>Spain WWW home page:</addr-line>
        </aff>
      </contrib-group>
      <fpage>108</fpage>
      <lpage>111</lpage>
      <abstract>
        <p>This pattern for re-engineering non-ontological resources (PR-NOR) fits in the Schema Re-engineering Category proposed by [3]. The pattern defines a procedure that transforms the term-based thesaurus components into ontology representational primitives. This pattern comes from the experience of ontology engineers in developing ontologies using thesauri in several projects (SEEMP1, NeOn2, and Knowledge Web3). The pattern is included in a pool of patterns, which is a key element of our method for re-engineering non-ontological resources into ontologies [2]. The patterns generate the ontologies at a conceptualization level, independent of the ontology implementation language.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <sec id="sec-1-1">
        <title>Problem</title>
        <p>Re-engineering a term-based thesaurus, which follows the record-based model, to design a
lightweight ontology.
A non-ontological resource holds a term-based
thesaurus which follows the record-based model. A
thesaurus represents the knowledge of a domain
with a collection of terms and a limited set of
relations between them.</p>
        <p>
          The record-based data model [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] is a
denormalized structure, uses a record for every term with
the information about the term, such as synonyms,
broader, narrower and related terms.
1 http://www.seemp.org
2 http://www.neon-project.org
3 http://knowledgeweb.semanticweb.org
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>Applicability</title>
        <p>The semantics of the relation between narrower and broader terms are subClassOf.</p>
        <p>ti
2.4. Identify the thesaurus term, tr , which
are related terms of ti. They are
referenced in the same record that contains
ti.
2.5. For each one of the above identified
the</p>
        <p>saurus term tr :
2.5.1. Create the corresponding ontology
class, Cr class, if it is not created
yet.
2.5.2. Set up the relatedClass relation
be</p>
        <p>tween Cr and Ci
2.5.3. Repeat from step 2.4 for tr as a new</p>
        <p>ti
2.6. Identify the thesaurus term, tq , which
are equivalent terms of ti. They are
referenced in the same record that contains
ti.
2.7. For each one of the above identified
the</p>
        <p>saurus term tq :
2.7.1. Apply the SynonymOrEquivalence</p>
        <p>(SOE) pattern.</p>
      </sec>
      <sec id="sec-1-3">
        <title>Example</title>
        <p>Suppose that someone wants to build a lightweight ontology based on the European Training
Thesaurus (ETT), which is a term-based thesaurus and it follows the record-based model.
The European Training Thesaurus (ETT)
constitutes the controlled vocabulary of reference in the
field of vocational education and training (VET)
in Europe. The relation semantics between the
sub-ordinate and the super-ordinate concepts is
subClassOf. This classification scheme is available
at http://libserver.cedefop.europa.eu/ett/en/</p>
      </sec>
      <sec id="sec-1-4">
        <title>Ontology Generated</title>
        <p>
          The ontology generated will be based on the
lightweight ontology architectural pattern
(APLW-01) [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>
          Each thesaurus term is mapped to a class. A
subClassOf relation is defined between the new
classes for the BT/NT relation. A relatedClass
relation is defined between the new classes for the
RT relation. For the UF/USE relations the
SynonymOrEquivalence (SOE) pattern [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] is applied.
        </p>
      </sec>
      <sec id="sec-1-5">
        <title>Related Resources</title>
        <p>
          This pattern is related to the architectural pattern AP-LW-01 [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] for modelling a lightweight
ontology.
This pattern is being applied to re-engineer the European Training Thesaurus
(ETT)4 into a Education Ontology5, within the context of the SEEMP project.
It contains over 2500 terms (1550 are descriptors, and 950 non descriptors). This
term-based thesaurus is modelled following the record-based data model.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Summary and Future Work</title>
      <p>
        We have presented a pattern for transforming a term-based thesaurus, which is
modelled following a record-based data model, into a lightweight ontology. The
pattern is included in a pool of patterns, which is a key element of our method
for re-engineering non-ontological resources into ontologies [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>We plan to develop software libraries within a framework that implement the
transformation process suggested by the pattern. Moreover, we will include
external resources to improve the quality of the resultant ontologies. Finally, we need
to calculate how much effort do we save re-engineering classification schemes
using patterns compared with re-engineering classification schemes without them.
Acknowledgments. This work has been partially supported by the European
Comission projects NeOn(FP6-027595) and SEEMP(FP6-027347), as well as by
an R+D grant from the UPM.
4 http://libserver.cedefop.europa.eu/ett/en/
5 The ontology will be available at http://droz.dia.fi.upm.es/hrmontology/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>C.</given-names>
            <surname>Roussey</surname>
          </string-name>
          and
          <string-name>
            <surname>O. Corcho.</surname>
          </string-name>
          <article-title>SynonymOrEquivalence (SOE) Pattern</article-title>
          . http://ontologydesignpatterns.org,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>A.</given-names>
            <surname>Garc</surname>
          </string-name>
          <article-title>´ıa, A. G´omez-P´erez</article-title>
          , M. C.
          <article-title>Su´arez-</article-title>
          <string-name>
            <surname>Figueroa</surname>
            , and
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Villaz</surname>
          </string-name>
          <article-title>´on-Terrazas. A Pattern Based Approach for Re-engineering Non-Ontological Resources into Ontologies</article-title>
          .
          <source>In Proceedings of the 3rd Asian Semantic Web Conference (ASWC2008)</source>
          . Springer-Verlag,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>V.</given-names>
            <surname>Presutti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gangemi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>David</surname>
          </string-name>
          , G. Aguado de Cea,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Surez-Figueroa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Montiel-Ponsoda</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Poveda. NeOn Deliverable</surname>
          </string-name>
          <article-title>D2.5.1. A Library of Ontology Design Patterns: reusable solutions for collaborative design of networked ontologies</article-title>
          .
          <source>In NeOn Project</source>
          . http://www.neon-project.org,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>D.</given-names>
            <surname>Soergel</surname>
          </string-name>
          .
          <article-title>Data models for an integrated thesaurus database</article-title>
          .
          <source>Comatibility and Integration of Order Systems</source>
          ,
          <volume>24</volume>
          (
          <issue>3</issue>
          ):
          <fpage>47</fpage>
          -
          <lpage>57</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>M. C.</surname>
          </string-name>
          <article-title>Su´arez-</article-title>
          <string-name>
            <surname>Figueroa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Brockmans</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <article-title>G´omez-P´erez</article-title>
          , J. Lehmann,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lewen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Presutti</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Sabou</surname>
          </string-name>
          .
          <article-title>Neon modelling components</article-title>
          .
          <source>Technical report, NeOn project deliverable D5.1.1</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>