<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>UMLS-OWL: an OWL 2 translation of the Unified Medical Language System (UMLS®) Semantic-Network and Metathesaurus for publishing in the semantic web</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Michel Héon</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joseph Aubut</string-name>
          <email>Joseph.aubut@plexusmed.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sheila Gaudreau</string-name>
          <email>Sheila.gaudreau@plexusmed.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Cotechnoe</institution>
          ,
          <addr-line>Montréal (QC)</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Plexus Med</institution>
          ,
          <addr-line>Trois-Rivières (QC)</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Background: The Unified Medical Language System (UMLS) is a formal ontology of the biomedical and health domain. Its goal is to offer a vocabulary that facilitates the exchange of information between informatics systems. The semantics and basic facts of the UMLS are contained in its Semantic Network and Metathesaurus. The objective of this research is to translate their content into an OWL 2 ontology that can be queried and reasoned over through a SPARQL endpoint. Methods: The UMLS was translated into UMLS-OWL by classifying the conceptual entities, atoms, and relationships of the UMLS into the description logic T-BOX, R-BOX, and A-BOX. Results: The experiment validates the interoperability of the transformation by using semantic web tools to query UMLS-OWL.</p>
      </abstract>
      <kwd-group>
        <kwd>UMLS</kwd>
        <kwd>UMLS-OWL</kwd>
        <kwd>UMLS on semantic web</kwd>
        <kwd>Ontology transformation</kwd>
        <kwd>Biomedical ontology</kwd>
        <kwd>Biomedical linked open data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Context: Created in 1986 by the U.S. National Library of Medicine, the Unified
Medical Language System (UMLS®) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is a database that brings together numerous
biomedical and health vocabularies and standards so as to facilitate interoperability between
informatics systems. The UMLS is used to improve or develop applications such as
Electronic Health Record classification tools, dictionaries, and their translations into
many languages. The UMLS is a formal ontology, but it is not a semantic web
ontology. To make this vocabulary more interoperable for use by
web agents, the UMLS must be expressed in a semantic web language
such as OWL.
Goal: The goal of this research is to translate the entire UMLS Semantic Network
and Metathesaurus from a relational representation model into an OWL 2
representation. To do this, we rely on the hypothesis that the translation is possible
because the UMLS is structured with an ontological architecture.
      </p>
      <p>
        Related work: Several earlier studies [
        <xref ref-type="bibr" rid="ref2 ref3 ref4 ref5">2-5</xref>
        ] proposed methods for translating the UMLS
into semantic web ontologies by formalizing the Semantic
Network of the UMLS. Their aims were to emphasize the semi-formal character of the
UMLS [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], to use description logic (DL) to reveal inconsistencies
between statements originating from different data sources [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], to evaluate the
capacity of an OWL-UMLS to connect different agents to the UMLS [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and finally
to formalize the Semantic Network module of the UMLS so that it serves as a basic
vocabulary for an upper ontology [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
    </sec>
    <sec>
      <title>Methodology</title>
      <p>
Procedure: Fig. 1 presents the three-step process that transforms the UMLS Semantic
Network and Metathesaurus from text format into
OWL 2 syntax. The first step uses MetamorphoSys, the installation
wizard provided with the UMLS, to load the text-format UMLS into a
MySQL database. The second step produces UMLS-OWL by
translating the relational representation of the UMLS into an OWL representation according to the
mapping architecture in Fig. 2, using a combination of Java Persistence API and
Apache Jena OWL-API<sup>1</sup> technologies. The third and last step stores
UMLS-OWL in a triplestore and makes it available on the web through a Linked
Open Data (LOD)<sup>2</sup> server so that it can be accessed through a SPARQL endpoint.
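The second step can be illustrated with a minimal sketch, under stated assumptions. It uses Python with an in-memory SQLite table standing in for the MySQL database, and plain tuples standing in for the triples that the authors produce with Java Persistence API and Apache Jena. The umls: prefix, the placeholder identifiers, and the choice to type each atom (AUI) by its concept (CUI) are assumptions made for illustration; only the MRCONSO table and its CUI, AUI, and STR columns come from the UMLS distribution.
<preformat>
```python
import sqlite3

# Hypothetical prefix: the paper does not state the UMLS-OWL namespace.
PREFIX = "umls:"


def load_sample_rows(conn):
    """Create a tiny MRCONSO-like table with placeholder rows (not real UMLS data)."""
    conn.execute("CREATE TABLE MRCONSO (CUI TEXT, AUI TEXT, STR TEXT)")
    conn.executemany(
        "INSERT INTO MRCONSO VALUES (?, ?, ?)",
        [
            ("C0000001", "A0000001", "example disorder"),
            ("C0000001", "A0000002", "example disorder (synonym)"),
        ],
    )


def rows_to_triples(conn):
    """Map each relational row to (subject, predicate, object) tuples.

    Assumption: an atom is typed by its concept and labelled by its string.
    """
    triples = []
    query = "SELECT CUI, AUI, STR FROM MRCONSO ORDER BY AUI"
    for cui, aui, label in conn.execute(query):
        triples.append((PREFIX + aui, "rdf:type", PREFIX + cui))
        triples.append((PREFIX + aui, "rdfs:label", label))
    return triples


conn = sqlite3.connect(":memory:")
load_sample_rows(conn)
triples = rows_to_triples(conn)
for triple in triples:
    print(triple)
```
</preformat>
Each row yields one typing triple and one label triple. A real run would hand these statements to an RDF library for serialization rather than print tuples.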
<sup>1</sup> https://jena.apache.org/
<sup>2</sup> Blazegraph (https://www.blazegraph.com/) in our case
UMLS primitive classification in the DL architecture: Fig. 2 shows how the
UMLS primitives are distributed across the DL architecture. The primitives
span four levels of abstraction:
1. Metadata level: it encompasses all the primitives associated with the table
names and column names of the UMLS tables
2. Semantic level: it includes the semantic types (TUI) used for typological referencing
3. Conceptual level: it consists of the conceptual entities (CUI)
4. Factual level: it contains the atomic entities (AUI) and roles (RUI)
Each UMLS primitive is classified according to the DL architecture, which
is divided into two abstraction levels: the DL abstract level, which includes the T-BOX and
R-BOX primitives, and the DL factual level, which stores the elements of the A-BOX.
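The classification above can be sketched as a lookup from identifier prefix to level, again in Python and only as an illustration. The level names come from the list above; the assignment of each level to a DL box is an assumption about Fig. 2, not a statement of the authors' mapping, and the fallback to the metadata level for any other name is a simplification.
<preformat>
```python
import re

# Assumed mapping from identifier prefix to (UMLS level, DL box).
# The levels follow the four-level list; the box column is a guess at Fig. 2.
LEVELS = {
    "T": ("semantic", "T-BOX"),
    "C": ("conceptual", "T-BOX"),
    "A": ("factual", "A-BOX"),
    "R": ("factual", "A-BOX"),
}

# One prefix letter followed by digits, e.g. a TUI, CUI, AUI or RUI.
ID_PATTERN = re.compile(r"^([TCAR])\d+$")


def classify(identifier):
    """Return (level, box) for a UMLS identifier, or the metadata level otherwise."""
    found = ID_PATTERN.match(identifier)
    if found is None:
        # Table and column names carry no identifier prefix.
        return ("metadata", None)
    return LEVELS[found.group(1)]


for name in ["T047", "C0000001", "A0000001", "R00000001", "MRCONSO"]:
    print(name, classify(name))
```
</preformat>
Keeping the mapping in a single table makes it easy to revise once the actual assignment in Fig. 2 is known.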
The experiment consisted of validating the use of
UMLS-OWL as a suitable vocabulary for the semantic use of a database. It also
served to validate the publication of the database on a LOD server. Fig. 3 presents
an overview of the programmed scenario using the UMLS-OWL vocabulary. The left
column presents a taxonomic view of the UMLS Semantic Network. The right
part of Fig. 3 shows, in G-OWL [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] notation, the has_manifestation association
between five illnesses (names in bold) and their Findings and Signs or Symptoms.
To demonstrate the interoperability of UMLS-OWL, the above scenario
was put online on the LOD server (http://iswc2017.plexusld.com:8080/bigdata/#query) and queried with RelFinder [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] (http://iswc2017.plexusld.com/relfinder; see Fig. 4).
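The kind of question put to the LOD server can be sketched without a triplestore. The Python fragment below matches a single triple pattern, in the spirit of the SPARQL pattern ?illness umls:has_manifestation ?m, over a toy in-memory graph. All resource names (ex:IllnessA, umls:Finding, and so on) are hypothetical placeholders, not data from the scenario in Fig. 3, and the matcher is a stand-in for a SPARQL engine such as the one in Blazegraph.
<preformat>
```python
# Toy graph with placeholder names; not data from the paper or from the UMLS.
TRIPLES = [
    ("ex:IllnessA", "umls:has_manifestation", "ex:Finding1"),
    ("ex:IllnessA", "umls:has_manifestation", "ex:Symptom1"),
    ("ex:IllnessB", "umls:has_manifestation", "ex:Symptom1"),
    ("ex:Finding1", "rdf:type", "umls:Finding"),
    ("ex:Symptom1", "rdf:type", "umls:SignOrSymptom"),
]


def match(pattern, triples):
    """Yield variable bindings for one triple pattern.

    Terms that start with '?' are variables; all other terms must match exactly.
    """
    for triple in triples:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                if binding.get(term, value) != value:
                    break
                binding[term] = value
            elif term != value:
                break
        else:
            yield binding


def shared_manifestations(illness_a, illness_b, triples):
    """Return the manifestations linked to both illnesses, sorted by name."""
    predicate = "umls:has_manifestation"
    first = {b["?m"] for b in match((illness_a, predicate, "?m"), triples)}
    second = {b["?m"] for b in match((illness_b, predicate, "?m"), triples)}
    return sorted(first.intersection(second))


print(shared_manifestations("ex:IllnessA", "ex:IllnessB", TRIPLES))
```
</preformat>
Looking for manifestations shared by two illnesses mirrors, in a very reduced form, the relationship discovery that RelFinder performs between selected resources.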
      </p>
    </sec>
    <sec id="sec-2">
      <title>Conclusion</title>
      <p>This article presents UMLS-OWL, a version of the UMLS Semantic Network and
Metathesaurus for the semantic web and LOD servers. Each UMLS primitive is
classified in the T-BOX, R-BOX, or A-BOX according to the DL and the semantic
level to which it corresponds. The viability of the UMLS-OWL ontological architecture
was demonstrated by the successful use of semantic web tools, and its interoperability
by its publication on a LOD server. Work is under way to automate the construction and
consolidation of the database, using a natural language
processor to identify and classify texts of the Metathesaurus according to the
UMLS-OWL semantics. We plan to use this work to develop a knowledge base that
will be used to create a diagnostic assistant application.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>1. https://www.nlm.nih.gov/research/umls/</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Kashyap</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Borgida</surname>
          </string-name>
          , A.:
          <article-title>'Representing the UMLS® Semantic Network Using OWL', in</article-title>
          <string-name>
            <surname>Fensel</surname>
          </string-name>
          , D.,
          <string-name>
            <surname>Sycara</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Mylopoulos</surname>
            ,
            <given-names>J</given-names>
          </string-name>
          . (Eds.): 'The Semantic Web - ISWC 2003: Second International Semantic Web Conference, Sanibel Island, FL, USA
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Jiménez-Ruiz</surname>
          </string-name>
          , E et al.:
          <article-title>'Logic-based assessment of the compatibility of UMLS ontology sources'</article-title>
          ,
          <source>Journal of Biomedical Semantics</source>
          ,
          <year>2011</year>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bhatt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et al. '
          <article-title>Ontology driven semantic profiling and retrieval in medical information systems'</article-title>
          ,
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          ,
          <year>2009</year>
          ,
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Schulz</surname>
          </string-name>
          , S. et al.:
          <article-title>'BioTopLite: An Upper Level Ontology for the Life Sciences Evolution, Design and Application'</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Héon</surname>
          </string-name>
          , M et al.:
          <article-title>'Toward G-OWL: A Graphical, Polymorphic And Typed Syntax For Building Formal OWL 2 Ontologies'</article-title>
          .
          <source>Proceedings of the 25th International Conference Companion on World Wide Web, Montréal</source>
          , Québec, Canada 2016
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Heim</surname>
          </string-name>
          , P. et al.:
          <article-title>'Interactive Relationship Discovery via the Semantic Web'</article-title>
          , in
          <string-name>
            <surname>Aroyo</surname>
          </string-name>
          , L.,
          <string-name>
            <surname>Antoniou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hyvönen</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>ten Teije</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabral</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Tudorache</surname>
          </string-name>
          , T. (Eds.): '
          <source>The Semantic Web: Research and Applications: 7th Extended Semantic Web Conference, ESWC</source>
          <year>2010</year>
          , Heraklion, Crete, Greece, May 30 -June 3,
          <year>2010</year>
          , Proceedings,
          Part I
          ' (Springer Berlin Heidelberg,
          <year>2010</year>
          ), pp.
          <fpage>303</fpage>
          -
          <lpage>317</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>