<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Querying standardized EHRs by a Search Ontology XML extension (SOX)</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Medical Informatics, Bern University of Applied Sciences</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute for Medical Informatics, Statistics and Epidemiology (IMISE), Leipzig University</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Institute of Pathology, Leipzig University</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Motivation: The previously developed Search Ontology (SO) allows domain experts to formally specify domain concepts, search terms associated to a domain, and rules describing domain concepts. So far, Lucene search queries can be generated from information contained in the SO and can be used for querying literature data bases or PubMed. However, this is still insufficient, since these queries are not well suited for querying XML documents because they are not following their structure. However, in the medical domain, many information items are coded in XML. Thus, querying structured XML documents is crucial for retrieving similar cases or for identifying potential study participants. For example, information items of patients with a similar tumor classification documented in a certain section of the respective pathology report need to be retrieved. This requires a precise definition of queries. In this paper, we introduce a concept for the generation of such queries using a Search Ontology XML extension to enable semantic searches on structured data. Results: For a gain of precision, the paragraph of a document need to be specified, in which a specific information item expressed in a query is expected to appear. The Search Ontology XML Extension (SOX) connects search terms to certain sections in XML documents. The extension consists of a class which represents the XML structure and a relation between search terms and this XML structure. This enables an automatic generation of XPath expressions, which makes an efficient and precise search of structured pathology reports in XML databases possible. The combination of standardized Electronic Health Records with an ontology based query method promises a gain of precision, a high degree of interoperability and long term durability of both, XML documents and queries on XML documents.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>Since untagged information in health information systems (HISs) is
common, information access supported by automatic methods is
difficult. It is still an open question how to accelerate the access to
information captured in these systems or in Electronic Health Records
(EHRs). On the one hand, content must be structured by automatic
recognition processes. On the other hand, the structured data has to
be queried in a structured way.</p>
      <p>
        This paper will focus on the query side, by introducing a new
solution of semantic meaningful queries on structured XML
documents, defined by the Search Ontology (SO) XML Extension. The
SO
        <xref ref-type="bibr" rid="ref19">(Uciteli et al., 2014)</xref>
        has been developed to support full text
search on unstructured documents. It allows an user to formally
specify domain concepts, search terms associated to the domain, and
rules describing domain concepts. In this way, it simplifies the
definition of search rules. The SO can be used for information retrieval
in any domain by extending it by the corresponding domain
ontology.
      </p>
      <p>In this work, we introduce an extension of the SO that enables the
definition of queries on structured XML documents. Assuming that
we have structured and standardized XML documents, then we can
query certain parts of the XML document by XPath expressions. The
development of such XPaths is time consuming for domain experts,
but also for computer scientists. We suggest to use ontologies to
support domain experts in modelling XML queries.</p>
      <p>Out of the ontology based query models, XPaths can be generated
automatically, which in turn can be applied to document corpora on
XML database systems for searching similar cases or for the
identification of potential study participants. Even though the approach is
inherent independent from the underlying XML structure, we will
demonstrate the approach on an example of querying standardized
Electronic Health Records (EHRs) in the pathology domain.</p>
      <p>
        To address the problem of creating structured queries for
retrieving documents, previous work considered the unification of different
XML structures on the conceptual level, on the one hand by the
introduction of new query languages, e.g. CXPath
        <xref ref-type="bibr" rid="ref2 ref3">(Camillo et al.,
2003)</xref>
        or XSEarch (Cohen et al., 2004), or on the other hand by
introducing conceptual ontologies
        <xref ref-type="bibr" rid="ref4 ref8">(Cruz et al., 2004; Erdmann et al.,
1999)</xref>
        . In contrast to this unification approaches, the SOX approach,
introduced in this paper, is strongly bound to the used XML
structure. Indeed, this strong binding on a structure is only meaningful
when standardized XML based EHRs are used.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>METHODS</title>
      <sec id="sec-2-1">
        <title>Overview</title>
        <p>Figure 1 gives an overview on the basic approach presented in this
paper.</p>
        <p>
          Fig. 1. (1) The domain expert models the queries by the usage of SOX in
Protégé. (2) Using an extended version of the OntoQueryBuilder Plugin,
Protége generates XPath expressions out of the ontology. (3) The expert applies
the generated XPath expressions to an XML database, that (4) returns the
relevant documents.
A domain expert is in the middle of the query formulation and
retrieval process. He uses Protégé, the ontology editor of the Stanford
University
          <xref ref-type="bibr" rid="ref16">(Musen 2015)</xref>
          , for modeling a query using the Search
Ontology (section 2.2) and SOX (section 3.1) as shown on the left
side in figure 1. By an adaption of the OntoQueryBuilder Plugin it
will be possible to generate XPaths expressions. Additionally, the
agent interacts with the XML database as shown on the right hand
side of figure 1, After recognizing section boundaries, unstructured
documents are stored on an XML database (section 2.3). Using the
XPaths (sections 2.4, 3.2), the domain expert can retrieve relevant
XML documents.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Search Ontology</title>
        <p>The formulation of structured queries can be very time consuming,
especially in safety-relevant domains like post market surveillance.
A concept can be described in different ways, on the one hand by
synonyms, on the other hand by complex phrases, which in turn
consist of multiple terms. Because of that we distinguish
Simple_Terms from Composite_Terms.
Composite_Terms. are made up of Simple_Terms, related by
the Object Property has_part and are constrained by the
additional Data Property max_distance, which defines the word
distance between two Simple_Terms, where max_distance=0
represents that one word immediately follows after another word.
Writing variations, synonyms of abbreviations of the
Simple_Terms can be handled by the assignment of multiple labels to
the concrete individual of a Simple_Term.</p>
        <p>For instance, the complication of a medical device (e.g. occluder)
is a reusable Search_Concept, which can be described_by
several Search_Terms.</p>
        <p>To such descriptions belongs among other things adjective
phrases like incomplete closure. Instead of the adjective, other terms
with the same semantic meaning could be used; the noun could be
replaced by any term which represents the meaning of closure. Out
of this definition, a query can be generated, which is in this example
the disjunction of all combinations of adjectives and nouns (cf.
second disjunction in Listing 1).
Listing 1. Lucene Query for an occlusion device complication; the
expression was generated by the plugin OntoQueryBuilder.</p>
        <p>("occlusion device" OR occluder) AND
(("insufficient sealing"~2 OR "insufficient
closure"~2 OR "incomplete sealing"~2 OR
"incomplete closure"~2 OR "inadequate sealing"~2 OR
"inadequate closure"~2))
The latter example indicates that the formulation of a query can
become a complex task; the cross-product of only 10 adjectives with
10 nouns results in 100 adjective substantive combinations. Hence,
we have to manage Concepts and Terms by an appropriate ontology,
especially if we want to reuse concepts or if we want to generate
cross-products of certain term combinations.</p>
        <p>
          The SO is used in practice in the OntoVigilance project
          <xref ref-type="bibr" rid="ref17">(OntoVigilance Homepage 2016)</xref>
          , where semantic searches have to be
managed within post market surveillance queries of medical
devices. In brief, domain experts can manage their domain search
ontology (DSO). By the usage of the developed plugin
OntoQueryBuilder a Lucene query can be generated.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Standardized XML-based EHRs</title>
        <p>
          In this paper, we will concentrate on the special domain of
pathology, where a lot of semi-structured information occurs in terms of
pathology reports. We consider this information semi-structured,
because the pathologists structure their information by headers and
keywords, but these structure is usually not technically
implemented. In fact, pathology reports are based on certain section
patterns and section-introducing keywords, like material, macroscopy
or microscopy. We verified manually that documents originated
from the Institute of Pathology of Leipzig, the sections introducing
keywords like Material, Makroskopie or Mikroskopie were
constantly used for section tagging. Therefore, the reports can be
structured in sections by section boundary detection, which is not the
main focus of this article. Consequently, legacy data can be
transformed into a structured format. Suitable standards for long term
persistence are EN 14822 a.k.a. HL7 RIM
          <xref ref-type="bibr" rid="ref7">(EN 14822, 2006)</xref>
          and
EN 13606 a.k.a. openEHR
          <xref ref-type="bibr" rid="ref6">(EN 13606, 2012)</xref>
          . Both standards are
representable in XML, EN 14822 by the usage of CDA
          <xref ref-type="bibr" rid="ref5">(Dolin et al.,
2001)</xref>
          and EN 13606 by the usage of openEHR modeling tools
          <xref ref-type="bibr" rid="ref14">(Kropf et al., 2015)</xref>
          , which results in a standardized XML schema
based on the openEHR XML schemas
          <xref ref-type="bibr" rid="ref1">(Beale 2015)</xref>
          .
        </p>
        <p>
          In this work, we will use pathology reports, mapped to
standardized EHRs by the usage of the openEHR archetypes
openEHREHR-OBSERVATION.lab_test-histopathology.v1 for structuring
pathology data and openEHR-EHR-CLUSTER.tnm_staging_7th.v1
for structuring the TNM classification
          <xref ref-type="bibr" rid="ref18">(Sobin et al., 2011)</xref>
          data. Both
latter archetypes are available at the Clinical Knowledge Manager
(CKM) of openEHR [http://www.openehr.org/ckm/]. Consider the
following snippet of an XML based pathology EHR (cf. Listing 2)
where we demonstrate the challenges of querying the content. They
are mainly due to the linguistic variability of natural language.
Listing 2. Simplified XML based pathology EHR snippet, containing a
macroscopy and a TNM classification part. The snippet was cut to the
necessary elements, which we want to address in the query in this paper,
marked by a grey background: the macroscopy section (part of
openEHREHR-OBSERVATION.lab_test-histopathology.v1) and the primary tumor
classification (part of openEHR-EHR-CLUSTER.tnm_staging_7th.v1). The
doubling of the value tag is a result of the EN 13606 reference model, in
practice the two value tags have different namespace declarations.
&lt;Pathology […]
&lt;Macroscopic_findings&gt;
&lt;name&gt;
        </p>
        <p>&lt;value&gt;Makroskopisch&lt;/value&gt;
&lt;/name&gt;
&lt;Overall_macroscopic_description&gt;
&lt;name&gt;</p>
        <p>&lt;value&gt;Makroskopisch&lt;/value&gt;
&lt;/name&gt;
&lt;value&gt;
The TNM structured classification string in the XML snippet is
“pT2”. The sentence which introduces the overall macroscopy
description contains a noun Hautexzidat (HE) (en: excised skin
material), marked by a bold font. Due to linguistic variability, this noun
can vary, i.e. synonyms or abbreviations such as H.E. are used in
practice. In front of the noun there is an underlined adjective
keilförmig (en: cuneiform) for specifying the shape. Again, semantic and
linguistic variants of the term exist (e.g. rundlich (en: roundish)).
Furthermore, the order of the adjectives in the phrase could change:
the order “Ein ff. rundliches fadenmarkiertes H.E.” was found and
is also valid.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4 XPath Queries</title>
        <p>
          When EHRs are stored in structured XML, another query language
is more suitable than classical free text retrieval methods such as
Lucene
          <xref ref-type="bibr" rid="ref15">(McCandless et al., 2010)</xref>
          or SOLR
          <xref ref-type="bibr" rid="ref19">(Trey et al., 2014)</xref>
          .
XPath expressions are following the structure of the EHRs and are
a W3C standardized method for addressing parts in XML documents
(
          <xref ref-type="bibr" rid="ref20">XML Path Language (XPath), 2015</xref>
          ). An example XPath Query is
shown in Listing 3 for querying T2 and phrases of HE from EHR
documents similar to those in Listing 2.
Listing 3. Required XPath expressions for a search of EHRs which
contains T2 as primary tumor classification in the first part and defined phrases
of HE in the second part.
/Pathology/Tumour__-__TNM_Cancer_staging_7th_Edition/
Primary_tumour__openBrkt_T_closeBrkt_/
value[contains(value,'T2')]
/Pathology/Macroscopic_findings/Overall_macroscopic_description/
value[matches(value,'keilförmig(\w)* ([\w]*\s){0,2}Hautexzidat')]
or
/Pathology/Macroscopic_findings/Overall_macroscopic_description/
value[matches(value,'rundlich(\w)* ([\w]*\s){0,2}H.E.)]
or
/Pathology/Macroscopic_findings/Overall_macroscopic_description/
value[matches(value,'keilförmig(\w)* ([\w]*\s){0,2}H.E.)]
or
/Pathology/Macroscopic_findings/Overall_macroscopic_description/
value[matches(value,'rundlich(\w)* ([\w]*\s){0,2} Hautexzidat'')]
The first part of Listing 3 matches documents where the tumor class
is T2. In the second part, each disjunction represents one adjective
noun phrase. It consists of an adjective and a regular expression that
reflects possible declension variations followed by the noun phrase
representing Hautexzidat (HE). The expression ([\w]*\s){0,2}
implies that between the adjective and the noun a maximum of two
words are allowed to match the pattern.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>RESULTS</title>
      <sec id="sec-3-1">
        <title>3.1 Extension of the Search Ontology</title>
        <p>The output of the SO are Lucene queries, but they do not follow the
structure of XML documents, thus, they are not applicable to XML
documents. However, the SO delivers already a reusable framework
which only has to be extended for enabling structured queries in
XML. By extending the SO with the SOX, queries are automatically
producible out of the ontology, which can be executed on XML
documents. For this purpose, two elements were added to the SO, on the
top level of the ontology the class XML_Structure and the Object
Property in.
XML_Structure represents the XML document structure.
Namespaces and tag names of the XML document are defined by
XML_Structure class labels. Figure 5 illustrates the SO for the
described use case, where the documents follow the structure of
Listing 2 and XPaths of Listing 3 have to be generated as output.</p>
        <p>The modelling of the SO (illustrated in fig. 5) has to be done
manually by the domain expert. For querying HEs with different kinds
of shapes the Search_Concept HE_Shapes was defined,
described_by HE_Phrase, which consists of two
Simple_Terms (HE_Form and HE_Term). In the example of this
paper, individuals of HE_Form can be adjectives like rundlich or
keilförmig; HE_Term has only one individual, the different writing
variations (Hautexzidat, H.E.) can be handled by multiple label
assignments. Figure 5 illustrates also the usage of the in relation for
the specification of the position inside the XML document, for
instance is the term T2_Term expected in the associated section
Primary_Tumor. In a similar way, HE_Shapes are bound to the
XML_Structure. The following figure 4 shows the class
definition of HE_Shapes in Protégé.
In summary, out of the DSO XML extension, it is possible to
generate automatically the required XPath expressions.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Automatic XPath generation</title>
        <p>
          For each Search_Concept one XPath can be generated
automatically. This can be done by a scheduled adaption of the Lucene
export plugin, the OntoQueryBuilder, which is already developed as
part of the OntoVigilance
          <xref ref-type="bibr" rid="ref17">(OntoVigilance Homepage 2016)</xref>
          project.
The first part of Listing 3 can be generated out of the
Search_Concept T2, in which description the term T2_Term
is bound to the appropriate search location by the in relation. The
second part of Listing 3 is producible out of the
Search_Concept HE_Shapes, which is described_by the
Composite_Term HE_Phrase and is expected in the
Overall_macroscopic_description. This
Composite_Term yields to a disjunction expression of all combinations
of the labels of the HE_Form individuals with the labels of
HE_Term individuals, which is in essence a kind of a cross product.
        </p>
        <p>The generated XPaths can be used for structured queries and for
the integration in other XML techniques (XSLT or XQuery).
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>DISCUSSION</title>
      <p>With an example on querying structured EHRs, we introduced an
extension of the Search Ontology to support querying structured
XML documents. The SOX approach can simplify the managing of
a big pool of XPath expressions in one overarching DSO in practice.
4.1</p>
      <sec id="sec-4-1">
        <title>Standardized queries on standardized EHRs</title>
        <p>Indeed, SPARQL queries on OWL based patient data would be more
powerful than XPath expressions on XML, but a comprehensive and
long term persistence storage of pathology data within semantic web
technologies is only partially solved and still an open research
question. Therefore, until there is no standardized domain ontology
available, queries on standardized XML will be more stable and long
term durable. To put it in brief, the first requirement is a layer of
standardized EHRs and tools which work at this layer, like the
introduced SOX. After that step a more powerful ontology layer is
demandable.</p>
        <p>We used the EN 13606 standardized XML in this work. However,
another option would be EN 14822 or even any other proprietary
XML format. When the community comes to an agreement, which
EHR standard will be used in German Health Information Systems
in future, not only the EHR would be interoperable, the usage of a
standardized query language implies that queries could be
interoperable too. Presupposed standardized EHRs would be used, it is
imaginable that queries on such EHRs can be interoperable and
therefore used in different hospitals. When openEHR is used, the
depending SOX or the resulting queries could be stored in a repository and
they could be linked to the belonging archetypes. Consequently, the
query can be reused like the archetype itself, this would save
development time and lead to quality intensification.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Recognition methods vs. querying methods</title>
        <p>
          Research in Natural Language Processing (NLP) delivers methods
          <xref ref-type="bibr" rid="ref12 ref9">(Hahn et al., 2002; Friedman et al., 1999)</xref>
          for the recognition of
clinical information in medical documents. Nevertheless, it is unclear
when a reliable NLP system will automatically recognize and
annotate free text pathology reports or any other free textual clinical
documents in daily practice. It is important to think about practical
solutions on the recognition side, but also on the query side. The SOX
delivers a practical solution on the query side by the connection of
Search Terms to parts in XML documents.
        </p>
        <p>Listing 3 already respects declension forms of adjectives which
have a stable word stem. But for the development of a minimal SO,
NLP methods like stemming are necessary too. Because of that, we
have to think about the integration of such methods into ontological
contemplations about querying structured information in the near
future.
4.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Future ontological work</title>
        <p>
          XML elements are more than symbolic structures, they have to be
considered in detail, last but not least they should have be bound to
a top level ontology like the General Formal Ontology (GFO)
          <xref ref-type="bibr" rid="ref13">(Herre
2010)</xref>
          . In the SOX, the XML document structure was realized by
is_a relations, because we wanted to model queries in tree
structures in standard Protégé. Of course has_part relations would be
semantically better. For this reason, we plan to develop a proper
plugin for the modeling of has_part relations in tree structures in
Protégé. In addition, an automatic conversion of XML documents
into a SOX XML_Structure tree is demandable; this would
accelerate the query development in Protégé. X2OWL can generate an
OWL ontology from an XML data source
          <xref ref-type="bibr" rid="ref10">(Ghawi et al., 2009)</xref>
          and
is a good starting point.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSION</title>
      <p>When EHRs are persisted in standardized XML, it is possible to
query them in a structured way. The introduced Search Ontology
XML extension connects search terms to certain parts in XML
documents and enables an ontology based definition of semantic
searches. Out of this, XPath expressions can be generated for
querying XML database systems. Our solution supports the reuse
oriented specification of complex and powerful XPath expressions
without deep syntactic knowledge about XPath. The approach is
open for additional extensions; parts of the ontology can be reused
and adapted easily for other use cases.</p>
    </sec>
    <sec id="sec-6">
      <title>ACKNOWLEDGEMENT</title>
      <p>Thanks to Claire Chalopin, Wolf Müller, Katrin Schierle, Lars
Voitel, Christian Wittekind for their support, to the reviewers of ODLS
for their constructive feedback, especially Dagmar Waltemath, and
the organizers, among which we mention Frank Loebe and Daniel
Schober, for arranging ODLS. This work was conducted using the
Protégé resource, which is supported by grant GM10331601 from
the National Institute of General Medical Sciences of the United
States National Institutes of Health.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Beale</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>openEHR reference-</article-title>
          models
          <source>XSDs release 1.0</source>
          .2 https://github.com/openEHR/reference-models/tree/master/models/openEHR/Release-1.0.2/XSD [cited
          <year>2016</year>
          -
          <volume>08</volume>
          -30]
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Camillo</surname>
            ,
            <given-names>S. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carlos</surname>
            <given-names>A. H.</given-names>
          </string-name>
          <article-title>and dos Santos Mello</article-title>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          (
          <year>2003</year>
          )
          <article-title>Querying heterogeneous XML sources through a conceptual schema</article-title>
          .
          <source>International Conference on Conceptual Modeling</source>
          , pp.
          <fpage>186</fpage>
          -
          <lpage>199</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          et al. (
          <year>2003</year>
          )
          <article-title>XSEarch: A semantic search engine for XML</article-title>
          .
          <source>Proceedings of the 29th international conference on Very large data bases</source>
          . Volume
          <volume>29</volume>
          , pp.
          <fpage>45</fpage>
          -
          <lpage>56</lpage>
          ,
          <string-name>
            <given-names>VLDB</given-names>
            <surname>Endowment</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Cruz</surname>
            ,
            <given-names>I. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Hsu</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2004</year>
          )
          <article-title>An ontology-based framework for XML semantic integration</article-title>
          .
          <source>Database Engineering and Applications Symposium</source>
          . IDEAS'
          <fpage>04</fpage>
          .
          <string-name>
            <surname>Proceedings</surname>
          </string-name>
          . International, pp.
          <fpage>217</fpage>
          -
          <lpage>226</lpage>
          , IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Dolin</surname>
            ,
            <given-names>R. H.</given-names>
          </string-name>
          et al. (
          <year>2001</year>
          )
          <article-title>The HL7 Clinical Document Architecture</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>8</volume>
          (
          <issue>6</issue>
          ), pp.
          <fpage>552</fpage>
          -
          <lpage>569</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>EN</surname>
          </string-name>
          <year>13606</year>
          .
          <article-title>(2012) Health informatics - Electronic health record communication [Norm]</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>EN</surname>
          </string-name>
          <year>14822</year>
          .
          <article-title>(2006) Health informatics - General purpose information components</article-title>
          [Norm].
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Erdmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Studer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>1999</year>
          )
          <article-title>Ontologies as conceptual models for XML documents</article-title>
          .
          <source>Proceedings of the 12th International Workshop on Knowledge Acquisition, Modelling and Mangement (KAW'99)</source>
          , Banff, Canada.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Hripcsak</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>1999</year>
          )
          <article-title>Natural language processing and its future in medicine</article-title>
          .
          <source>Academic Medicine</source>
          <volume>74</volume>
          (
          <issue>8</issue>
          ), pp.
          <fpage>890</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Ghawi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Cullot</surname>
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2009</year>
          )
          <article-title>Building Ontologies from XML Data Sources</article-title>
          .
          <source>DEXA Workshops</source>
          , pp.
          <fpage>480</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Grainger</surname>
            ,
            <given-names>T</given-names>
          </string-name>
          , Potter,
          <string-name>
            <given-names>T.</given-names>
            and
            <surname>Seeley</surname>
          </string-name>
          <string-name>
            <surname>Y.</surname>
          </string-name>
          (
          <year>2014</year>
          )
          <article-title>Solr in action</article-title>
          . Manning.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Hahn</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Romacker</surname>
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Schulz</surname>
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2002</year>
          )
          <article-title>MEDSYNDIKATEa natural language system for the extraction of medical information from findings reports</article-title>
          .
          <source>International journal of medical informatics 67(1)</source>
          , pp.
          <fpage>63</fpage>
          -
          <lpage>74</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Herre</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>General Formal Ontology (GFO): A foundational ontology for conceptual modelling</article-title>
          .
          <article-title>Theory and applications of ontology: computer applications</article-title>
          . Springer Netherlands, pp.
          <fpage>297</fpage>
          -
          <lpage>345</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Kropf</surname>
            ,
            <given-names>S</given-names>
          </string-name>
          , Chalopin,
          <string-name>
            <given-names>C.</given-names>
            and
            <surname>Denecke</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Template and Model Driven Development of Standardized Electronic Health Records</article-title>
          .
          <source>Studies in health technology and informatics 216</source>
          , pp.
          <fpage>30</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>McCandless</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hatcher</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gospodnetic</surname>
          </string-name>
          , Otis. (
          <year>2010</year>
          )
          <article-title>Lucene in Action: Covers Apache Lucene 3.0</article-title>
          . Manning Publications Co.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          <article-title>The Protégé project: A look back and a look forward</article-title>
          .
          <source>AI Matters. Association of Computing Machinery Specific Interest Group in Artificial Intelligence</source>
          ,
          <volume>1</volume>
          (
          <issue>4</issue>
          ),
          <year>2015</year>
          , pp.
          <fpage>4</fpage>
          -
          <lpage>12</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>OntoVigilance Homepage</surname>
          </string-name>
          (
          <year>2016</year>
          ) http://www.ontovigilance.org/ [cited 2016-
          <volume>06</volume>
          -16]
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Sobin</surname>
            ,
            <given-names>L. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gospodarowicz</surname>
            ,
            <given-names>M. K.</given-names>
          </string-name>
          and Wittekind C., eds. (
          <year>2011</year>
          )
          <article-title>TNM classification of malignant tumours</article-title>
          . John Wiley &amp; Sons.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Uciteli</surname>
          </string-name>
          , Alexandr et al. (
          <year>2014</year>
          )
          <article-title>Search Ontology, a new approach towards Semantic Search</article-title>
          . GI-Jahrestagung, pp.
          <fpage>667</fpage>
          -
          <lpage>672</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>XML Path Language (XPath)</surname>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Version 1</article-title>
          .0.
          <string-name>
            <given-names>W3C</given-names>
            <surname>Recommendation</surname>
          </string-name>
          . https://www.w3.org/TR/xpath/ [cited 2016-
          <volume>06</volume>
          -09]
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>