<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>RReevveerrssee--eennggiinneeeerriinngg ooff XXMMLL SScchheemmaass:: AA SSuurrvveeyy??</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jakub Kl´ımek</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Neˇcasky´ Jakub Kl mek</string-name>
          <email>klimek@Rkseip.umbflifc.cuni.cz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Necasky</string-name>
          <email>necaskyg@ksi.mff.cuni.cz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>XML Research Group, Department of Software Engineering FacuXlMtyLofRMesaetahrcehmaGtricosupa,ndDePphayrstimcse,nCthoafrSleosftUwnariveeErsnitgyinienerPinraggue Faculty of MMatahleomstartaincsska ́ennda ́Pmhˇeysts ́ıic2s5,</institution>
          ,
          <addr-line>C1h1a8rl0e0s PUrnaihvears1ity in Prague MalostransTkheenCamzeecsht R2e5p,u1b1l8ic00 Praha 1</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <fpage>96</fpage>
      <lpage>107</lpage>
      <abstract>
        <p>As approaches to conceptual modeling of XML data become more popular, a need arises to reverse-engineer existing schemas to the conceptual models. They make the management of XML schemas easier as well as provide means for accomplishing integration of various XML data sources. Some methods for reverse-engineering of XML schemas have been proposed and in this paper, they are compared using various criteria such as used XML schema languages, level of user involvement, number of XML schemas that can be covered by the conceptual model or support for consecutive XML schema evolution. They are also evaluated according to their potential to be used as parts of a system for management, evolution and integration of XML as a whole.</p>
      </abstract>
      <kwd-group>
        <kwd>XML</kwd>
        <kwd>schema</kwd>
        <kwd>reverse-engineering</kwd>
        <kwd>conceptual modeling</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Introduction
Today, XML [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] is a technology used in a wide variety of scenarios, from a
message format used by web services [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] to storage of data in databases [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. As the
number of possible usages of XML grows, so does the need of easy management
of large numbers of XML data sources and their integration. There we can use
conceptual modeling of XML data. It allows a domain expert to model the
problem domain independently of the implementation (XML in our case) and
then create corresponding XML schemas, which are used to describe a structure
of XML documents.
      </p>
      <p>
        A common situation today is that a company uses several XML formats
for various purposes and has these formats described by an XML schema. To
ease the process of managing those formats and schemas as they evolve in time,
the company can use a conceptual model such as [
        <xref ref-type="bibr" rid="ref1 ref15 ref16 ref19 ref2 ref23">1, 2, 15, 16, 19, 23</xref>
        ], to which
the schemas would be connected. A problem usually arises when there is a new
? This work was supported in part by the Czech Science Foundation (GA CˇR), grant
number P202/10/0573.
format that should be connected to the conceptual model, as most of the
conceptual models do not support this operation well enough. During the process of
connecting the new format to the conceptual model, the model may need to be
extended, if the format contains a concept that was not covered by the model.
A special case of this problem is, when the company does not have a conceptual
model at all, and wants to create it from the schemas which they already have
(they extend an empty model).
      </p>
      <p>Therefore, an important aspect of the approaches used for conceptual
modeling of XML data is if and how we can create the model from existing XML
schemas (or connect a new schema to an existing model) and once we have it,
if we can use it for the management of evolution of our set of schemas. The
process of creating a conceptual model from existing XML schemas is what we
call Reverse-engineering of XML schemas.</p>
      <p>In this paper, we compare and evaluate approaches for reverse-engineering
of XML schemas according to various criteria, including their usefulness as a
method that can be integrated into a system for management of evolution of
XML schemas.
1.1</p>
    </sec>
    <sec id="sec-2">
      <title>Outline</title>
      <p>The rest of this paper is structured as follows. In section 2, an introduction to
the frequently used techniques in this area is given. In section 3 we introduce
our framework for evolution and integration of XML schemas, against which the
approaches will be evaluated. Section 4 contains a description of our comparison
criteria. In section 5, we describe approaches to the problem, which
reverseengineer XML schemas to various user-friendlier models. In section 6, approaches
to reverse-engineering to ontologies are described. In section 7 we summarize our
findings and section 8 concludes.
2</p>
      <p>Terms
In this section we provide an introduction to basic techniques used widely in the
area of reverse-engineering of XML schemas.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>XML schemas</title>
      <p>
        In this paper, by XML schema language we mean one of the XML schema
languages such as DTD [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], XML Schema [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], Relax NG [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Schematron [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
etc. We state this because sometimes an XML schema gets confused with the
actual XML Schema language.
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Model-Driven Architecture</title>
      <p>
        Model-Driven Architecture (MDA) [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] is a general approach to modeling
software systems and can be profitably applied to data modeling as well. MDA
distinguishes several types of models that are used for modeling at different
levels of abstraction. For this paper, two types of models are important. A
Platform-Independent Model (PIM) allows modeling data at the conceptual level.
A PIM diagram is abstracted from a representation of the data in concrete data
models such as relational or XML. A Platform-Specific Model (PSM) models
how the data is represented in a target data model. For each target data model
(such as XML), we need a special PSM that is able to capture its implementation
details. A PSM diagram then models a representation of the problem domain in
this particular target data model, it provides a mapping between the conceptual
diagram and a target data model schema.
2.3
      </p>
      <p>
        UML class diagrams
A large number of approaches to reverse-engineering use UML class diagrams
as a PIM. Basically, it consists of classes representing concepts, associations
representing relations and attributes of classes, representing properties of the
concepts. For a more detailed description see [
        <xref ref-type="bibr" rid="ref21 ref22">21, 22</xref>
        ].
2.4
      </p>
      <p>
        Schema matching
Most of the reverse engineering approaches use some methods of schema
matching. They include string comparisons, data type compatibility measurements,
structural similarity measurements and linguistic resources like thesauri and
dictionaries. These methods are surveyed in detail in [
        <xref ref-type="bibr" rid="ref10 ref26">26, 10</xref>
        ]. There is one
major difference between XML schema matching and reverse-engineering of XML
schemas to conceptual models. XML schema matching usually works with two
different XML schemas (written in XML schema languages) and the goal is to
find mappings of components of one schema to the components of the second
schema. On the other hand, reverse-engineering of XML schemas works with
one XML schema and optionally a conceptual model. The goal is either to
create the conceptual model when there is none, or to find appropriate mappings
of the XML schema components to the components of the model, which can be
written in e.g. UML, and therefore is of a whole different type.
3
      </p>
      <p>Framework for evolution and integration of</p>
      <p>XML schemas
In this section, we introduce our framework for evolution and integration of
XML schemas. It comprises six levels, each representing a different view of an
XML system and its evolution. The framework is depicted in Figure 1. The lowest
level, called extensional level, represents XML documents. Its parent level, called
operational level, represents operations over XML documents, i.e. XML queries.
The level above is called schema level and represents XML schemas that describe
the structure of the XML documents.
Ontology</p>
      <p>PIM diagram
PSM diagram 1
. . .</p>
      <p>PSM diagram i
. . .</p>
      <p>PSM diagram n</p>
      <p>XML schema i
(DTD, XML Schema, Relax NG, …)</p>
      <p>XML s.ch.e.ma n
XMLXXqMuMeLLries
ddooccuummeennttss</p>
      <p>XMLXXqMuMeLLries
ddooccuummeennttss</p>
      <p>XMLXXqMuMeLLries
ddooccuummeennttss</p>
      <p>XXXMMMLLL
dddoooccucuummmeeennnttstss</p>
      <p>XXXMMMLLL
dddoooccucuummmeeennnttstss</p>
      <p>
        The platform-independent and platform-specific levels follow MDA [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] which
is based on modeling the problem domain on different levels of abstraction.
The platform-independent level represents the whole problem domain. It
consists of a conceptual model that specifies the problem domain independently
of its representation in the XML formats below. We call the conceptual model
a platform-independent model (PIM ) of the problem domain. The level below,
called platform-specific level, represents mappings of the problem domain to
particular XML formats. For each XML format it comprises a model of mapping
of a selected part of the PIM to XML element and attribute declarations. We
call this model platform-specific model (PSM ) of the selected XML format.
Recently, a number of approaches translating XML schemas to an ontology have
appeared. The ontology level is the topmost in our framework.
      </p>
      <p>In our framework, components in documents on individual levels can be
formally binded with components in documents on the neighboring levels. These
bindings can then be used for evolution of the conceptual model, XML schemas,
XML documents and queries. They provide means for automatic detection of all
the places on all the levels where a single change has an impact.
4</p>
      <p>Comparison criteria
We will firstly introduce several criteria which we will later use to compare
current approaches to reverse-engineering of XML schemas. In particular, we
will focus on the following criteria:
1. Target model - on what level of our framework does the target model of
the reverse-engineering method belong. This criterion is important, because
lots of methods only visualize the XML schema in another model (e.g. UML
class diagrams) and therefore their target is on the platform-specific level
(it is a PSM in our framework). A true conceptual model on the
platformindependent level (a PIM in our framework) should be independent of the
target implementation completely. If it is a conceptual model bent for a
specific implementation, it is in fact a PSM. Recently, some approaches to
mapping an XML schema directly to an ontology (the top level of our
framework) have appeared. This criterion distinguishes among these three types
of target models.
2. Number of schemas supported by the model - whether the method is limited
to only one schema, or whether it can reverse-engineer multiple schemas to
one model. This is also very important, because to be able to manage a set
of XML schemas, it is not enough to have a separate model for each schema.</p>
      <p>We need all the schemas to be related to one model.
3. XML schema languages supported - DTD, XML Schema, Relax NG,
Schematron, etc. As can be seen from our framework, the conceptual model can be
and should be independent of the actual XML schema language used on the
schema level, because the target data model (which a PSM should represent)
is XML itself, not a specific XML schema language.
4. Mapping to an existing model - whether the method can map a schema to
an already existing model or whether it can only generate a new model.
This criterion is paramount for evaluating the possibility of integrating an
approach to a bigger system for evolution and integration of XML schemas.
If it only can create a new model for each input, it cannot be used if we
already have the model and only want to add a new XML schema to the
system.
5. Level of user involvement - methods can be automatic or semi-automatic
(relying on human intervention). It is impossible to infer a conceptual diagram
automatically, as so far only a human can determine if two objects represent
the same concept. And because we need an exact and reliable match, if we
want to use an approach as a part of a system for integration of XML data,
we can not rely on an automatic translation (mapping) and we need the user
to at least confirm a match.
6. Evolution support - whether the approach is a part of a system that also
supports evolving the schema once it is integrated into the system (when the
system changes). This criterion indicates, whether the method is developed
by itself, or if it already is a part of a system that also helps with the
evolution of the mapped schemas (e.g. by preserving the bindings between
levels of our framework)
5</p>
      <p>Approaches to mapping to user-friendly models
In this section we evaluate different approaches to reverse-engineering of XML
schemas to various more user-friendly models.
5.1</p>
      <sec id="sec-4-1">
        <title>Yang Weidong et al.</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ], there is an algorithm for automatic generation of UML class diagrams
(PIMs in our framework) from DTDs according to MDA using a DTD graph
as a PSM. The authors also claim that they can generate UML class diagrams
from XSDs, but the prototype implementation is not freely available, so it
cannot be verified. The main drawback of this approach is that it only serves as
an automatic translator from DTD to UML meant to make the schema more
understandable to people who do not know DTD or XML Schema.
        </p>
        <p>This approach does not support mapping of multiple XML schemas to the
PIM and it does not preserve any mappings between the PIM and the PSM nor
between the PSM and the DTD. It is automatic, limited to DTD only and it
cannot map a schema to an existing model. The bright side is that it actually
uses both PIM and PSM correctly.
5.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Mikael R. Jensen et al.</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], another method of automatic conversion of DTDs to UML class diagrams
is presented. Again, it is meant for easier browsing of XML data available on the
Internet and to ease the work of a data integrator. In contrast with the previous
method, the UML diagrams reflect the structure of the DTD an therefore are
only on the PSM level.
        </p>
        <p>This approach does not support mapping of multiple XML schemas and it
does not preserve any mappings between the PSM and the DTD. It is automatic,
limited to DTD only and it cannot map a schema to an existing model.
5.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>DIXSE framework</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ], a semi-automatic method of deriving a semantic model from several
DTDs is presented. By default, for each element of every DTD a new element
is created in the model. This process is done automatically. If the user wants to
create a more meaningful model (e.g. wants all Address elements to be mapped
to one element of the model), a rule written in their DIXml language extending
the element in the DTD must be created manually. The model is a PSM as it
still preserves the DTD structure. It uses the Telos [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] metamodeling language.
        </p>
        <p>This approach supports semi-automatic mapping of multiple schemas to a
conceptual model, but it does not preserve any mappings between the PSM and
the DTDs and it is limited to DTD only. It can map a new DTD to an existing
model.
5.4</p>
      </sec>
      <sec id="sec-4-4">
        <title>Xyleme</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ], a project called Xyleme is described. Its focus is to provide a unified view
of a large number of heterogeneous XML documents described by DTDs. This
enables the user to perform queries on one unified model (called an abstract DTD),
to which all the other DTDs describing the XML documents are mapped
automatically. The methods for discovery of mappings are mainly language based
(thesauri, discovery of synonyms, abbreviations, etc.). A semi-automatic
prototype implementation called SAMAG was used to evaluate the approach. In
SAMAG, a user needs to validate each syntactic relationship detected.
        </p>
        <p>This approach supports semi-automatic mapping of multiple schemas to a
model which is a PSM. It preserves no mapping between the PSM and the
DTDs. It is limited to DTD only.
5.5</p>
        <sec id="sec-4-4-1">
          <title>XTM - XML Tree Model</title>
          <p>
            In [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ], a conceptual model for XML called XTM - XML Tree Model is proposed,
including an algorithm for reverse-engineering of XML Schema into XTM. It
also has a strong theoretical background. However, it is still only a PSM in our
framework.
          </p>
          <p>This approach automatically visualizes one schema at a time, is limited to
XML Schema and does not maintain any mappings between the PSM and the
schemas.
5.6</p>
        </sec>
        <sec id="sec-4-4-2">
          <title>Neˇcasky´</title>
          <p>
            In [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ], a complex approach to the process of reverse engineering of XML schemas
to a conceptual model is presented. The model follows MDA as it uses UML class
diagrams as a PIM and their extension as PSMs. It is further described in [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ].
Because the model has two levels, the process is divided into two parts. The
first part is an automatic translation of an XML schema to a PSM. The PSMs
are, however, independent of any specific XML schema language; the approach
is presented using XML Schema. The second part is a semi-automatic algorithm
for the reconstruction of mappings between the PSM and a PIM, but it has
some drawbacks. The most problematic one is the computational cost which is
up to mn, where m is a maximum number of outgoing PIM associations from one
PIM class and n is the number of PIM classes in the model. Therefore in practice,
the algorithm will not work if the PIM diagram contains a bigger number of
associations. Nevertheless, the algorithm uses maximum of information that we
can get from a PSM and thus can offer the best results. An implementation in an
experimental stage is available in the development version of XCase [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ], which
is a tool implementing the conceptual model and its evolution.
          </p>
          <p>This approach supports semi-automatic reverse-engineering to PSMs and to a
PIM. It supports multiple schemas, is independent of any specific XML schema
language and it can also map to an existing conceptual model. It maintains
mappings between the PIM and the PSM and therefore support further schema
evolution.</p>
          <p>
            Approaches to mapping to ontologies
Recently, a number of methods of reverse-engineering of XML schemas to
ontologies have appeared. The main difference between a PIM and an ontology is
that ontologies are more expressive, as the relations they capture can be more
complex and they may even involve logical formulae. A frequent language for
ontologies is OWL [
            <xref ref-type="bibr" rid="ref30">30</xref>
            ].
In [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ], a method for integration of XML schemas into an ontology is presented.
At first, each input DTD is semi-automatically transformed into a so called CCM
- Canonic Conceptual Model. It combines the ER [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ] and ORM [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ] models. A
default transformation is made and a user is then allowed to make adjustments
where needed. The CCMs are PSMs in our framework. Then, each CCM is
integrated (again semi-automatically - user has to verify/adjust) to form the final
ontology. The mappings between the individual CCM components and the
components of the ontology are preserved. Although the authors call it an ontology,
it is in fact more of a conceptual diagram - a PIM in our framework, because
it still is a CCM, only independent of the XML structure. This method only
provides a unified view of the integrated data so far. No implementation was
mentioned.
          </p>
          <p>This approach supports semi-automatic mapping of multiple DTDs to
corresponding PSMs and to a PIM. It is restricted to DTD only. It preserves mappings
between PSMs and a PIM.</p>
          <p>
            In [
            <xref ref-type="bibr" rid="ref32">32</xref>
            ], an algorithm is proposed to match given XML document elements to
given ontology concepts to achieve an integrated view of multiple XML
documents, when matched to the same ontology. It is based on automatic structural
matching of a DTD tree to an ontology tree. However, a precondition is that
a domain expert has provided a table of synonyms, i.e. a list of semantically
matching strings from the DTD and from the ontology (which seems to be a
strong precondition).
          </p>
          <p>This in fact semi-automatic approach maps multiple DTDs, it is limited to
DTD only and it does not preserve any mappings between the DTDs and the
ontology. It can only map to an existing ontology.
6.3</p>
          <p>Bedini et al.</p>
          <p>
            In [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ], a general architecture of building ontologies from XML schemas is
presented. However, no specific methods are suggested, only a sequence of tasks
that a tool for ontology building should follow and also a set of rules according
to which concepts and their relations can be extracted from a XML Schema.
The general architecture takes the need for consecutive schema evolution into
account. A semi-automatic prototype implementation called Janus is presented
briefly. It requires human assistance for merging of similar concepts and it does
not preserve any mappings between the schemas and the ontology.
          </p>
          <p>This approach is semi-automatic, it is limited to XML schema and it does
not preserve any mappings between the schemas and the ontology. It also only
creates new ontologies.
6.4</p>
          <p>
            DTD2OWL
In [
            <xref ref-type="bibr" rid="ref29">29</xref>
            ], a method of automatic translation of DTD to OWL ontology is suggested.
In addition, this method transforms the actual XML documents into OWL
individuals. The authors suggest that the whole web should be transformed this
way. This method is pure translation of one DTD to one OWL ontology with no
support for schema evolution nor conceptual modeling.
          </p>
          <p>This approach is automatic, it is limited to DTD and it can only map one
DTD to one new ontology, not preserving any mappings.
7</p>
          <p>Summary
In this section, a brief summary of evaluated approaches and the comparison
criteria is given.</p>
          <p>5.1
5.2
5.3
5.4
5.5
5.6
6.1
6.2
6.3
6.4</p>
          <p>Model Schemas Languages Automatic
PSM One DTD Yes
PSM One DTD Yes
PSM Multiple DTD No
PSM Multiple DTD Yes
PSM One XSD Yes
PIM Multiple Anya No
PIM Multiple DTD No</p>
          <p>O Multiple DTD No
O Multiple XSD No
O One DTD Yes
Maps to</p>
          <p>New</p>
          <p>New
New, Existing
New, Existing</p>
          <p>New
New, Existing
New, Existing</p>
          <p>Existing</p>
          <p>New
New</p>
          <p>Evolution</p>
          <p>No
No
No
No
No
Yes
Yes
No
No
No
a This method is not limited to any XML schema language. Currently implemented
for XML Schema</p>
          <p>Table 1. Overview of approaches according to various criteria</p>
          <p>Let us review the comparison criteria used (the columns in Table 1 correspond
to them):
1. Whether the target of the approach is a PIM, PSM or an ontology
2. Whether the approach is limited to only one schema or whether it can handle
multiple schemas
3. Which XML schema languages can the approach handle
4. Whether the method is a pure automatic translation or whether the process
is semi-automatic - a user is involved
5. Whether the method creates a new target model or whether it can use an
existing one
6. Whether the method preserves mappings between the schemas and the target
model, which can be used for consecutive schema evolution
The best suitable method for our intention of creating a system for management
of XML schema evolution and integration is 5.6. However, it has some issues with
computational complexity, which need to be resolved before its implementation.
8</p>
          <p>Conclusion
In this paper, we have compared and evaluated several approaches for
reverseengineering of XML schemas according to given comparison criteria. Among
them, only one was well suited for being a part of a larger system for evolution
and integration of XML schemas. Also, this survey showed a severe lack of
support for newer XML schema languages like Relax NG or Schematron in the area
of conceptual modeling of XML and reverse-engineering of XML schemas.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>R.</given-names>
            <surname>Al-Kamha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Embley</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S. W.</given-names>
            <surname>Liddle</surname>
          </string-name>
          .
          <article-title>Augmenting Traditional Conceptual Models to Accommodate XML Structural Constructs</article-title>
          .
          <source>In Proceedings of 26th International Conference on Conceptual Modeling</source>
          , pages
          <fpage>518</fpage>
          -
          <lpage>533</lpage>
          , Auckland, New Zealand, Nov.
          <year>2007</year>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>A.</given-names>
            <surname>Badia</surname>
          </string-name>
          .
          <article-title>Conceptual Modeling for Semistructured Data</article-title>
          .
          <source>In Proceedings of the 3rd International Conference on Web Information Systems Engineering Workshops</source>
          , pages
          <fpage>170</fpage>
          -
          <lpage>177</lpage>
          , Singapore, Dec.
          <year>2002</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>I.</given-names>
            <surname>Bedini</surname>
          </string-name>
          , G. Gardarin, and
          <string-name>
            <given-names>B.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          .
          <article-title>Deriving Ontologies from XML Schema</article-title>
          . CoRR, abs/1001.4901,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>R.</given-names>
            <surname>Bourret</surname>
          </string-name>
          .
          <article-title>XML and Databases</article-title>
          .
          <source>September</source>
          <year>2005</year>
          . http://www.rpbourret.com/ xml/XMLAndDatabases.htm.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>P.</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <article-title>The Entity-Relationship Model-Toward a Unified View of Data</article-title>
          .
          <source>ACM Transactions on Database Systems</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <fpage>9</fpage>
          -
          <lpage>36</lpage>
          , Mar.
          <year>1976</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>J.</given-names>
            <surname>Clark</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. Makoto. RELAX NG</given-names>
            <surname>Specification. Oasis</surname>
          </string-name>
          ,
          <year>December 2001</year>
          . http: //www.oasis-open.org/committees/relax-ng/spec-20011203.html.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>D.</given-names>
            <surname>Booth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. K.</given-names>
            <surname>Liu. Web Services Description</surname>
          </string-name>
          <article-title>Language (WSDL) Version 2.0 Part 0: Primer</article-title>
          . W3C,
          <year>June 2007</year>
          . http://www.w3.org/TR/wsdl20-primer/.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. R. dos Santos Mello and
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Heuser</surname>
          </string-name>
          .
          <article-title>A Bottom-Up Approach for Integration of XML Sources</article-title>
          .
          <source>In Workshop on Information Integration on the Web</source>
          , pages
          <fpage>118</fpage>
          -
          <lpage>124</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>J.</given-names>
            <surname>Fong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Cheung</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Shiu</surname>
          </string-name>
          .
          <article-title>The XML Tree Model - toward an XML conceptual schema reversed from XML Schema Definition</article-title>
          . Data Knowl. Eng.,
          <volume>64</volume>
          (
          <issue>3</issue>
          ):
          <fpage>624</fpage>
          -
          <lpage>661</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. H. Hai, Do. Schema Matching and
          <article-title>Mapping-based Data Integration: Architecture, Approaches and Evaluation</article-title>
          . VDM Verlag, Saarbru¨cken, Germany, Germany,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. T. Halpin and
          <string-name>
            <given-names>T.</given-names>
            <surname>Morgan</surname>
          </string-name>
          . Information Modeling and
          <string-name>
            <given-names>Relational</given-names>
            <surname>Databases</surname>
          </string-name>
          . Morgan Kaufmann Publishers Inc., San Francisco, CA, USA,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>ISO. Information Technology Document Schema Definition</surname>
          </string-name>
          <article-title>Languages (DSDL) Part 3: Rule-based Validation Schematron</article-title>
          .
          <source>ISO/IEC 19757-3</source>
          , feb
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>M. R. Jensen</surname>
            ,
            <given-names>T. H.</given-names>
          </string-name>
          <string-name>
            <surname>Møller</surname>
            , and
            <given-names>T. B.</given-names>
          </string-name>
          <string-name>
            <surname>Pedersen</surname>
          </string-name>
          .
          <article-title>Converting XML Data to UML Diagrams For Conceptual Data Integration</article-title>
          .
          <source>In In: 1st International Workshop on Data Integration over the Web (DIWeb) at 13th Conference on Advanced Information Systems Engineering (CAISE01)</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. J. Kl´ımek, L. Kopenec,
          <string-name>
            <given-names>P.</given-names>
            <surname>Loupal</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Maly</surname>
          </string-name>
          <article-title>´. XCase - A Tool for Conceptual XML Data Modeling</article-title>
          .
          <source>In Advances in Databases and Information Systems</source>
          , volume
          <volume>5968</volume>
          /2010 of Lecture Notes in Computer Science, pages
          <fpage>96</fpage>
          -
          <lpage>103</lpage>
          . Springer Berlin / Heidelberg, March
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>B.</given-names>
            <surname>Loscio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Salgado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Galvao</surname>
          </string-name>
          .
          <article-title>Conceptual Modeling of XML Schemas</article-title>
          .
          <source>In Proceedings of the Fifth ACM CIKM International Workshop on Web Information and Data Management</source>
          , pages
          <fpage>102</fpage>
          -
          <lpage>105</lpage>
          , New Orleans, USA, Nov.
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>M.</given-names>
            <surname>Mani</surname>
          </string-name>
          .
          <article-title>Erex: A conceptual model for xml</article-title>
          .
          <source>In Proceedings of the Second International XML Database Symposium</source>
          , pages
          <fpage>128</fpage>
          -
          <lpage>142</lpage>
          , Toronto, Canada, Aug.
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>J.</given-names>
            <surname>Miller</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Mukerji</surname>
          </string-name>
          .
          <source>MDA Guide Version 1.0</source>
          .1. Object Management Group,
          <year>2003</year>
          . http://www.omg.org/docs/omg/03-06-01.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>J. Mylopoulos</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Borgida</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Jarke</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Koubarakis</surname>
          </string-name>
          .
          <article-title>Telos: representing knowledge about information systems</article-title>
          .
          <source>ACM Trans. Inf</source>
          . Syst.,
          <volume>8</volume>
          (
          <issue>4</issue>
          ):
          <fpage>325</fpage>
          -
          <lpage>362</lpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>M. Neˇcasky</surname>
          </string-name>
          <article-title>´. Conceptual Modeling for XML, volume 99 of Dissertations in Database and Information Systems Series</article-title>
          . IOS Press/AKA Verlag,
          <year>January 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>M. Neˇcasky</surname>
          </string-name>
          <article-title>´. Reverse Engineering of XML Schemas to Conceptual Diagrams</article-title>
          .
          <source>In Proceedings of The Sixth Asia-Pacific Conference on Conceptual Modelling</source>
          , pages
          <fpage>117</fpage>
          -
          <lpage>128</lpage>
          , Wellington, New Zealand,
          <year>January 2009</year>
          . Australian Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. Object Management Group.
          <source>UML Infrastructure Specification 2.1</source>
          .2, nov
          <year>2007</year>
          . http://www.omg.org/spec/UML/2.1.2/Infrastructure/PDF/.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22. Object Management Group.
          <source>UML Superstructure Specification 2.1</source>
          .2, nov
          <year>2007</year>
          . http://www.omg.org/spec/UML/2.1.2/Superstructure/PDF/.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>G.</given-names>
            <surname>Psaila. ERX</surname>
          </string-name>
          :
          <article-title>A Conceptual Model for XML Documents</article-title>
          .
          <source>In Proceedings of the 2000 ACM Symposium on Applied Computing</source>
          , pages
          <fpage>898</fpage>
          -
          <lpage>903</lpage>
          , Como, Italy, Mar.
          <year>2000</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>C. Reynaud</surname>
            ,
            <given-names>J.-P.</given-names>
          </string-name>
          <string-name>
            <surname>Sirot</surname>
            , and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Vodislav</surname>
          </string-name>
          .
          <article-title>Semantic integration of xml heterogeneous data sources</article-title>
          .
          <source>In IDEAS '01: Proceedings of the International Database Engineering &amp; Applications Symposium</source>
          , pages
          <fpage>199</fpage>
          -
          <lpage>208</lpage>
          , Washington, DC, USA,
          <year>2001</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25. P. Rodr´ıguez-Gianolli and
          <string-name>
            <given-names>J.</given-names>
            <surname>Mylopoulos</surname>
          </string-name>
          .
          <article-title>A Semantic Approach to XML-based Data Integration</article-title>
          .
          <source>In ER '01: Proceedings of the 20th International Conference on Conceptual Modeling</source>
          , pages
          <fpage>117</fpage>
          -
          <lpage>132</lpage>
          , London, UK,
          <year>2001</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <given-names>P.</given-names>
            <surname>Shvaiko</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Euzenat</surname>
          </string-name>
          .
          <article-title>A survey of schema-based matching approaches</article-title>
          .
          <source>Journal on Data Semantics</source>
          ,
          <volume>4</volume>
          :
          <fpage>146</fpage>
          -
          <lpage>171</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <given-names>T.</given-names>
            <surname>Bray</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Paoli</surname>
          </string-name>
          and
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Sperberg-McQueen</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Maler</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Yergeau. Extensible Markup</surname>
          </string-name>
          <article-title>Language (XML) 1.0 (Fourth Edition)</article-title>
          .
          <source>W3C</source>
          ,
          <year>September 2006</year>
          . http://www.w3.org/TR/REC-xml/.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <given-names>H. S.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Beech</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Maloney</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Mendelsohn</surname>
          </string-name>
          .
          <source>XML Schema Part</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Structures (Second Edition). W3C</surname>
          </string-name>
          ,
          <year>October 2004</year>
          . http://www.w3.org/TR/ xmlschema-1/.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29. P. T. T. Thuy,
          <string-name>
            <given-names>Y.-K.</given-names>
            <surname>Lee</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>DTD2OWL: Automatic Transforming XML Documents into OWL Ontology</article-title>
          .
          <source>In ICIS '09: Proceedings of the 2nd International Conference on Interaction Sciences</source>
          , pages
          <fpage>125</fpage>
          -
          <lpage>131</lpage>
          , New York, NY, USA,
          <year>2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30. W3C OWL Working Group.
          <article-title>OWL 2 Web Ontology Language</article-title>
          . W3C,
          <year>October 2009</year>
          . http://www.w3.org/TR/owl2-overview/.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <given-names>Y.</given-names>
            <surname>Weidong</surname>
          </string-name>
          , G. Ning, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Baile</surname>
          </string-name>
          . Reverse Engineering XML. Computer and Computational Sciences,
          <string-name>
            <surname>International</surname>
          </string-name>
          Multi-Symposiums on,
          <volume>2</volume>
          :
          <fpage>447</fpage>
          -
          <lpage>454</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32. L.
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Zhang</surname>
            , G. Huang, and
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Shi</surname>
          </string-name>
          .
          <article-title>Automatic Mapping from XML Documents to Ontologies</article-title>
          .
          <source>In CIT '04: Proceedings of the The Fourth International Conference on Computer and Information Technology</source>
          , pages
          <fpage>321</fpage>
          -
          <lpage>325</lpage>
          , Washington, DC, USA,
          <year>2004</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>