<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>⋆ CCoonncceeppttuuaall MMooddeelliinngg ffoorr XXMMLL:: AA SSuurrvveeyy∗</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martin Necasky Martin Neˇcasky´</string-name>
          <email>martin.necasky@mff.cuni.cz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Charles University, Faculty of Mathematics and Physics</institution>
          ,
          <addr-line>CMhaalrolsetsraUnnsikveernsaitmy,. F2a5c,u1l1ty8 o0f0MParathheam1a,tCiczsecahndRPephuybsilcics, Malostranskemanratmin..2n5e</addr-line>
        </aff>
      </contrib-group>
      <fpage>40</fpage>
      <lpage>53</lpage>
      <abstract>
        <p>Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. However, XML as a logical database model has some special characteristics which makes existing conceptual models as E-R or UML unsuitable. I n this paper, the current approaches to the conceptual modeling of XML data are described in an uniform style. A list of requirements for XML conceptual models is presented and described approaches are compared on the base of the requirements.</p>
      </abstract>
      <kwd-group>
        <kwd>conceptual modeling</kwd>
        <kwd>XML</kwd>
        <kwd>XML Schema</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Today XML is used for the exchange of data between information systems and
it is frequently used as a logical database model for storing data into databases.
If we use XML as a logical database model we need a conceptual model for
modeling XML data. There is the Entity-Relationship (E-R) [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] model for the
conceptual modeling of relational data. However, XML as a logical database
model has some special differences which makes the E-R model u nsuitable for
the conceptual modeling of XML data. The main differences are the following:
– hierarchical structure
– irregular structure
– ordering on siblings
– mixed content
      </p>
      <p>
        These features cannot be properly modeled in the E-R model. T here are
some approaches, for example Extended E-R [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], EReX [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], EE R [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], XER
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], ERX [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], and C-XML [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], trying to extend the E-R model t o be suitable
for the conceptual modeling of XML data. It is possible to extend the E-R model
to model ordering, mixed content and irregular structure of XML data. However,
there is a problem with the modeling of a hierarchical structure of XML data.
⋆ This paper was supported by the National programme of research (Information
society project 1ET100300419)
      </p>
      <p>Suppose an E-R diagram with a relationship type Enroll between two entity
types Student and Course representing courses enrolled by students. Each
student may enroll zero or more courses and each course may be enrolled by zero
or more students. The diagram is shown in Figure 1(a).
Figures 1(b), (c), and (d) show possible representations of the relationship
type in a hierarchical structure. Oriented arrows denote a nesting. There is not
the best nesting of the concepts. The nesting of courses into students illustrated
by Figure 1(b) is suitable when we need to see students and the courses they
enrolled. The nesting of students into courses illustrated by Figure 1(c) is suitable
when we need to see courses and the students enrolled in them.</p>
      <p>The previous example shows another difference between the conceptual level
of XML and the E-R model. This difference is not in the structur e but it is
in the usage of XML. It is shown that there may be many ways of how to use
entity types connected together by a relationship type. If we represent data in
the form of XML, each of these ways may require another hierarchical ordering
of the entities. However, this feature cannot be effectively modeled by the E-R
model.</p>
      <p>
        Another possibility of how to model XML data is to start from a hierarchical
structure. This approach may be called the hierarchical approach. There are
conceptual models based on the hierarchical approach, for example X-Entity
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], ORA-SS [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and Semantic Networks for XML [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The base of a schema in
the hierarchical approach is a tree, whose nodes are entity types and edges are
relationship types between entity types. Figures 1(b), (c), and (d) show examples
of a basic hierarchical schemata.
      </p>
      <p>The hierarchical approach is able to solve the mentioned problem with
different views of the same data. For each of the views there is a separate tree.
However, a problem with the modeling of attributes of relationship types or with
the modeling of n-ary relationship types, effectivelly solved in the E-R mode l,
arises. Another problem arises when deciding which of hierarchical organizations
of the same data is the best to select as the basic organization used for the data
storage.</p>
      <p>
        The goal of this paper is to describe the existing conceptual models for XML
based on the E-R model and on the hierarchical approach. Ther e are approaches
based on the UML (Unified Modeling Language) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and ORM (Object Role
Modeling) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] models, too. However, we do not describe them in this paper. We
propose a list of requirements for conceptual models for XML and compare the
described models against the requirements. The main contributions of this paper
are the unified descriptions of the conceptual models and the comparison of the
models against the list of requirements. This paper is an abbreviated version of
the full paper [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] where all the conceptual models mentioned in this paper are
compared in detail and described in an unified formalism.
      </p>
      <p>Section 2 introduces the list of requirements for conceptual models for XML.
Section 3 describes representatives of the conceptual models for XML based on
the well-known E-R model. Section 4 describes representati ves of the hierarchical
conceptual models for XML. Section 5 compares the described conceptual models
against the requirements introduced in Section 2.
2</p>
      <p>Requirements for Conceptual Models for XML
Requirements for conceptual models for XML are summarized in this section.
There are two groups of the requirements described. The first group consists of
general requirements covering general goals of the XML conceptual modeling.
The second group consists of modeling constructs requirements covering
requirements on what kinds of modeling constructs should XML conceptual models
support.
2.1</p>
      <sec id="sec-1-1">
        <title>General Requirements</title>
      </sec>
      <sec id="sec-1-2">
        <title>Independence on XML schema languages The conceptual model should</title>
        <p>
          be independent on a certain XML schema language (XML Schema [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], DTD,
. . .). The constraints given by a certain XML schema language should not
be propagated to the conceptual level.
        </p>
        <p>Formal foundations The modeling constructs of the conceptual model should
be described formally, which allows to compare the model with other
conceptual models or to describe the operations on the model structures and
modeled data (for example, data transformation between two conceptual
schemata or their integration).</p>
        <p>Graphical notation A user-friendly graphical notation for the formal
modeling constructs should be offered by the conceptual model.</p>
        <p>
          Logical level mapping There should be algorithms for mapping of the
conceptual modeling constructs to the XML logical level. The logical schema
should implement as many integrity constraints arised from the conceptual
schema as possible. It may require the usage of more than one XML schema
language for the logical level description (XML Schema and Schematron [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ],
for example). The hierarchical structure of the XML data should be utilized
as much as possible on the logical level.
        </p>
        <p>
          Different structures on the logical level The XML logical level is
hierarchical. However, there are different users with different requirements
accessing the modeled data on the logical level. Hence, there can be different
hierarchical views of the same data. Each of the views suits to different
requirements. It should be possible to model the different hierarchical views
on the conceptual level and translate them to the corresponding views on
the logical level. Moreover, there should be algorithms allowing automatic
translation of data from one logical view to another logical view (using XSLT
[
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], for example).
        </p>
        <p>
          Semantic web mapping With the increasing usage of the semantic web
technologies the problem of publishing data in the form of RDF [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] triples
described by RDF Schema [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] or OWL [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] arises. One possible solution is
to have the data internally represented in the form of XML and translate
them to the RDF triples represented in the form of RDF/XML [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]
utilizing XSLT. The conceptual model for XML should consider this problem. It
would be useful to have algorithms for the translation from the conceptual
level to the semantic web level where the structures from the conceptual level
are described using OWL. It would allow companies to publish their
internally represented data on the semantic web and, backwards, to obtain data
from the semantic web and integrate them to the internal representation
automatically.
2.2
        </p>
      </sec>
      <sec id="sec-1-3">
        <title>Modeling Constructs Requirements</title>
        <p>Hierarchical structure Although it can be useful to keep a document
designer out of the hierarchical structure of XML data on the conceptual level,
the conceptual model should offer modeling constructs for modeling nesting
explicitly. For example, aggregation relationship types can be used.
However, non-hierarchical relationship types (for example, a ssociation
relationship types or references) should be offered too. The conceptual model should
introduce contructs for modeling a recursive structure.</p>
        <p>Cardinality for all participants The hierarchical structure of XML data
restricts the specification of cardinality constraints only to the nested
participants of the relationship type. However, it should be possible to specify
cardinality constraints for the all participants on the conceptual level.
N -ary relationship types For the same reason, the modeling of n-ary
relationship types and their translation to the XML logical level is problematic.
However, it should be possible to model n-ary relationship types on the
conceptual level.</p>
        <p>Attributes of relationship types For the same reason again, the modeling
of attributes of relationship types is problematic. Nor the nesting nor the
concept of referential integrity on the XML logical level do not allow to
directly express attributes of relationship types. However, the conceptual
model should allow to model attributes of relationship types.</p>
        <p>Ordering XML is ordered and this property should be propagated to the
conceptual level. It should be possible to express the ordering on values of
attributes, the ordering on concepts connected with another concept (for
example, a book has a title page first, followed by an abstract, chapters,
appendixes and a bibliography in this order) and the ordering on a participant
of a relationship type (for example, the list of authors of a book or the list
of chapters of a book are ordered).</p>
      </sec>
      <sec id="sec-1-4">
        <title>Irregular and heterogeneous structure XML data may have irregular and</title>
        <p>heterogeneous structure. The conceptual model should introduce constructs
for modeling such a structure. For example, variant-valued constructors for
constructing attributes or disjunctive relationship types should be
introduced.</p>
        <p>
          Document-centric data The difference between the conceptual models for
XML and the other conceptual models is that the conceptual models for
XML must allow to model document-centric data. It means that not only
the real-world objects with attributes and relationships b ut also the certain
parts of documents are modeled on the conceptual level. Hence, there should
be corresponding modeling constructs offered by the conceptual model. It
means to allow attributes and relationships of a given concept to be mixed
with a text when represented in a document content. However, the mixed
content should not be restricted as it is restricted by XML Schema. Some
form of generalized mixed content should be introduced allowing to specify
where the text values may appear exactly (as it is possible in Relax NG [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]
schemata, for example).
        </p>
        <p>Reuse of content The reuse of content should be supported by the
conceptual model. For example, the concept inheritance (modeled by IS-A
relationship types in E-R, for example) supports the reuse of con tent. However,
the conceptual model may be inspired in the XML Schema language and
may support named types and named groups of concepts on the conceptual
level.</p>
        <p>Integration of conceptual schemata XML data are often used for the data
integration. However, it can not be done effectivelly and automatically
without the support on the conceptual level. A conceptual model for XML should
offer modeling constructs to support an integration of schemata on the
conceptual level and it should allow to merge different conceptual schemata to
an overall conceptual schema. Further, it would be useful to generate XSLT
transformation scripts to translate data corresponding to one conceptual
schema to data corresponding to another conceptual schema.
3</p>
        <p>
          ER- Based Conceptual Models for XML
In this section, we describe two representatives of the conceptual models for
XML based on the E-R model. The first representative is the Ext ended E-R
model [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] and the second representative is the EReX model [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
      </sec>
      <sec id="sec-1-5">
        <title>Extended E-R Model (by Antonio Badia)</title>
        <p>
          Extended E-R model proposed by Badia in [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] is a minimalistic extension to the
E-R model. The extension is based on the idea of integration o f structured and
semistructured data where an overall conceptual schema is needed. Moreover,
the author proposes algorithms for the translation of E-R sc hemata to relational
schemata and to DTD schemata. Further, he studies the utilization of
combination of relational schemata and DTD schemata for a data representation. The
author proposes the following DTD based extensions to the E- R model.
– Each attribute is marked as optional or required. If an entity type has an
optional attribute its entities may or may not have a value of the attribute.
If an entity type has a required attribute its entities must have a value of
the attribute.
– A choice between two or more attributes called choice attribute can be
modeled. A choice attribute can be inclusive or exclusive. If an entity type has an
inclusive choice attribute its entities may have values of one or more of the
attributes in the choice. If an entity type has an exclusive choice attribute
its entities may have only a value of one of the attributes in the choice.
        </p>
        <p>In a graphical representation, an optional attribute is connected to the
corresponding entity type by a solid line with two dashes crossing it. A choice of
attributes is expressed by marking the choice with an upward triangle, with the
choices in the opposed side of the triangle.</p>
        <p>
          Figure 2 displays the entity type Student having the optional attribute
phone and the exclusive choice attribute involving the attributes hosteladdr
and homeaddr, i.e. each student has a hostel address or a home address but not
both.
EReX is an extension to the E-R model proposed by Mani in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. The author
introduces the following extensions to the E-R model:
– Categorization of entity types can be modeled using category relationship
types. Category relationship types are a special kind of binary relationship
types similar to IS-A relationship types from the well-know n E-R model. A
category relationship type is displayed by an arrow with the label CAT going
from its category entity type to its categorizied entity type. The difference
between the IS-A relationship types and the category relati onship types is
that a categorized entity type may have an empty key (i.e. an entity type
with an empty key must be categorized). Moreover the integrity constraints
called coverage constraints can be specified on categorizied entity types.
– Total and exclusive coverage constraints can be specified for categories and
for roles of entity types in relationship types. A total coverage constraint
specifies that the union of sets of instances of all included categories or roles
must be the same as a set of instances of the categorizied entity type or the
entity type with the included roles. An exclusive coverage constraint specifies
the disjunction between the sets of instances of the included categories or
roles. We do not formally define the coverage constraints here. We show them
only in a form of the examples illustrated in Figure 3.
– Order constraints can be specified for participants of a relationship type. An
ordering on a participant E of a relationship type R is displayed by a thick
solid line between R and E. If an ordering on E in R is specified, then for a
given entity e of the entity type E the set of relationships of the relationship
type R with e as a participant is ordered.
        </p>
        <p>The extending modeling constructs of the EReX model are demonstrated by
the schema in Figure 3. It displays the categorized entity type P erson and its
categories Student and P rof essor. The key of P erson is empty. Further, there
are the entity types Book and P aper connected with P rof essor by the
relationship types AuthorOf B and AuthorOf P , respectively. Attributes of Book and
P aper are not displayed. There is an ordering specified on the entity types Book
and P aper in the relationship types AuthorOf B and AuthorOf P , respectively.
It means, that the authors of a given paper or a given book are ordered.</p>
        <p>The total coverage constraint Student + P rof essor = P erson specifies,
that each person is a student or a professor and there are no other persons.
The exclusive coverage constraint Student| P rof essor specifies that students
and professors are disjoint. The total coverage constraint AuthorOf B.pbook +
AuthorOf P.ppaper = P rof essor specifies that each professor is an author of
some paper or book.</p>
        <p>Hierarchical Conceptual Models for XML
The extensions of the E-R model allow to model conceptual sch emata with a
graph structure. However, XML schema languages allow to express relationship
types only by nesting and references. It is possible to express all the relationship
types from an E-R schema by references, but it leads to flat sch emata and the
advantages of the hierarchical structure of XML are not utilized. On the other
hand, if the hierarchical structure is used to express relationship types in a
conceptual schema the problem with the decision about what to nest arises.
Another problem is how to represent n-ary relationship types and attributes of
relationship types.</p>
        <p>
          In this section we describe a basic hierarchical conceptual model for XML
first. In the next subsections, we describe two representatives of the conceptual
models for XML based on the hierarchical approach. The first representative is
the ORA-SS model [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and the second representative is the Sem antic Networks
for XML model [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
4.1
        </p>
      </sec>
      <sec id="sec-1-6">
        <title>Basic hierarchical conceptual model for XML</title>
        <p>The basic hierarchical conceptual model for XML can be easily defined as a
restriction of the E-R model where only the binary relations hip types with
cardinality types (1, 1) : 1 or (1, 1) : N and without attributes are allowed. Each
relationship type is oriented from the entity type with the arbitrary cardinality
called the parent participant to the entity type with the cardinality (1, 1) in the
relationship type called the child participant. We say that the child participant
is nested in the parent participant. This kind of relationship types may be called
nesting binary relationship types. When modeling XML data, the nesting binary
relationship types are represented by a nesting of elements on the XML logical
level. They express a hierarchical structure on the XML logical level explicitly
on the conceptual level. However, the semantics of nesting relationship types do
not have to be only ”part-of”. It may be a general association too.</p>
        <p>Such restrictions are too strong and do not allow to model conceptual schemata
with richer semantics. Nor n-ary relationship types, nor attributes of relationship
types can be modeled. Moreover, lots of redundancies may appear in schemata.
There are some approaches extending this basic hierarchical model described in
the following subsections.
4.2</p>
        <p>
          ORA-SS
ORA-SS is a rich hierarchical conceptual model for XML propo sed by Dobbie
et al. in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. ORA-SS has three basic modeling constructs: ob ject types,
relationship types and attributes. The object type construct is similar to the entity
type from the E-R model. Relationship types between object t ypes represent
hierarchical relationships. Non-hierarchical relations hips can be modeled by the
references. The authors introduce the concept of n-ary hierarchical relationship
types and attributes of hierarchial relationship types. Moreover, the authors offer
the following extending features:
– Cardinality constraints for the both participants of hierarchical relationship
types.
– An ordering on different concepts. The first type is an ordering on values of
a multivalued attribute of an object type. The second type is an ordering
between the attributes of an object type and nesting relationship types going
from the object type. The third type is an ordering on a relationship type
going from an object type. It allows to specify ordering between the objects
nested by the ordered relationship type in the parent object.
– A disjunction between two or more attributes or nesting relationship types.
        </p>
        <p>It allows to model irregular structure.</p>
        <p>Figure 4 displays an ORA-SS schema representing professors as employees of
departments and professors as members of projects. Each professor is employed
by exactly one department and each department employes one or more
professors. Each professor is a member of zero or more projects and each project has
one or more members. There is a ternary relationship type AuthorOf between
the object types P roject, P rof essor and P aper. It represents papers written
by a professor participating in a project. Each professor is an author of zero
or more papers in a project and each paper has one or more authors. For each
project there is a list of member professors and for each member professor there
is a list of papers he wrote during his work in the project. Moreover, there is
the attribute pages of the relationship type AuthorOf . For a professor being
an author of a paper in a project the value of pages is the number of pages the
professor wrote in the paper. However, the attribute pages cannot be directly
assigned to the relationship type AuthorOf . It must be assigned to the nested
object type P aper. For each P rof essor instance nested in a P roject instance
there must be a P rof essor instance nested in a Department instance containing
the name and address values of the professor. This is modeled by the reference
between the object types.</p>
      </sec>
      <sec id="sec-1-7">
        <title>Semantic Networks for XML</title>
        <p>
          The semantic network model for XML was introduced by Feng et al. in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The
model is a little extension to the basic hierarchical conceptual model described
in Section 4.1. Schemata in the semantic network model for XML are called
semantic networks. Nodes in semantic networks are used for modeling objects from
the real world and their attributes, and edges are used for modeling relationships
between the objects.
        </p>
        <p>Only binary hierarchical relationship types without attributes can be
modeled in the semantic network model. Moreover the parent participant cardinality
constraint of a hierarchical relationship type must be equal to (1, 1). Beside the
hierarchical relationship types it is possible to use non-h ierarchical relationship
types for modeling associations.</p>
        <p>Different constraints can be specified in the semantic network schema for
XML. Constraints can be specified over a node, over an edge and over a set of
edges. These constraints are a uniqueness, order, disjunction, etc.</p>
        <p>Figure 5 displays a semantic network schema. There are departments
represented by the node Department and professors in the departments represented
by the node P rof essor. The content of the node P rof essor is ordered. For each
professor there are the papers he wrote represented by the node P aper. Each
paper may be composed of chapters or sections, but not both (the exclusive
constraint). The courses offered by a department are represented by the node
Course. P rof essor is associated with Course. It represents the relationships
between a professor and the courses he teaches.</p>
        <p>Comparison of Described Conceptual Models
In this section, we compare the conceptual models mentioned in this paper.
The comparison is made against the general requirements and the modeling
constructs requirements introduced at the beginning of the paper.</p>
        <p>There are two comparative tables. Table 1 compares the models against the
general requirements and Table 2 compares the models against the modeling
constructs requirements. The well-known E-R model and the b asic hierarchical
model are compared too.</p>
        <p>We are not able to decide, which of the previous two approaches (E-R
extensions, hierarchical modeling) is better for the conceptual modeling of XML
data. Conceptual models based on the E-R model allow user to c reate a schema
with no metadata redundancy, but there is the problem with the modeling of the
specific XML features. Hierarchical conceptual models solve the problem with a
hierarchical structure of XML, but there arises problems such as data and
metadata redundancy, modeling of attributes of relationship types, and modeling of
n-ary relationship types.</p>
        <p>There are requirements that are not met by the described models. The
modeling of document centric data and the reuse of content is problematic. The
important requirement on the integration of conceptual schemata is solved only
by the ORA-SS model. None of the models solves the problem of t he integration
with the semantic web technologies.
− − − − − −
1 formal description is missing
−
−
−
−
−
−
√
−
−
−
−
√
√
√
−
−
√
−
−
−
− − − − − −
− on the content of a concept
− − − − √ 2 − √ −</p>
        <p>− on the participant of a relationship type
− − √ √ − √ 3 − −
· irregular and heterogeneous structure
√ −1 variant-valued attribute constructor
√</p>
        <p>√
− − − − −
− disjunctive constraints on relationship types
− √ 4 √ 4 − − √ − −
· document-centric data
− basic mixed content</p>
        <p>√
− − − − − − −</p>
        <p>− generalized mixed content
− − − − − − −
· reuse of content</p>
        <p>− IS-A or the category concept
√ √ √ √ √ √</p>
        <p>−
− named types and groups of concepts
−
− − − − − −
· integration of conceptual schemata
− modeling constructs</p>
        <p>√
− − − − −
− algorithms for merging schemata
−
−
√
−
−
−
−
−
−
−
−
−
−
√
−
−
−
−
−
−</p>
        <p>− − − − − − − −
− algorithms for the data translation between schemata
− − − − − − − − −
1 with the complex attributes extension
2 unordered content is restricted by the restrictions of xsd:all
3 only by ordered attributes, native XML ordering is not utilized
4 using the category concept
5 indirect modeling using hierarchical relationship types is possible
√
√
− 5
− 5
√
√
√
√
√
−
−
√
−
√
√
√
−
√
−
−
√
√
−
√
√
−
−
√
−
−
−
−</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Conclusions</title>
      <p>In this paper, we describe a state of the art of the conceptual modeling for XML.
There has been several papers proposing new conceptual models for XML. We
selected some representants of the models and describe them in this paper. We
propose a list of requirements conceptual models for XML should satisfy and
compare the mentioned conceptual models against the requirements.</p>
      <p>The comparison of the models shows that there is a poor support for some
specific XML features as ordering or mixed content as described by the modeling
constructs requirements proposed in Section 2. Moreover, the models poorly
concentrates on the usage of conceptual schemata for the data integration and
the integration with the semantic web technologies as described by the general
requirements proposed in Section 2.</p>
      <p>For these reasons, there is an open space for a research in the area of the
conceptual modeling for XML. Not only new modeling constructs should be
proposed to support specific XML features. The utilization of the conceptual
models for the data integration between different data sources including semantic
web resources should be explored too.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>A.</given-names>
            <surname>Badia</surname>
          </string-name>
          .
          <article-title>Conceptual Modeling for Semistructured Data</article-title>
          .
          <source>In Proceedings of the 3rd International Conference on Web Information Systems Engineering Workshops (WISE 2002 Workshops)</source>
          , p.
          <fpage>1701</fpage>
          -
          <lpage>77</lpage>
          . Singapore,
          <year>December 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Clark</surname>
          </string-name>
          .
          <source>XSL Transformations (XSLT) Version</source>
          <volume>1</volume>
          .0. World Wide Web Consortium,
          <source>Recommendation REC-xslt-19991116. November</source>
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Makoto</surname>
          </string-name>
          .
          <source>RELAX NG Specification. Oasis. December</source>
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>G.</given-names>
            <surname>Dobbie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xiaoying</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.W.</given-names>
            <surname>Ling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.L.</given-names>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>ORA-SS: An Obj ect-RelationshipAttribute Model for Semi-Structured Data</article-title>
          .
          <source>Technical Report</source>
          , Department of Computer Science, National University of Singapore. December 2000
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>D.W.</given-names>
            <surname>Embley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.W.</given-names>
            <surname>Liddle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Al-Kamha</surname>
          </string-name>
          .
          <article-title>Enterprise Modeli ng with Conceptual XML</article-title>
          .
          <source>In Proceedings of the 23rd International Conference on Conceptual Modeling (ER</source>
          <year>2004</year>
          ), p.
          <fpage>1501</fpage>
          -
          <lpage>65</lpage>
          . Shanghai, China,
          <year>November 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>L.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Dillon</surname>
          </string-name>
          .
          <article-title>A Semantic Network-Based Des ign Methodology for XML Documents</article-title>
          .
          <source>ACM Transactions on Information Systems</source>
          , Volume
          <volume>20</volume>
          ,
          <string-name>
            <surname>Number</surname>
            <given-names>4</given-names>
          </string-name>
          , p.
          <fpage>3904</fpage>
          -
          <lpage>21</lpage>
          .
          <year>October 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Fallside</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Walmsley. XML Schema</surname>
          </string-name>
          <article-title>Part 0: Primer Second Edition</article-title>
          . World Wide Web Consortium,
          <source>Recommendation REC-xmlschema-0-200 41028. October</source>
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>T.</given-names>
            <surname>Halpin</surname>
          </string-name>
          .
          <article-title>Information Modeling and Relational Databases From Conceptual Analysis to Logical Design</article-title>
          . Morgan Kaufmann Publishers,
          <year>2001</year>
          . ISBN:
          <fpage>1</fpage>
          -
          <lpage>55860</lpage>
          - 672-6
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. International Organization for Standardization.
          <source>Information Technology Document Schema Definition Languages (DSDL) Part</source>
          <volume>3</volume>
          :
          <article-title>Ruleb-ased Validation Schematron</article-title>
          .
          <source>ISO/IEC 19757-3</source>
          ,
          <year>February 2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>B.R.</given-names>
            <surname>Loscio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.C.</given-names>
            <surname>Salgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.R.</given-names>
            <surname>Galvao</surname>
          </string-name>
          .
          <article-title>Conceptual Modeling of XML Schemas</article-title>
          .
          <source>In Proceedings of the Fifth ACM CIKM International Workshop on Web Information and Data Management (WIDM</source>
          <year>2003</year>
          ), p.
          <fpage>1021</fpage>
          -
          <lpage>05</lpage>
          . New Orleans, Louisiana, USA,
          <year>November 2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>M.</given-names>
            <surname>Mani</surname>
          </string-name>
          .
          <article-title>EReX: A Conceptual Model for XML</article-title>
          .
          <source>In Proceedings of the Second International XML Database Symposium (XSym</source>
          <year>2004</year>
          ), p.
          <fpage>1281</fpage>
          -
          <lpage>42</lpage>
          . Toronto, Canada,
          <year>August 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>M. Mani</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>R.R.</given-names>
          </string-name>
          <string-name>
            <surname>Muntz</surname>
          </string-name>
          .
          <article-title>Semantic Data Modeling Using XML Schemas</article-title>
          .
          <source>In Proceedings of the 20th International Conference on Conceptual Modeling (ER</source>
          <year>2001</year>
          ), p.
          <fpage>1491</fpage>
          -
          <lpage>63</lpage>
          . Yokohama, Japan,
          <year>November 2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>F.</given-names>
            <surname>Manola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Miller. RDF</given-names>
            <surname>Primer</surname>
          </string-name>
          .
          <source>World Wide Web Consortium, Recommendation REC-rdf-primer-20040210. February</source>
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. M.
          <article-title>Necasky: Conceptual Modeling for XML: A Survey</article-title>
          .
          <source>Technical Report No. 2006- 3</source>
          , Dep. of Software Engineering,
          <source>Faculty of Mathematics and Physics</source>
          , Charles University, Prague,
          <year>2006</year>
          , 54 p.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. Object Management Group.
          <source>UML 2</source>
          .
          <article-title>0 Superstructure Specification</article-title>
          .
          <source>October</source>
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>G.</given-names>
            <surname>Psaila. ERX</surname>
          </string-name>
          :
          <article-title>A Conceptual Model for XML Documents</article-title>
          .
          <source>In Proceedings of the 2000 ACM Symposium on Applied Computing</source>
          , p.
          <fpage>8989</fpage>
          -
          <lpage>03</lpage>
          . Como, Italy,
          <year>March 2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>A.</given-names>
            <surname>Sengupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mohan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Doshi. XER - Extensible Entity R elationship Modeling</surname>
          </string-name>
          .
          <source>In Proceedings of the XML 2003 Conference</source>
          , p.
          <fpage>1401</fpage>
          -
          <lpage>54</lpage>
          . Philadelphia, USA,
          <year>December 2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>M. K. Smith</surname>
            , Ch. Welty,
            <given-names>D. L.</given-names>
          </string-name>
          <string-name>
            <surname>McGuinness. OWL Web Ontology Language Guide. World Wide Web Consortium</surname>
          </string-name>
          ,
          <source>Recommendation REC-owl-guide- 20040210. February</source>
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>B.</given-names>
            <surname>Thalheim</surname>
          </string-name>
          .
          <article-title>EntityR-elationship Modeling: Foundations of Database Te chnology</article-title>
          . Springer Verlag,
          <year>2000</year>
          , Berlin, Germany.
          <source>ISBN: 3-540-65470 -4</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>