<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Measuring Complexity of Domain Standard Speci cations using XML Schema Entropy</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>George Feuerlicht</string-name>
          <email>george.feuerlicht@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vladimir Kovar</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Hartman</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marek Beranek</string-name>
          <email>marek.beranekg@unicorncollege.cz</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pavel Bory</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Prague University of Economics</institution>
          ,
          <addr-line>W. Churchill Sqr. 4, 130 67 Prague 3</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Unicorn College</institution>
          ,
          <addr-line>V Kapslovne 2767/2, 130 00 Prague 3</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Technology</institution>
          ,
          <addr-line>Sydney, P.O. Box 123, Broadway, NSW 2007</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
      </contrib-group>
      <fpage>124</fpage>
      <lpage>131</lpage>
      <abstract>
        <p>XML schemas are used extensively in e-commerce standardization initiatives. Such XML-based standards de ne the structure and the semantics of messages that are used to implement business transactions in a particular industry domain (e.g. travel). The design of the document structures that form the message payloads is of key importance as once the speci cation is published it is di cult to re-design the documents without impacting on existing applications. Furthermore, such domain standards need to be maintained and evolved over long time periods, typically decades, without unduly increasing the complexity of the speci cation. The concept of software entropy has been used in the literature to estimate complexity and to express decline in quality, maintainability and understandability of software though its lifetime. In this paper we propose a Message Software Entropy (MSE) metric that estimates the complexity of XML message structures and we use this metric to study the complexity of a subset of the Open Travel Alliance Speci cation as it evolves over time.</p>
      </abstract>
      <kwd-group>
        <kwd>complexity metrics</kwd>
        <kwd>XML schema evolution</kwd>
        <kwd>software entropy</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        XML schemas speci cations are used extensively in both vertical domain (e.g.
Open Travel Alliance [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]) and horizontal domain (e.g. ebXML [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]) e-commerce
standardization initiatives. Such XML standards de ne the structure and the
semantics of messages that are used to implement business transactions in
a speci c domain of interest (e.g. travel). The messages are typically delivered
using SOAP [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] or REST [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] web services, so that in e ect the speci cation forms
a basis for a large and complex SOA (Service Oriented Architecture) software
system. The design of the document structures that form the message payloads
is of key importance as once the speci cation is published it is di cult to
redesign without impacting on existing applications. Furthermore, the longevity
of domain standards requires that the design of the standard documents allows
graceful evolution over a very long time period, typically decades, without unduly
increasing the complexity of the speci cation. Many existing standard speci
cations were designed to overcome the limitations of the Internet as it was at
the end of the last century (i.e. high latency, poor reliability and slow response
time) rather than with focus on software maintainability and evolution. This
typically resulted in speci cations consisting of large complex messages suited
for stateless communication, but di cult to maintain and evolve. Standard XML
speci cations (e.g. OTA) typically contain hundreds of complex XML message
schemas and thousands of schema elements. As these speci cations evolve over
time incorporating new requirements their complexity further increases. For
example, the OTA message schema that de nes the structure of the ight
availability requests (OTA AirAvailRQ) contains 428 elements with multiple levels
of nesting. Design of such XML schemas typically follows the Document
Engineering approach [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] or a similar methodology that produces XML documents
by aggregating data elements based on pre-de ned simple and complex types.
For example, OTA message level schemas are constructed by aggregation of
simple (OpenTravel Simple Types) and complex (OpenTravel Common Types,
and Industry Common Types) schema elements [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This design approach while
ensuring uniform structure and semantics of data elements can result in
overlapping message schemas and excessive complexity, reducing the maintainability of
the speci cation [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Evaluation of the quality of the design of XML schema has been the subject
of recent research interest. In our previous work we have proposed data coupling
metrics that evaluate interdependencies among of a set of XML message schemas
by estimating the level of data coupling [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. In this paper we focus on estimating
the complexity of message schemas using the concept of Schema Entropy [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
In the following section (Section 2) we review related work that addresses the
problem of design quality of XML schemas. We then describe our proposal for
the Message Schema Entropy (MSE) metric and an XSD Analyzer tool that
we developed to compute the MSE metric (Section 3). In Section 4 we present
experimental results obtained by analysing OTA Air (Airline) schemas. We note
that the OTA message schemas were chosen as an example of open industry
domain speci cations, and that we do not imply any criticism of the OTA schema
design in this work. Section 5 presents our conclusions and outlines directions
for further work.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Evaluation of the quality of design of XML schemas has been the subject of
recent research interest [
        <xref ref-type="bibr" rid="ref10 ref11 ref9">9-11</xref>
        ]. Ensuring XML schema design quality for industry
domain speci cations presents a particularly di cult problem as the schemas are
often developed in the absence of a domain data model [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Current work in this
area includes research that focuses on identifying dependencies among schema
elements and developing tools for automating the propagation of schema changes
to all dependent components. Necasky, et al. proposed a ve-level XML
evolution architecture with the top level Platform-Independent Model (PIM) that
represents the data requirements for a particular domain of interest. PIM model
is mapped into a Platform-Speci c Model (PSM) that describes how parts of the
PIM schema are represented in XML. PSM then maps into Schema, Operational
and Extensional level models. Atomic operations (create, update, and remove)
for editing schemas are de ned on classes, attributes, and associations, and a
mechanism for propagating these operations from PIM to PSM schema is
proposed. Composite operations are constructed from atomic operations to
implement complex schema changes [
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12-14</xref>
        ]. Numerous XML schema quality metrics
have been proposed primarily with the objective to measure various aspects of
schema complexity. McDowell et al. proposed eleven metrics and two composite
indexes to measure the quality and complexity of XML schemas. These
metrics are based on counts of complex type declarations, derived complex types,
number of global type declarations, number of simple types, element fanning
(fan-out { number of child elements that an element has, and fan-in { number
of times that an element is referenced by), etc. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The authors formulate a
Quality Index and a Complexity Index that estimate the quality and complexity
of XML schemas based on a weighted values of the metrics. A metric analysis
tool is provided for developers to verify the validity of the metrics in the
context of speci c projects. The concept of entropy [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] has been adapted to the
measurement of complexity of software and was initially applied to procedural
software [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and later to object-oriented design [
        <xref ref-type="bibr" rid="ref17 ref18">17,18</xref>
        ]. Ruellan [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] used an
entropy measure to assess the amount of information contained in XML documents
(information density) with the objective to reduce the size of XML documents
and to improve processing speed of XML messaging applications. Thaw et al.
.[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] proposed entropy-based metrics to measure reusability, extensibility,
understandability of XML schema documents. Basci et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] proposed and validated
XML schema complexity metric that evaluates the internal structure of XML
documents taking into account various sources of complexity that include
recursion and complexity arising from importing external schema elements. The
authors used the concept of Schema Entropy (SE) to assess XML schema
complexity. SE is evaluated based on the complexity of schema elements as measured
by fan-in and fan-out, and the number of simple elements that constitute
individual schema elements. The SE metric was empirically validated using publicly
available XML schemas, and the authors conclude that the metric provides a
useful feedback when comparing schemas with equal number of elements [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
In [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] Tang et al. apply an entropy-based measure to assessing the structural
uniformity (structuredness) of XML documents. Two metrics are de ned:
PathBased Entropy and Subtree-Based Entropy that attempt to measure the level
of diversity of a set of XML documents. Unlike Basci et al. [
        <xref ref-type="bibr" rid="ref11 ref8">8,11</xref>
        ], the authors
base the entropy calculation on XML documents, rather than XML schemas.
Pichler et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] developed a set of metrics to analyse the complexity of business
documents with the objective of estimating the e ort involved in data element
mapping between di erent business document standards.
      </p>
      <p>
        Our proposal di ers from the above approaches in two important respects.
Firstly, we estimate schema entropy by adapting an entropy-based metric
originally developed for object-oriented design [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], and secondly our focus is on the
evaluation of the changes in complexity of domain speci cations as the speci
cations evolves over time.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Proposed Message Schema Entropy (MSE) Metric</title>
      <p>
        The concept of software entropy has been used in literature to express decline
in quality, maintainability and understandability of software though its lifetime.
In our formulation of the Message Schema Entropy (MSE) metric we adapt the
Class De nition Entropy (CDE) metric for object-oriented design described in
Bansiya et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The CDE metric calculates the frequency of occurrence of
name strings in a given class. Calculation of MSE metric involves computing the
frequency of occurrence of complex schema elements (i.e. schema elements that
contains other elements and attributes) using the formulae:
where:
N = total number of unique complex elements in the message schema
ni = number of occurrences of the ith complex element in the message schema
      </p>
      <p>M = total number of non-unique complex elements in the message schema
Pi = ni/M</p>
      <p>Figure 1 shows a fragment of the schema of the OTA AirAvailRQ message
illustrating the Speci cFlightInfo element. The OTA AirAvailRQ message is used
to implement (web) service for ight availability inquiry and includes 428
elements with multiple levels of nesting. Elements are based on simple types (e.g.
FlightNumber) or on complex types, as is the case with the Speci cFlightInfo
element, which is based on the Speci cFlightInfo Type (extension of Speci
cFlightInfo Type). MSE calculation is based on counting complex schema elements (i.e.
elements based on complex types) and represents an approximation, as the
internal complexity of individual elements is not taken into account.
3.1</p>
      <p>XSD Analyzer Tool
We have developed an XSD Analyzer tool that calculates the values of the MSE.
XSD Analyzer allows the selection of message schemas for analysis and produces
an output that includes the total number of non-unique complex schema elements
(N1), the number of unique (distinct) complex schema elements (N2), the value
of MSE, and counts of occurrences of complex schema elements.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Experimental Results</title>
      <p>The OTA Air messages are a subset of the OTA speci cation and are used
to implement services that support various business functions related to
airline travel such as checking ight availability, ight booking, etc. For example,
the Search and Availability of ights business function is implemented using
the Air AvailabilityRQ/RS (request/response) message pair. OTA de nes
common data types (OTA AirCommonTypes) for the airline messages that form a
global type repository of XML Schema components (i.e. simple and complex
type de nitions) used in the construction OTA Air messages. OTA di erentiates
between complex types (types that contain multiple data elements) and simple
types (types that contain a single data element). We use a subset of OTA Airline
(Air) message schemas for our calculations. Open Travel Alliance typically
publishes two (A and B) schema speci cations each year. We have used air message
schema speci cations for our evaluation of schema complexity. Table 1 shows
MSE values for 26 OTA Air message schemas for versions 2010B to 2013B. The
total value of the MSE for the entire set of messages is also shown at the bottom
of the tables. It is evident that MSE increases monotonically for all message
schemas (with some minor exceptions) as new (enhanced) schema versions are
released, indicating that the complexity of the OTA speci cation is increasing.
The total MSE increases from a value of 143 for the 2010B OTA Speci
cation to 154 for the 2013B OTA Speci cation, representing a 7% increase. This
increase in MSE over the period of four years is probably not signi cant and
indicates relative stability of the OTA messages schemas. The interpretation of
the signi cance of the MSE values and the increase in MSE as new versions are
released requires further analysis. Some insight into the relative complexity of
the message schemas can be gained by comparing the MSE of individual
messages schema with the value of MSE computed for the global schema elements
(OTA AirCommonTypes) that include all the common elements shared across
OTA Air messages. The MSE value for the OTA AirCommonTypes schema for
the 2013B version of the OTA Speci cation equals to 7.55. It can be noted that
the MSE value for the some of the message schemas (e.g. OTA AirAvailRQ,
OTA AirBookRQ, OTA AirBookRS) exceeds this value, indicating that the
complexity of these schemas is of similar magnitude as the complexity of the entire
global schema. In addition to globally de ned schema elements individual
message schemas include locally de ned elements that contribute to MSE, explaining
this apparent inconsistency.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Further Work</title>
      <p>Complexity of domain standard speci cations is a major factor inhibiting the
evolution of speci cations and increasing the maintenance costs of domain
applications. There is a need for reliable metrics that estimate the complexity of
XML-based standard speci cation and can be used to identify excessively
complex schema design early in the design cycle. In this paper we have proposed a
Message Software Entropy (MSE) metric that estimates the complexity of XML
schemas and we have used this metric to study the complexity of a subset of
the Open Travel Alliance Speci cation as it evolves over a period of four years
and seven version releases. The results indicate monotonic increase in schema
complexity as measured by MSE, with the total MSE increasing from 143 for the
2010B OTA Speci cation to 154 for the 2013B OTA Speci cation, representing
a 7% increase. This increase is probably not signi cant and indicates relative
stability of the OTA speci cation. We have also noted that the MSE value for
some of the message schemas is of similar magnitude as the value of MSE for the
global schema elements OTA AirCommonTypes (7.55), indicating that the
complexity of these schemas (e.g. OTA AirAvailRQ, OTA AirBookRQ) is similar to
the complexity of the entire OTA Air global schema. The current version of the
MSE metric is purely based on complex element counts and does not take into
account the complexity of the individual elements or the number of levels in the
message schema. This makes it easy to interpret the metric, but it also reduces
the accuracy of the estimates of schema complexity. MSE metric represents only
a rst approximation of the complexity of XML schemas, and we are working
on re ning the MSE metric to take account a range of XML schema structural
features.</p>
      <p>Acknowledgments. George Feuerlicht wishes to acknowledge the support of
Research Centre for Human Centered Technology Design at the Faculty of
Engineering and Information Technology, University of Technology, Sydney,
Australia.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. OTA.
          <article-title>OTA Speci cations</article-title>
          . (
          <year>2010</year>
          ) Available from: http://www.opentravel.org/ Specifications/Default.aspx
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. ebXML. ebXML
          <string-name>
            <surname>- Enabling A Global Electronic</surname>
            <given-names>Market.</given-names>
          </string-name>
          (
          <year>2007</year>
          ) Available from: http://www.ebxml.org/
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. WC3.
          <article-title>SOAP Speci cations</article-title>
          . (
          <year>2007</year>
          ) Available from: http://www.w3.org/TR/soap/
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Pautasso</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zimmermann</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leymann</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Restful web services vs. big'web services: making the right architectural decision</article-title>
          .
          <source>in 17th international conference on World Wide Web</source>
          . Beijing, China: ACM (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Glushko</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGrath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Document Engineering for e-Business</article-title>
          .
          <source>in Proceedings of the 2002 ACM symposium on Document Engineering (DocEng'02)</source>
          .
          <article-title>McLean, Virginia</article-title>
          , USA: ACM Press New York, NY, USA (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pichler</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strommer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huemer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Size matters!? measuring the complexity of xml schema mapping models</article-title>
          .
          <source>in Services (SERVICES-1)</source>
          ,
          <year>2010</year>
          6th World Congress on,
          <source>IEEE</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Feuerlicht</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Evaluation of Quality of Design for Document-Centric Software Services</article-title>
          .
          <source>in Service-Oriented Computing-ICSOC 2012 Workshops</source>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Basci</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Misra</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Entropy as a Measure of Quality of XML Schema Document</article-title>
          .
          <source>The International Arab Journal of Information Technology</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ): pp.
          <volume>75</volume>
          {
          <issue>83</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>McDowell</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yue</surname>
            ,
            <given-names>K.B.</given-names>
          </string-name>
          :
          <article-title>Analysis and metrics of XML schema (</article-title>
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Visser</surname>
          </string-name>
          , J.:
          <article-title>Structure metrics for XML Schema</article-title>
          .
          <source>Proceedings of XATA</source>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Basci</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Misra</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Measuring and evaluating a design complexity metric for XML schema documents</article-title>
          .
          <source>Journal of Information Science and Engineering</source>
          ,
          <volume>25</volume>
          (
          <issue>5</issue>
          ): pp.
          <volume>1405</volume>
          {
          <issue>1425</issue>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Necasky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Conceptual Modeling for XML, ser</article-title>
          .
          <source>Dissertations in Database and Information Systems Series</source>
          , IOS Press/AKA Verlag (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Necasky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mlynkova</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>A Framework for E cient Design, Maintaining, and Evolution of a System of XML Applications</article-title>
          . Proceedings of the Databases, Texts, Speci cations, and Objects, DATESO. 10: pp.
          <volume>38</volume>
          {
          <fpage>49</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Necasky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mlynkova</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Five-Level Multi-Application Schema Evolution</article-title>
          .
          <source>Proceedings of the Databases</source>
          , Texts, Speci cations, and Objects, DATESO 9: pp.
          <volume>213</volume>
          {
          <fpage>217</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Shannon</surname>
            ,
            <given-names>C.E.</given-names>
          </string-name>
          :
          <article-title>A mathematical theory of communication</article-title>
          .
          <source>ACM SIGMOBILE Mobile Computing and Communications Review</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ): pp.
          <volume>3</volume>
          {
          <issue>55</issue>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Mohanty</surname>
            ,
            <given-names>S.N.</given-names>
          </string-name>
          :
          <article-title>Entropy metrics for software design evaluation</article-title>
          .
          <source>Journal of systems and software</source>
          ,
          <year>1981</year>
          .
          <volume>2</volume>
          (
          <issue>1</issue>
          ): pp.
          <volume>39</volume>
          {
          <fpage>46</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Bansiya</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Etzkorn</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>An entropy-based complexity measure for object-oriented designs</article-title>
          .
          <source>Theory and Practice of Object Systems</source>
          ,
          <volume>5</volume>
          (
          <issue>2</issue>
          ): pp.
          <volume>111</volume>
          {
          <issue>118</issue>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Olague</surname>
            ,
            <given-names>H.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Etzkorn</surname>
            ,
            <given-names>L.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cox</surname>
            ,
            <given-names>G.W.:</given-names>
          </string-name>
          <article-title>An Entropy-Based Approach to Assessing Object-Oriented Software Maintainability and Degradation-A Method and Case Study</article-title>
          .
          <source>in Software Engineering Research and Practice. Citeseer</source>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Ruellan</surname>
          </string-name>
          , H.:
          <article-title>XML Entropy Study</article-title>
          . in
          <source>Balisage: The Markup Conference</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Thaw</surname>
            ,
            <given-names>T.Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khin</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          :
          <article-title>Measuring Qualities of XML Schema Documents</article-title>
          .
          <source>Journal of Software Engineering and Applications</source>
          <volume>6</volume>
          : p.
          <volume>458</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bressan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Measuring XML structured-ness with entropy</article-title>
          ,
          <source>in Web-Age Information Management</source>
          . Springer. pp.
          <volume>113</volume>
          {
          <issue>123</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>