<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On Evaluating Multi-Level Modeling</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Colin Atkinson</string-name>
          <email>atkinson@informatik.uni-mannheim.de</email>
          <email>tk@ecs.vuw.ac.nz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thomas K u¨hne</string-name>
          <email>tk@ecs.vuw.ac.nz</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Mannheim</institution>
          ,
          <addr-line>Mannheim</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Victoria University of Wellington</institution>
          ,
          <addr-line>Wellington</addr-line>
          ,
          <country country="NZ">New Zealand</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>-Multi-Level Modeling is receiving increasing levels of interest and its active research community is continuing to make progress. However, to advance the discipline effectively it is necessary to increase industry adoption and achieve better community cohesion. We believe that the key to addressing both these challenges is to promote the creation of more comparisons in the multi-level modeling field based on meaningful objective evaluations. In this position paper, we provide our view on what constitutes meaningful evaluations and discuss some of the issues involved in obtaining them, while presenting a broad overview of existing multi-level modeling evaluations. In particular, we emphasize the importance of understanding and managing the difference between internal and external qualities.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
      <p>
        Although Multi-level Modeling (MLM) has seen steady
development over recent years, industry adoption is still virtually
non-existent (a rare application of MLM in an industry setting
is described in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]). One explanation for the low adoption rate
is the current unavailability of industrial-strength approaches
and tools. However, even if better tool support were
available, wider adoption would still be hindered by the lack of
compelling evidence that switching from Two-Level Modeling
(TLM) to MLM brings benefits in industrial contexts. Creating
convincing comparisons would reduce this barrier and could
even expedite MLM research through industrial funding.
      </p>
      <p>Another obstacle to the discipline’s future growth is the lack
of research cohesion that may eventually make it impossible
for community members to build on each other’s results.
Without a sense of direction — i.e., a common understanding
of the way forward – the discipline runs the risk of research
diversification to a point where it loses its core focus and
subsequently critical mass. We therefore believe that objective
comparisons between competing approaches are not only
desirable to provide a compass for future development but may
eventually become necessary for the discipline’s survival.</p>
      <p>Since convincing comparative evaluations are the key to
addressing both of the aforementioned challenges, in this
position paper we discuss some of the issues involved in
performing such evaluations in the context of MLM. We first
establish the basic parameters of meaningful, scientifically
sound evaluations and then discuss a variety of concrete
approaches, pointing to existing work where applicable.</p>
    </sec>
    <sec id="sec-2">
      <title>II. MEANINGFUL EVALUATIONS</title>
      <p>The effectiveness of MLM evaluations in promoting
industry adoption and research cohesion depends on the extent to
which they measure something of relevance. Naturally,
relevance itself depends on the stakeholders and their respective
goals. However, in general, a meaningful evaluation provides
results that have some kind of real-world relevance. In contrast,
a meaningless evaluation – e.g., measuring the number of
vowels in a language’s keywords – has no such real-world
relevance. A comparison based on such an evaluation would
not yield any meaningful insights into which language should
be preferred for achieving any reasonable real-world impact.</p>
      <p>Meaningful evaluations, on the other hand, should be designed
to deliver some insights that provide the basis for pragmatic
guidance. In order for an evaluation to be meaningful in our
sense, it must address the following aspects:
A1: Measurability Any targeted properties must be
objectively observable. Ideally, measurements should yield numeric
results that are directly proportional to the property being
measured, as it is then not only possible to decide which
approach is better but also by how much. It is never possible to
judge an approach or tool to be, e.g., “good” or “productive”,
without breaking down how the quality concerned manifests
itself in terms of measurable properties.</p>
      <p>A2: Conclusiveness Measurements should yield consistent
results. Repeated performances of the evaluation need not yield
exactly the same outcome, but they should deliver reliable
values within a given margin of error. This also excludes
results with low confidence (e.g., because they lack statistical
significance).</p>
      <p>A3: Impartiality The choice of the properties to be measured
must not favor particularities of one solution that have no
proven relationship to the ultimate goal. For example, a set of
postulated requirements must be formulated in such a manner
that they reference the problem-domain and the ultimate
benefits to the targeted user rather than solution details.</p>
      <p>A4: Trueness When using proxies (e.g. substitutes for
realworld artifacts or practitioners) care must be taken to ensure
that no circumstantial bias is introduced. Trueness, therefore
comprises at least:</p>
      <p>A4:1: Context Relevance Model proxies (i.e., samples used
in lieu of real-world models) and the assumed operations on
them should be demonstrated to be representative. Otherwise,
a skewed selection could introduce undesired bias.</p>
      <p>A4:2: Demographic Relevance Substitute users should be
demonstrated to be representative of real users. In general, it
is not possible to transfer results between different bodies of
users (e.g., from students to practitioners in the field).</p>
      <p>
        A5: Pragmatic Relevance Targeted properties must have a for MLM is its ability to reduce accidental complexity [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ],
bearing on the actual needs of the intended users. This criterion i.e., the difference in complexity between an ideal model and
is the very foundation of a meaningful evaluation. The previous a concrete model involving solution-induced overhead, e.g.
aspects essentially characterize sound evaluations, whereas workarounds.
pragmatic relevance requires that there is an intent to measure A number of evaluations of multi-level modeling have
something of pragmatic value. been based on approximating the complexity of a model by
      </p>
      <p>
        It is obviously challenging to “tick” all the above “boxes” measuring its size, that is, the number of its elements. For
in practice, but we feel it is useful to have a checklist that example, Gerbig performed a comparison based on model size
helps to document where an evaluation may be lacking. in his Ph.D. thesis using a sample model from the enterprise
architecture domain [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The MLM version of the model has
      </p>
      <p>
        III. INTERNAL VERSUS EXTERNAL QUALITIES 50 modeling elements while the TLM version, using standard
Some of the aforementioned aspects are more difficult to workaround patterns such as the Type-Object pattern [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ],
address than others. In order to understand why, it is important has 95 modeling elements, amounting to an increase of 90%.
to be aware of whether an evaluation is intended to evaluate Rossini et al. performed a similar evaluation which yielded
an internal quality or an external quality. We use these terms a three-fold increase in the number of modeling elements
with their usual meaning in software engineering [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. in a two-level versus a multi-level model of their CloudML
      </p>
      <p>
        In our context, internal qualities pertain to the directly scenario [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
measurable properties of a model, e.g., number of model The extent of the practical relevance of the above
evaluaelements, number of constraints, average inheritance depth, tions was shown by de Lara et. al. by measuring the application
etc. External qualities, on the other hand, pertain to the frequency of TLM workaround techniques (cf. “Item
Descripexperience users have when working with a model, e.g., tor” pattern [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], “Type-Object” pattern [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], “Adaptive
Objectcreating it, understanding it, maintaining it, etc. Model” [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], etc.) in real-world models [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Since these
      </p>
      <p>
        Ultimately, only the external qualities have a direct bearing workaround techniques are responsible for increases of the size
on meaningful evaluations. However, due to the cost and of two-level models relative to their multi-level counterparts,
challenges involved in assessing external qualities directly de Lara et al. hence demonstrated that the observations made
in a meaningful way, one often attempts to approximate the in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] apply to a wide range of modeling practice. As
assessment of external qualities by assessing internal qualities much as 35% of all models in some areas [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], could thus
instead, based on the idea that there is a correlation between benefit from the potential size reductions.
internal and external qualities. It is standard practice to as- Although the above results provide a convincing
endorsesume that optimizing certain internal qualities (e.g., reducing ment for the practical relevance of MLM, they do so only to
complexity) is the key to achieving certain desirable external the extent that the assumption that model size1 approximates
qualities (e.g., increased maintainability). However, such an model complexity is reasonable. A larger model based on a
indirect evaluation of external properties is only trustworthy simple underlying language could conceivably be preferable
if the assumed underlying correlation has been demonstrated, to a compact model based on a complex language.
or at least has been made plausible by compelling arguments. Going beyond assessing model size, it appears useful
      </p>
      <p>
        Interestingly, A1&amp;A2 are most easily addressed by focusing to consider other classic metrics [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and
qualon the internal qualities of an approach. Such qualities, e.g., ity attributes [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Indeed, in his MLM vs TLM
the complexity of the models created by an approach, can comparison, Gerbig also considered such classic
mettypically be reliably assessed. In contrast, assessing external rics [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Overall, however, these proved to be less
conqualities often implies some compromise in A1&amp;A2 because clusive than model size comparisons, although he
desample populations may be small or certain assumptions may tected clear advantages for MLM with respect to
counot generalize. pling (average number of distinct connected classes) and
      </p>
      <p>
        Aspects A3 &amp; A5, on the other hand, are best addressed by overhead2 ((well-formedness rules + additional operations)=
focusing on the external qualities of an approach. External element count)) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
qualities directly reflect the utility of the approach to its Given these less conclusive results (compared to model
users and hence avoid solution bias (A3) plus intrinsically size analyses) it would be easy to be skeptical about the
imply pragmatic relevance (A5). The increased cost involved in actual advantages offered by MLM. However, it is important
directly assessing external qualities relates to ensuring conclu- to observe that these metrics were originally designed to target
siveness (A2) and trueness (A4). This cost is considerable and the type level only and thus entirely ignore the instance-level
therefore represents a major hurdle for this kind of evaluation. complexity caused by the application of TLM workarounds.
      </p>
      <p>This weakness of classic metrics for evaluating MLM is
IV. ASSESSING INTERNAL QUALITIES understandable given their motivation rooted in programming
and/or modeling software. In these contexts, instances and</p>
      <p>Complexity is one of the most commonly measured internal
qualities since it is assumed to have a correlation with
important external qualities such as maintainability, robustness,
and trustworthiness etc. In fact, the main value proposition
1Apparently equivalent to the much debated “lines of code” metric for
source code.</p>
      <p>
        2Referred to as “complexity” in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>Demonstrate understanding of a model.</p>
      <p>Read an incomplete model and correctly add missing parts.</p>
      <p>Read a defective model and identify all issues.</p>
      <p>Read a defective model and address all issues.</p>
      <p>Create a model from scratch for a specified purpose.</p>
    </sec>
    <sec id="sec-3">
      <title>The goal of this position paper has been to provide a</title>
      <p>discussion of the issues involved when aiming to perform
meaningful evaluations while providing a broad overview of
the MLM evaluations that have been conducted to date. The
number of already existing MLM evaluations is encouraging
and each of them represents a very useful step towards growing
MLM as a discipline. However, our discussion has shown
that the evaluations performed until now are overwhelmingly
focused on internal rather than external qualities. Hence their
pragmatic relevance – in the absence of the demonstration of
their relationships are irrelevant to users. However, in many Another external quality which lends itself relatively
domain modeling applications instances directly represent the straightforwardly to measurement is model robustness, i.e., the
subject under study. In such contexts, the complexity of resilience of a model to user error. Here the goal would be to
instance models is therefore very much a concern to users assess the likelihood of introducing errors when
creating/mainand should thus be considered in evaluations. taining models. In particular, in the context of MLM to TLM</p>
      <p>
        Instead of focusing on model properties (e.g., model com- comparisons, one would expect a two-level model to suffer
plexity), one may also consider language properties (e.g., lan- from more accidentally introduced errors than a
correspondguage expressiveness). For example, Atkinson et al. based their ing multi-level model. TLM would only provide the same
comparison of Melanee with MetaDepth on the differences be- safeguards against the introduction of model inconsistencies
tween their respective language features [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Grossmann et al.’s if all the well-formedness constraints implied by MLM are
more comprehensive comparison of 21 MLM approaches [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] transposed into the equivalent TLM models. One would still,
also involved language feature comparisons. However, Gross- however, expect a higher rate of well-formedness violations,
mann et al. also considered the intended target audience and since it is most likely easier to make mistakes in a lower level
the purpose of approaches, and furthermore considered the two-level model, compared to a higher-level multi-level model.
extent to which an approach has seen industry usage. This The final external quality we can cover here is productivity,
latter consideration could be regarded as including an external i.e., the speed by which users can develop or make changes to
quality, but without further information on how well the models. The underlying hypothesis of what could be referred
respective MLM approaches performed in industrial contexts to as cognitive challenge-based evaluations is that modeler
it is only a good starting point for further investigations. performance is a function of the adequacy of the language/tool
      </p>
      <p>Ideally, feature-based comparisons should be accompanied used. The higher the adequacy of the language/tool, the better
by an analysis of the impact of the different features on users. the modeler should perform when facing standard tasks.
While certain features may seem elegant, ultimately their value To this end, we propose a “5C”-approach, comprising the
must be assessed by considering external qualities. cognitive challenges listed in Table I.</p>
      <p>Assessing the adequacy of an approach would be performed
V. ASSESSING EXTERNAL QUALITIES by measuring completion speeds for representative concrete
tasks of the above five kinds. If languages/tools actually yield
different levels of productivity, one should expect to see
differences in the C1-C5 completion measurements. Ideally,
subjects should be chosen in such a way that results transfer
to the intended user base in order to achieve demographic
relevance. Full context relevance will be very hard to achieve
with this approach as it is typically not feasible to work with
realistically sized models in such experiments.</p>
      <p>
        In order to evaluate the ultimate purpose of any approach
intended to deliver value to a user, it is necessary to determine
properties based on external qualities which relate to user
experience. As far as we are aware, only two MLM evaluations
of this kind have been performed to date. Both of these
investigate model changes and thus can be reasonably regarded
as evaluating (aspects of) maintainability. In his Ph.D. thesis,
Gerbig performed a comparative model change analysis by
counting the number of primitive change operations needed
to respond to certain requirements changes [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. It turned out
that a homogeneous treatment of all classification levels and
Melanee’s emendation service [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] reduce the effort needed to
change the multi-level version of the model compared to the
two-level, EMF-based version.
      </p>
      <p>
        Kimura et al., also used a change-based approach to
compare Melanee, MetaDepth and EMF, with a particular
focus on extensibility [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. These kinds of analyses exhibit
ideal measurability, reproducibility, impartiality, and pragmatic
relevance. However, whether context relevance is adequately
addressed depends on how representative the chosen models
and editing operations are.
a strong correlation between the internal qualities they asses
with the external qualities that matter to users – is limited.
      </p>
      <p>It is natural that the first evaluations performed in an
emerging field are focused on internal qualities, as these are
usually much easier to asses than external ones. However, we
believe that for a) the benefits of MLM to become convincing
enough to generate serious interest from industry, and b)
comparative evaluations to become useful enough to maintain
the cohesion and momentum the research community requires,
more user-oriented evaluations focusing on external qualities
will be needed.</p>
      <p>An important initiative in this regard is the “Bicycle
Challenge” proposed by the MULTI 2017 workshop as a common
sample scenario, allowing various MLM approaches to be
compared based on an example with practical relevance.</p>
      <p>Ideally, more such benchmarks will be designed in the
future along with agreed upon usage scenarios, e.g., involving
subsequent extensions, detecting and removing defects, etc.</p>
      <p>It will remain a challenge to distinguish models and usage
scenarios that have context relevance from those that do not,
but any attempts to move MLM evaluations towards directly
assessing external qualities or to strengthen the confidence
in hitherto only assumed correlations between internal and
external qualities will represent significant steps forward.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Atkinson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gerbig</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kennel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>On-the-fly emendation of multilevel models</article-title>
          .
          <source>In: Proceedings of the 8th European Conference on Modelling Foundations and Applications</source>
          . pp.
          <fpage>194</fpage>
          -
          <lpage>209</lpage>
          . ECMFA'
          <volume>12</volume>
          ,
          <string-name>
            <surname>Springer</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Atkinson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gerbig</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lara</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guerra</surname>
          </string-name>
          , E.:
          <article-title>A feature-based comparison of melanee and metadepth</article-title>
          .
          <source>In: Proceedings of the 3rd Workshop on Multi-Level Modelling co-located with the 19th ACM/IEEE International Conference MODELS 2016. CEUR Workshop Proceedings</source>
          , vol. Vol-
          <volume>1722</volume>
          , pp.
          <fpage>25</fpage>
          -
          <lpage>34</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Atkinson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , Ku¨hne, T.:
          <article-title>Reducing accidental complexity in domain models</article-title>
          .
          <source>Software and Systems Modeling</source>
          <volume>7</volume>
          (
          <issue>3</issue>
          ),
          <fpage>345</fpage>
          -
          <lpage>359</lpage>
          (Springer Verlag,
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Bansiya</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>C.G.</given-names>
          </string-name>
          :
          <article-title>A hierarchical model for object-oriented design quality assessment</article-title>
          .
          <source>IEEE Trans. Softw. Eng</source>
          .
          <volume>28</volume>
          (
          <issue>1</issue>
          ),
          <fpage>4</fpage>
          -
          <lpage>17</lpage>
          (
          <year>Jan 2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Chidamber</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kemerer</surname>
            ,
            <given-names>C.F.</given-names>
          </string-name>
          :
          <article-title>A metrics suite for object oriented design</article-title>
          .
          <source>IEEE Transactions on Software Engineering</source>
          <volume>20</volume>
          (
          <issue>6</issue>
          ),
          <fpage>476</fpage>
          -
          <lpage>493</lpage>
          (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Coad</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Object-oriented patterns</article-title>
          .
          <source>Communications of the ACM</source>
          <volume>35</volume>
          (
          <issue>9</issue>
          ),
          <fpage>152</fpage>
          -
          <lpage>159</lpage>
          (
          <year>Sep 1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Gerbig</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          : Deep,
          <article-title>Seamless, Multi-format, Multi-notation Definition and Use of Domain-specific Languages</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Mannheim (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Igamberdiev</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grossmann</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Selway</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumptner</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>An integrated multi-level modeling approach for industrial-scale data interoperability</article-title>
          .
          <source>Software &amp; Systems</source>
          Modeling pp.
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Igamberdiev</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grossmann</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumptner</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A feature-based categorization of multi-level modeling approaches and tools</article-title>
          .
          <source>In: Proceedings of the 3rd Workshop on Multi-Level Modelling co-located with the 19th ACM/IEEE International Conference MODELS 2016. CEUR Workshop Proceedings</source>
          , vol. Vol-
          <volume>1722</volume>
          , pp.
          <fpage>45</fpage>
          -
          <lpage>55</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woolf</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Type object</article-title>
          . In: Martin,
          <string-name>
            <given-names>R.C.</given-names>
            ,
            <surname>Riehle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Buschmann</surname>
          </string-name>
          ,
          <string-name>
            <surname>F</surname>
          </string-name>
          . (eds.)
          <source>Pattern Languages of Program Design 3</source>
          , pp.
          <fpage>47</fpage>
          -
          <lpage>65</lpage>
          . Addison-Wesley (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Kosaku</given-names>
            <surname>Kimura</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.S.:</surname>
          </string-name>
          <article-title>An evaluation of multi-level modeling frameworks for extensible graphical editing tools</article-title>
          .
          <source>In: Proceedings of the 3rd Workshop on Multi-Level Modelling co-located with the 19th ACM/IEEE International Conference MODELS 2016. CEUR Workshop Proceedings</source>
          , vol. Vol-
          <volume>1722</volume>
          , pp.
          <fpage>35</fpage>
          -
          <lpage>44</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Lara</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guerra</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuadrado</surname>
            ,
            <given-names>J.S.:</given-names>
          </string-name>
          <article-title>When and how to use multilevel modelling</article-title>
          .
          <source>ACM Transactions on Software Engineering and Methodology</source>
          <volume>24</volume>
          (
          <issue>2</issue>
          ),
          <volume>12</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          :
          <fpage>46</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Lorenz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kidd</surname>
          </string-name>
          , J.:
          <article-title>Object-oriented software metrics: a practical guide</article-title>
          . Prentice-Hall, Inc. (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shao</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Ma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            ,
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          :
          <article-title>Applying oo metrics to assess uml meta-models</article-title>
          . In: Baar,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Strohmeier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Moreira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Mellor</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.J</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of UML</source>
          <year>2004</year>
          , Lisbon, Portugal, pp.
          <fpage>12</fpage>
          -
          <lpage>26</lpage>
          . Springer (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Purao</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaishnavi</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Product metrics for object-oriented systems</article-title>
          .
          <source>ACM Comput. Surv</source>
          .
          <volume>35</volume>
          (
          <issue>2</issue>
          ),
          <fpage>191</fpage>
          -
          <lpage>221</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Rossini</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Lara</surname>
          </string-name>
          , J.,
          <string-name>
            <surname>Guerra</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolov</surname>
          </string-name>
          , N.:
          <article-title>A comparison of two-level and multi-level modelling for cloud-based applications</article-title>
          .
          <source>In: Proceedings of ECMFA 2015</source>
          . pp.
          <fpage>18</fpage>
          -
          <lpage>32</lpage>
          . LNCS 9153 (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Sommerville</surname>
            ,
            <given-names>I.: Software</given-names>
          </string-name>
          <string-name>
            <surname>Engineering</surname>
          </string-name>
          . Pearson, 10th edn. (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Yoder</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          , Johnson, R.E.:
          <article-title>The adaptive object-model architectural style</article-title>
          .
          <source>In: Proceedings of the 3rd IEEE/IFIP Conference on Software Architecture: System Design, Development and Maintenance</source>
          . pp.
          <fpage>3</fpage>
          -
          <lpage>27</lpage>
          . Kluwer (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>