Embedding Defect and Traceability Information in CIM-
          and PIM-Level Software Models

                                Jörg Rech and Mario Schmitt

             Fraunhofer IESE, Fraunhofer-Platz 1, 67663 Kaiserslautern, Germany
                   +49 (0) 631 6800 2210, Joerg.Rech@iese.fraunhofer.de
                  +49 (0) 631 6800 2215, Mario.Schmitt@iese.fraunhofer.de


       Abstract. Additional information about software models comes in different
       forms such as defects detected, design patterns used, traceability information to
       other abstraction levels, etc. In this paper, we present how additional informa-
       tion about defects, context, traceability, etc. can be embedded into UML- and
       BPMN-based PIM- and CIM-level software models. Furthermore, we present a
       tool that uses the information about quality defects within a PIM to visualize
       defects directly in the diagrams of a software model.

       Keywords: Quality Defects, PIM, CIM, Traceability, Defect Annotation,
       Traceability Annotation, Embedded Information, Software Models, MDSD


1    Introduction
Business users together with business analysts and architects generate the basic char-
acteristics of a software system that result in computation independent models (CIM),
including, e.g., role, product, or process models. In order to support the traceability of
elements and of decisions made on the CIM-level to the PIM-level as well as the
traceability of problems identified at the PIM-level to the CIM-level, we have to store
additional information about the software model along with the software model.
   Additional information about elements in a software model such as a PIM or CIM
comes in different forms such as quality defects detected, patterns (roles) used, trace-
ability information to other abstraction levels, etc. This information is documented by
users or automated mechanisms and has to be visualized in standard or special views
of a modeling tool, made available to other systems for further analysis (e.g. impact
analyses), or persisted over a long period of time.
   Kolovos et al. [7] differentiate between external and embedded traceability infor-
mation and decided on using an external approach. They argue against the embedded
approach (based on stereotypes), as it does not support inter-model relations, pollutes
the models, and degrades uniformity.
   However, while several other options are possible beside stereotypes, storing addi-
tional complex information in a metamodel such as the UML [11] for PIM level is not
straightforward. An extension of the UML metamodel would result in non-standard
models that are not exchangeable between tools. Besides, in order to apply similar
mechanisms to models at different abstraction levels based on different (or previously
unknown) metamodels, we need a generic approach that can be easily adapted to and
complies with a broad range of metamodels.
   In our context, the de-facto metamodel on the PIM level is UML [11], which is
built using the OMG’s Meta Object Facility (MOF) meta-metamodel. MOF is a com-
mon modeling language kernel providing a unified basis for all OMG metamodels.
On the CIM level, modeling focuses on business process modeling using mostly the
Business Process Modeling Notation BPMN [2] and the Business Process Definition
Metamodel BPDM [1]. While BPMN provides just a graphical notation for process
orchestration, BPDM is a CIM-level metamodel for business process modeling, using
BPMN as the graphical notation. Similar to UML, BPDM is based on the MOF [8]
meta-metamodel.
   Several solutions to the abovementioned problem of embedding information into
software models are possible. In order to store the information in an Eclipse-based
IME for PIMs, such as Topcased [10], and for CIMs, such as the BPMN-Editor of the
SOA Tools Platform (STP) [4], we can persist information as:
• Markers/Properties in/of the software model that are managed and stored by the
   tool, but cannot easily be shared between users or across a versioning system (e.g.,
   CVS)
• MOF Tag, a construct in MOF that enables the multiple “tagging” of MOF model
   elements with attribute-value pairs and can be shared across a versioning system,
• Comment, a construct of MOF and many other metamodels (e.g., “Text Annota-
   tions” in BPMN) similar to MOF Tag for storing a single additional information
   item (i.e., text field) per element, but, typically, is used for developer documenta-
   tion,
• UML Profile/Stereotype, an extension mechanism in UML that can be used to
   integrate additional elements into the UML. However, the information might be
   confused with, domain-specific stereotypes for example and could flood the user
   with too much information, not necessary in day-to-day work,
• External files, similar to diagram interchange [3] files in Topcased, which use a
   semantic bridge to refer to elements of the software model(s). However, these files
   need to be used by the tools at work in order to synchronize changes to the model
   elements.
In order to enable the annotation of elements in a MOF-based software model, with
respect to providing easily synchronizable and versionable information, we selected
MOF Tags to persist information about defects detected, context factors, and trace-
ability information. Furthermore, the tagging mechanism allows embedding complex
information within a software model using an XML schema to describe and structure
the specific content for every specific annotation. The XML schema represents a
metamodel that allows us to define the substructures for the information on defects,
traceability, etc.


2       A Metamodel for Defect and Traceability Annotations
While traceability and context information has to be annotated manually (for now),
defects are identified by diagnostic mechanisms that analyze the system and find


    2
typical recurring problems, that have a negative effect on internal quality aspects
(e.g., maintainability, portability, or usability).
    The five types of defect-related embedded information are: Defect Annotations
(with information about the diagnosed quality defects), Context Annotations (with
context information on design patterns and roles applied or on special stereotypes,
used to differentiate the diagnosis), Decision Annotations (with decisions such as
“ignore” for individually diagnosed quality defects in case they are wrongly
diagnosed or not removable in this specific location), Symptom Annotations (with
information on the identified symptoms), and Treatment Annotations (which are used
to store the treatments applicable for removing the diagnosed quality defects).
    Traceability information uses just one type of annotation (Trace Annotations) that
realizes traces from one element to one or more other elements (e.g., from one CIM
element to multiple PIM elements (downwards), from one PIM to multiple CIM ele-
ments (upwards), or from one PIM to multiple other PIM elements (sidewards)). Mul-
tiple types of references can be used between abstractions and within abstractions.
    Furthermore, while single-location defects are enclosed within one abstraction at
one element, multi-location defects refer to other elements (resp. annotations) within
the same model, and all defects might refer to elements on another abstraction level to
document rationales for not removing a defect or to pinpoint a cause or (design) deci-
sion (e.g., in a CIM).
    Figure 1 shows different aspects. On the OMG-specification-side (right), it outlines
the generic approach as proposed by MOF for annotating model elements with addi-
tional information (metainformation) using the Tag entity. It introduces the base
model elements of BPMN-based CIMs (BPDM::Element) and UML-based PIMs
(UML::Element) both deriving from MOF::Element, which acts as common model
element abstraction. Eclipse provides for these concepts either an OMG-conformant
implementation or analog concepts that can be easily mapped (center of Figure 1) to
OMG. For MOF, the element MOF::Element is mapped to the Ecore element
Ecore::EModelElement of the Eclipse Modeling Framework (EMF) and MOF::Tag is
mapped to Ecore::EAnnotation. Similarily, for CIM-modeling, the BPMN element
BPDM::Element is mapped to the SOA Tools Platform (STP) project object
STP::BPMN::NamedBpmnObjects. Finally, the element UML::Element of OMG’s
UML is implemented (as a one-to-one representation) by the Model Development
Tools (MDT) project’s UML2 MDT::UML2::Element.
    Furthermore, Figure 1 presents an XML-based metamodel (left) for defect- and
traceability-oriented model metainformation and shows how actual metainformation
is embedded using the tagging mechanism/within annotations. A metamodel similar
to the traceability information metamodel is used by Feng et al. [6] for external trace-
ability models.
    A model element may have multiple annotations (EAnnotations) associated with it,
each consisting of a source URI denoting an annotation’s type and an arbitrary num-
ber of key/value pairs. Following the structure of annotations, we bundle all model
metainformation in a single annotation element, but internally distribute information
to multiple key/value pairs according to the information’s scope/type (e.g. «Traceab-
lity» or «Quality»). That is, key determines the type/scope of the XML-based metain-
formation assigned to value.


                                                                                      3
Figure 1 Metamodel for Quality and Traceability Information in CIM and PIM Models
   Figure 2 gives a simplified, exemplary XMI serialization of a UML-based PIM
model with a model element annotated with quality information. Quality information
consists of quality model and defect detection information. According to the (non-
functional) requirements a software system has to meet, a quality model defines and
prioritizes mandatory quality aspects and thus, is the basis for interpreting/verifying
the quality of a software model. In the context of VIDE-DD, determining the quality
of a software model focuses on detecting quality defects. A quality defect represents a
system-independent defect at one or more model elements with a negative impact on
certain quality aspects. Defects are diagnosed on the basis of one or more quantifiable
characteristics of a model or its model elements, so-called symptoms. The intensity
with which symptoms promote related defects differs and amongst other things,
largely depends on the characteristic’s deviation from previously defined threshold(s).
For removing a defect or mitigating a defect’s (negative) impact on certain quality
aspects, treatments refer to available techniques (e.g. refactorings). The exemplary
annotation in Figure 2 illustrates the concept of defect detection information: A Lazy


  4
Class defect has been diagnosed for the PIM-level class Opportunity based on the
Number of Operations. Hence, a negative impact on the declared quality aspect Main-
tainability is expected, treatable by applying an Inline Class refactoring.
 <uml:Model>
  ...
  <packagedElement xmi:type="uml:Class" xmi:id="_CyIsaF-fEdySHqlLXw_Tew"
   name="Opportunity">
   <eAnnotations source="http://www.iese.fraunhofer.de/ModelMetaInformation">
    <details key="QualityInformation" value="
      <!-- BEGIN: Embedded QualityInformation XML-string -->
      <QualityInformation>
       <DefectDetectionInformation>
        <Defects>
         <Defect name="Lazy Class" description="Class Opportunity provides
                       not enough functionality to justify its existence."
          pluginId="diagnosis.lazyclass"
          defectiveElement="_CyIsaF-fEdySHqlLXw_Tew">
          <IdentifiedSymptoms>
           <Symptom name="Number of Operations"
            description="Number of operations is below threshold"
            pluginId="analysis.noo"
            sourceElement="_CyIsaF-fEdySHqlLXw_Tew"
            parentDefect="diagnosis.lazyclass" promotesDefect="major">
            <Thresholds>
             <Threshold name="Lower Threshold"
              unit="Integer" targetValue="6" actualValue="2"/>
            </Thresholds>
           </Symptom>
          </IdentifiedSymptoms>
          <AffectedQualityAspects>
           <QualityAspectImpact id="ISO9126_Maintainability"
            impact="negative" severity="major"/>
          </AffectedQualityAspects>
          <IndicatedTreatments>
           <Treatment name="Inline Class"
            description="Move all features of Opportunity into another class and delete it."
            pluginId=" refactoring.inlineclass"
            destinationElement="_CyIsaF-fEdySHqlLXw_Tew"
            parentDefect="diagnosis.lazyclass"/>
          </IndicatedTreatments>
         </Defect>
        </Defects>
       </DefectDetectionInformation>
       <QualityModel name="" description="">
        <QualityAspect id="ISO9126_Maintainability"
         name="Maintainability"
         description="The ease with which a software system or component can be modified…"
         aspectPriority="2"/>
       </QualityModel>
      </QualityInformation>
      <!-- END: Embedded QualityInformation XML-string -->
    "/>
   </eAnnotations>
  </packagedElement>
  ...
 </uml:Model>
Figure 2 Serialization of Quality Information Annotation
   Furthermore, we distinguish single- from multi-location defects [9]. A single-
location defect (e.g. Lazy Class) affects one model element (e.g., a class), whereas


                                                                                               5
multi-location defects apply to more than one element within the same model. For
example, a Shotgun Surgery defect is present when, due to strong coupling of classes,
a change in one class requires many subsequent changes in other classes. As each
concerned class is annotated with defect information, it is necessary to interrelate this
information, e.g. in order to elicit and apply adequate treatments. Thus, urlToDefects
(cf. Figure 1) allows for referencing related defects in other model elements.
   A model element’s traceability information comprises one to many traces to ele-
ments, both at different and at same abstraction levels. As presented in Figure 3, the
key component of a trace is urlToElement for identifying related elements using a
URL reference. The URL syntax is a path to the containing model repository, fol-
lowed by a model identifier (the model’s name) and the XMI-id of the model element.
To qualify the relation of two elements linked by a trace, different types of references
can be assigned to a) traces between abstractions, such as “realizes / is realized by”,
“refines / is refined by”, “specifies / is specified by“, “requires / is required by”, etc.
and b) traces within abstractions, such as “includes / is part of”, “verifies / is verified
by”, “defines / is defined by”, “constrains / is constrained by”, etc. (see [12] or [5]).
 <uml:Model>
  ...
  <packagedElement xmi:type="uml:Class" xmi:id="_CyIsaF-fEdySHqlLXw_Tew"
   name="Opportunity">
   <eAnnotations source="http://www.iese.fraunhofer.de/ModelMetaInformation">
    <details key="TraceabilityInformation" value="
      <!-- BEGIN: Embedded Traceability Information XML-string -->
      <TraceabilityInformation>
       <Trace
        urlToElement="http://iese.fhg.de/SalesOpportunity_CIM.bpmn#_TG7coT3iEd2hQ-HeytPXvA"
        type="realizes"
        rationale="Implementation of Opportunity data object"
        traceCreation="automatic" transitionAutomation="automatic"/>
      </TraceabilityInformation>
      <!-- END: Embedded Traceability Information XML-string -->
    "/>
   </eAnnotations>
  </packagedElement>
 </uml:Model>

Figure 3 Serialization of Traceability Information Annotation in PIM
   As generative model-driven development relies on model transformations between
abstraction levels, the information about whether a trace creation or transition be-
tween two related elements has been carried out manually, semi-automatically, or
automatically is of interest, e.g. for evaluating the quality of model transforma-
tions/model generators or for determining the overall level of automation. The XMI
serialization of traceability information between an Opportunity data object at the
CIM level and its implementation class at the PIM level is exemplified in Figure 3
(PIM-to-CIM) and Figure 4 (CIM-to-PIM).


  6
    <bpmn:BpmnDiagram>
     ...
     <artifacts xmi:type="bpmn:DataObject" xmi:id="_TG7coT3iEd2hQ-HeytPXvA"
      name="Opportunity">
      <eAnnotations source="http://www.iese.fraunhofer.de/ModelMetaInformation">
       <details key="TraceabilityInformation" value="
         <!-- BEGIN: Embedded Traceability Information XML-string -->
          <TraceabilityInformation>
           <Trace
            urlToElement="http://iese.fhg.de/SalesOpportunity_PIM.uml#_CyIsaF-fEdySHqlLXw_Tew"
            type="isRealizedBy"
            rationale="Implementation of Opportunity data object"
            traceCreation="automatic" transitionAutomation="automatic"/>
          </TraceabilityInformation>
          <!-- END: Embedded Traceability Information XML-string -->
       "/>
      </eAnnotations>
     </artifacts>
     ...
     </bpmn:BpmnDiagram>
Figure 4 Serialization of Traceability Information Annotation in CIM


3       Visualizing (Defect) Annotations in Modeling Environments
The information stored within the annotations can be used, for example, by the dia-
gram visualizer to enrich the standard UML diagrams with information about the
defects. As presented in Figure 5, the VIDE Defect Detector (VIDE-DD) extends the
Topcased modeling environment [10] and decodes the information within the annota-
tion in order to decorate an element (e.g., a class) with a defect icon or list all annota-
tions for the user (see 6). This tool is aimed at enriching the visualization of the mod-
els in order to inform designers and maintainers about potential threats to model qual-
ity.


Figure 5 The VIDE Quality Defect Detector


4       Conclusion
We presented how additional information about defects, context, or traceability can be
embedded in a UML- or BPMN-based software model (PIM or CIM) using Annota-
tions. To structure the information within these annotations, we used an XML-based


                                                                                                 7
metamodel that supports single- and multi-location annotations from CIM-to-PIM,
within PIM, and from PIM-to-CIM. Furthermore, we presented a tool that integrates
quality defect diagnosis into the contemporary modeling environment Topcased and
uses the annotations to present them in standard diagrams.
   In the future, more tools for defect diagnosis and traceability support will be devel-
oped and integrated into software development tools that have to overcome the chal-
lenges of synchronization and versioning. This is especially important for tools on the
model level, as these have to support quality assurance in and traceability between
multiple software and transformation models.


References
1.  BPDM-Beta1, Business Process Definition MetaModel (BPDM), Beta 1, OMG Adopted
    Specification, OMG, dtc/07-07-01, 2007.
2. BPMN-1.1, Business Process Modeling Notation, V1.1, OMG, 2008.
3. DI-1.0, UML Diagram Interchange Specification, version 1.0, Specification, Object Man-
    agement Group, Inc. (OMG), Needham, MA, USA, 2006.
4. EMF, "Eclipse Modeling Framework (EMF)," http://www.eclipse.org/modeling/emf/, last
    accessed on 1. April 2008.
5. A. Espinoza, P. P. Alarcon, and J. Garbajosa, Analyzing and Systematizing Current Trace-
    ability Schemas, In Annual IEEE/NASA Software Engineering Workshop (SEW), pp. 21-
    32, 2006.
6. Y. Feng, G. Huang, J. Yang, and H. M. Mei, Traceability between Software Architecture
    Models, In 30th Annual International Computer Software and Applications Conference
    (COMPSAC), pp. 41-44, 2006.
7. D. S. Kolovos, R. F. Paige, and F. A. C. Polack, On-Demand Merging of Traceability
    Links with Models, In 2nd EC-MDA Workshop on Traceability, 2006.
8. MOF-2.0, Meta Object Facility (MOF) Core Specification, version 2.0, Specification,
    Object Management Group, Inc. (OMG), Needham, MA, USA, formal/06-01-01, 2006.
9. J. Rech and A. Spriestersbach, Quality Defects in Model-driven Software Development,
    Deliverable, Fraunhofer Institute for Experimental Software Engineering (IESE), Kaiser-
    slautern, D4.1, 2007.
10. TopCased, "Topcased IDE," http://www.topcased.org/, last accessed on 27 November
    2007.
11. UML-2.1.1, Unified Modeling Language (UML), version 2.1.1, Object Management
    Group, Inc. (OMG), Needham, MA, USA, 2007.
12. A. von Knethen, Change-Oriented Requirements Traceability. Support for Evolution of
    Embedded Systems PhD Thesis. Kaiserslautern: University of Kaiserslautern, Department
    of Computer Science, 2002.


     8