<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Barcelona, Catalunya, Spain, April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Automated Traceability between Requirements and Model-Based Design</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maria Bonner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marc Zeller</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gabor Schulz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dagmar Beyer</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mihaela Olteanu</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Siemens AG</institution>
          ,
          <addr-line>Gleiwitzer Str. 555, 90475 Nürnberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Siemens AG</institution>
          ,
          <addr-line>Otto-Hahn-Ring 6, 81739 München</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Siemens S.R.L.</institution>
          ,
          <addr-line>Bulevardul 15 Noiembrie 78, 500097 Brasov</addr-line>
          ,
          <country country="RO">Romania</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>17</volume>
      <issue>2023</issue>
      <abstract>
        <p>Traceability is an important aspect in system development, since it helps to ensure that the developed system fulfills all requirements and prevents failures. When developing safety-critical systems, traceability is mandatory to demonstrate that the system is implemented correctly. Unfortunately, establishment and maintenance of trace links are hard to achieve manually in today's complex systems. Tools and methods for automated trace link generation, which have been proposed so far, have no broad adoption in industry and there is lack of evidence of their efectiveness. To address these challenges we introduce a tool, which allows establishing bi-directional traceability links between requirements and model-based designs using Artificial Intelligence (AI). Our tool is an extension of the Siemens toolchain for Application Lifecycle Management (ALM), systems engineering, and embedded software design. It creates trace links between requirements written in natural language and CapitalTM software, AUTOSAR, SysML, UML, or Arcadia models for system/software design. This paper describes the implemented use-cases of tracing system/SW/HW requirements to system architecture models and provides an overview of the tool architecture.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Requirement traceability</kwd>
        <kwd>Model-Based Design</kwd>
        <kwd>NLP</kwd>
        <kwd>Ontology</kwd>
        <kwd>Semantic Web</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Modern Electrical/Electronical (E/E) systems are characterized by a huge growth in
complexity. One of the big challenges is to assure that all requirements are implemented
correctly in software, electrical, electronic, and network designs. Especially, when
developing safety-critical system, safety standards, e.g., ISO 26262 [1] in automotive, ISO
17894 [2] in marine, or ARP 4754/4761 [3] in avionics, demand a proof that the system
is built correctly. Hence, it is required to show vertical traceability between (safety)
requirements, the system/software design, and the test cases.</p>
      <p>Following the ISO 26262 guideline as well as established standards for the engineering of
automotive E/E systems such as ASPICE [4], functional/safety system requirements are
typically derived from either stakeholder/user requirements or safety goals and provide
the input to create a functional system design. In the next step, the functional system
architecture is defined in a model-based way, e.g., as an Internal Block Diagram in
SysML [5], as a function chain in Capella [6], or as a functional architecture in CapitalTM
Systems Architect [7]. Afterwards, the functional requirements are refined in a set of
technical (safety) requirements which are the basis to create a technical/physical system
architecture. The physical system architecture describes the technical components (HW
&amp; SW) which realize the functional architecture. In a next design step, the technical
(safety) requirements are refined into a set of SW (safety) requirements and a set of HW
(safety) requirements which are the basis for the implementation of the E/E system by
a set of interconnected electronic control units (ECUs) which run a set of SW-based
functions.</p>
      <p>In the automotive domain, the SW of an ECU is realized using the AUTOSAR
standard[8]. Hence, the HW &amp; SW (safety) requirements can be linked to AUTOSAR
elements which realize these requirements. However, creating the necessary trace links is
cumbersome and in case of changes of either the requirements or the design, the links
must be adjusted accordingly. Since engineers fear to create wrong trace links which may
lead to confusion during the entire development, often no trace links are created at all.</p>
      <p>We propose a tool for semi-automating the task of creating and maintaining trace links
between requirements and model-based design (see Figure 1). The tool is aligned with
the Siemens development flow for the design of E/E systems [ 7]. For our tool we use
a semantic web layer, which extends the definition of Siemens ontology libraries [ 9] for
Siemens tools with a traceability model between the diferent tools.</p>
      <p>In this tool demo, we present a use-case, in which safety requirements are defined and
stored in the PolarionTM tool and the engineer is conducting the model-based design
of the E/E system in the CapitalTM software. The task of the engineer is to establish
trace links between safety requirements to the model-based designs to illustrate that the
safety features are implemented correctly. The proposed trace link recommender tool
suggests either trace links to a model element in the model-based design, or trace links to
a safety requirement in PolarionTM. In addition to the use-case, we provide an abstract
architecture of the tool with its underlying technologies.</p>
      <p>The rest of the paper is organized as follows: In Section 2, we provide a brief overview
of the foundations used for our tool. Section 3 describes the abstract architecture of our
tool. Afterwards, we present diferent use cases in which our tool automates the trace
link creation. At the end, we summarize the main results of the paper and provide an
outlook on future research work in Section 5.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Foundations</title>
      <p>There are a number of approaches in the area of information retrieval which allow to
extract information from diferent sources. In ontology-based approaches, requirements
are extracted into a knowledge graph conformed to a predefined ontology. This extraction
happens with a high level of granularity, additional reasoners allow to identify broader
semantic meaning [10]. Finally, the rules over ontologies can be defined to automate
the trace link generation or the availability of training data to adapt models for domain
specific entity recognition [ 11]. As opposed to granular approaches, which assume easy
recognizable structure, there are vector space based approaches [12], which assume text
conversion into a vector space model, and application of a distance measure to calculate
the semantic similarity between artifacts. Despite fundamental problems with these
techniques, such as recognition of diferent words in the same contexts (synonymy) or
identification of the same word for multiple purposes (polysemy), vector space methods
show high applicability in practice.</p>
      <p>Recent approaches for automated trace link generation are using machine learning
techniques [13, 14, 15, 16]. These techniques assume existence of predefined traces,
which are used as training data. A machine learning model learns from these data by
noticing the connection between artifacts and then proposes new traces. Machine learning
algorithms have big potential, since they can discover features which are not always
obvious to humans. The drawback is that machine learning techniques require large,
labeled datasets, which can hardly be shared outside of organizations. Additionally, there
is no guarantee that an algorithm trained on one dataset will perform well on another
dataset with the same structure.</p>
      <p>Our goal is to propose a tool, which (1) allows to aggregate information from diferent
sources in a semantic data layer, and (2) allows to generate trace links based on
vectorbased methods between the retrieved data.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Architecture</title>
      <p>The traceability recommender tool is an extension of existing Siemens tools and adds an
additional semantic data layer for data aggregation and communication of heterogeneous
sources (see Figure 2). This layer allows customizing traced artifacts for better data
organization by the definition of ontologies. Siemens is actively working on the Siemens
ontology database [9], which allows to unify product lifecycle management systems for
the creation of a digital twin over several domains.</p>
      <p>The traceability recommender tool covers diferent modelling domains. The E/E
architecture is defined either in the tool Capital TM Systems Architect or using SysML
or the Arcadia methodology. While the software system is defined using UML or
the AUTOSAR standard for embedded software in the automotive domain. For each
modelling domain there is a separate ontology. Traceability information are contained in
cross-domain ontologies. Additionally, ontologies and extractors for requirements defined
in the Polarion ISO 26262 templates [17] are created. These extractors preprocess and
label data. The preprocessing steps during extraction allow to add semantic context to
the textual information about requirements and elements from the model-based design.
Moreover, further ontologies can be added any time to extend the functionality of the
traceability recommender tool.</p>
      <p>Based on the information gathered in the semantic data layer, the traceability
recommender tool allows users to choose a model to represent requirements and elements
from model-based design into a common vector space, to compute their similarities, e.g.,
using the cosine similarity measure [18]. The most similar items are taken by the tool
as candidate trace elements and presented as recommendations to the user. One of the
possible semantic layers can be GraphDB [19], an RDF database for knowledge graphs.
In the next section, an example of the usage of the built-in semantic vector package [12]
will be shown.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Use-Case</title>
      <p>In this section, we present two possible outputs of the traceability recommender tool.
Here, an exemplary functionality of a Seat Heater in a car is taken as a use case. It
consists of a set of requirements specified in textual form (requirement types are presented
in Figure 1), a functional design, a physical design, and an AUTOSAR model. The
relevant information from all these artifacts is successfully extracted and preprocessed
into our semantic data layer.</p>
      <p>In the first use case, the traceability recommender tool is used to find requirements
which are implemented by a specific element in the AUTOSAR model. Assuming that
an engineer is working on the AUTOSAR model and defined an element of the type
RPortPrototype, which is named pt_seatheaterMode. The traceability recommender
tool returns three SW requirements from the Polarion specification, which are related to
the RPortPrototype pt_seatheaterMode, as depicted in Figure 3. The first column of
the table contains the full path to the SW requirement in the semantic data layer. The
second column contains the id of the SW requirement in the Polarion project. The third
and the fourth columns represent the title and the description of the requirement. Finally,
the fifth column contains the similarity score, which tells how similar is the provided
element in the model-based design to the suggested requirement. The information in
the table can now be used by the engineer to create trace links between the suggested
requirements and the AUTOSAR element. Hence, consistency and completeness in the
mapping of the HW &amp; SW (safety) requirements to AUTOSAR elements, which realize
these requirements, can be achieved.</p>
      <p>In a second use case, the engineer wants to verify that all safety requirements from a
given specification are addressed in the model-based design. In this scenario, our tool
recommends trace links for a specific SW requirement to the elements in which implement
the requirement the AUTOSAR model. For instance, for the requirement with the
title: "Activate seat heater degraded mode" and the description: "When the seat heater is
powered, the seat heater supervisor shall activate the seat heater Degraded Mode (via power
cutof) if any failure is detected continuously for 200ms." the traceability recommender tool
suggests trace links to the ports, which include such words as mode, cutof , temperature.
These words are either exisiting in the requirement name or description or have a similar
meaning. Moreover, our tool suggests trace links to software component types, which
are either connected with the mentioned ports or have similar words in their description.
Some of the recommended AUTOSAR elements for which a trace link should be created
are depicted in Figure 4. Based on this information the engineer can create trace links
between the requirement and the suggested AUTOSAR elements. Especially, when the
requirements of the AUTOSAR model are modified, the recommender tool helps engineers
to detect inconsistencies and eases the adaptation of the trace links to the performed
changes.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Further Research</title>
      <p>Ensuring that all requirements are implemented correctly, is a major challenge in the
development of modern E/E systems. Establishing and maintaining trace links between
requirements and the model-based design are hard to achieve manually due to the
increasing system complexity. In this paper, we introduce an AI-based recommender tool.
This tool extends the Siemens toolchain for the development of automotive E/E systems
and supports engineers to create bi-directional trace links between requirements written
in natural language and CapitalTM software, AUTOSAR, SysML, UML, or Arcadia
models for the system/software design.</p>
      <p>As next steps, we will generalize the existing approach, since the development of the
recommender is performed on specific datasets. Moreover, we plan sophisticated empirical
studies on the datasets and developed similarity algorithms. To increase support for the
engineers, we will incorporate reinforcement learning approaches in the recommender
tool. Hence, the recommender can self-improve based on confirmed generated traces.
Moreover, we plan to implement data obfuscation methods in the data extraction process,
since it is often not possible to share data between diferent companies.
[1] International Organization for Standardization, ISO 26262: Road vehicles –
Functional safety, 2011.
[2] Int. Organization for Standardization (ISO), ISO/TC 8/SC 8 Ship design, ISO
17894:2005 Ships and marine technology – General principles for the development
and use of programmable electronic systems in marine applications, 2005.
[3] Society of Automotive Engineers Inc. (SAE), ARP 4761: Guidelines and Methods
for Conducting the Safety Assessment Process on Civil Airborne Systems and
Equipment, 1996.
[4] Automotive SPICE, Standards Development Organisation, 2022. URL: https://www.</p>
      <p>automotivespice.com/.
[5] OMG, Systems Modeling Language (SysML), 2017. URL: http://www.omg.org/
spec/SysML/.
[6] P. Roques, MBSE with the ARCADIA Method and the Capella Tool, in: 8th
European Congress on Embedded Real Time Software and Systems (ERTS 2016),
Toulouse, France, 2016, pp. 1–8. URL: https://hal.science/hal-01258014.
[7] Siemens, Architecture-driven E/E Systems Development Flow,
White Paper, 2021. URL: https://resources.sw.siemens.com/en-US/
white-paper-ee-systems-development-flow.
[8] AUTOSAR initiative, AUTOSAR Website - Standards, 2018. URL: https://www.</p>
      <p>autosar.org/standards/classic-platform.
[9] S. T. Maja Milicic Brandt, Industrial Ontologies at Siemens, 2022.
https://ontocommons.eu/sites/default/files/20210607_MajaMilicicBrandt_
Ontocommons.pdf.
[10] M. Vierlboeck, D. Dunbar, R. Nilchiani, Natural language processing to extract
contextual structure from requirements, 2022, pp. 1–8. doi:10.1109/SysCon53536.
2022.9773855.
[11] Y. Li, J. Cleland-Huang, Ontology-based trace retrieval, in: 2013 7th International
Workshop on Traceability in Emerging Forms of Software Engineering (TEFSE),
2013, pp. 30–36. doi:10.1109/TEFSE.2013.6620151.
[12] D. Widdows, T. Cohen, The semantic vectors package: New algorithms and public
tools for distributional semantics, in: 2010 IEEE Fourth International Conference
on Semantic Computing, 2010, pp. 9–15. doi:10.1109/ICSC.2010.94.
[13] R. Rasiman, F. Dalpiaz, S. España, How efective is automated trace link recovery in
model-driven development?, in: Requirements Engineering: Foundation for Software
Quality: 28th International Working Conference, REFSQ 2022, Birmingham, UK,
Proceedings, Springer-Verlag, 2022.
[14] J. Guo, J. Cheng, J. Cleland-Huang, Semantically enhanced software traceability
using deep learning techniques, in: 2017 IEEE/ACM 39th International Conference
on Software Engineering (ICSE), 2017, pp. 3–14. doi:10.1109/ICSE.2017.9.
[15] N. Ali, Y.-G. Gueheneuc, G. Antoniol, Trust-based requirements traceability, in:
2011 IEEE 19th International Conference on Program Comprehension, 2011, pp.
111–120. doi:10.1109/ICPC.2011.42.
[16] J. Cleland-Huang, B. Berenbach, S. Clark, R. Settimi, E. Romanova, Best practices
for automated traceability, Computer 40 (2007) 27–35. doi:10.1109/MC.2007.195.
[17] Siemens, Polarion Extensions, 2022. URL: https://extensions.polarion.com/.
[18] C. D. Manning, P. Raghavan, H. Schütze, Introduction to Information Retrieval,
Cambridge University Press, Cambridge, UK, 2008. URL: http://nlp.stanford.edu/
IR-book/information-retrieval-book.html.
[19] GraphDB, Semantic Graph Database, 2023. URL: https://graphdb.ontotext.com/.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>