<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Workshops, OpenRE, Posters and Tools Track, and Doctoral Symposium, Essen, Germany</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Requirements Engineering for Actors-with-Learning: Encompassing the Two Kinds of Modeling for Full Cognitive Cycle RE</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eric Yu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Toronto</institution>
          ,
          <addr-line>Toronto, ON</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>1</volume>
      <fpage>2</fpage>
      <lpage>04</lpage>
      <abstract>
        <p>Models are used extensively in software engineering - to facilitate requirements analysis, architectural design, and so on. Models are also central to recent approaches to AI, being the output of computationally intensive machine learning algorithms drawing on large datasets. In this short paper, we first suggest that developing machine learning applications, like other software initiatives, can benefit from familiar requirements engineering (RE) techniques such as goal- and agent-oriented RE, so that the resulting systems will address the needs and wants of end-users and other stakeholders. Then, we suggest that the two kinds of modeling are fundamentally complementary in that they pertain to the two sides of a cognitive cycle. We propose that an RE framework needs to address both sides of the cognitive cycle for all relevant actors, in order to be able to respond to the complex socio-technological realities of a data-rich world.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Requirements modeling</kwd>
        <kwd>AI</kwd>
        <kwd>iStar</kwd>
        <kwd>agent-oriented</kwd>
        <kwd>goal-oriented</kwd>
        <kwd>knowledge level</kwd>
        <kwd>cognitive agent</kwd>
        <kwd>learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Models play a crucial role in requirements engineering (RE). They offer a simplified view of reality,
facilitating specialized and focused lines of inquiry and analysis. Models are also central to recent
approaches to AI, being the output of computationally intensive machine learning algorithms drawing
on large datasets. Today, machine learning (ML) has become pervasive in software systems and already
has far reaching impacts on everyday life. Having effective RE techniques that ensure technology
systems meet humans needs and wants is therefore of crucial importance to system designers as well as
to technology users and society at large.</p>
      <p>In this paper, we first argue that ML models, despite being computationally generated, require
considerable human conceptual effort and reasoning to achieve desired results, and can therefore benefit
from familiar RE techniques based on conceptual modeling such as goal-oriented approaches, with
some modifications and extensions.</p>
      <p>
        We then note that, although the two kinds of models and modeling appear to be at odds with each
other, one focusing on empirical data while the other relies on domain experts and human knowledge,
the two are ultimately complementary. Drawing on the notion of a cognitive cycle from the study of
human cognition [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we suggest that each kind of modeling addresses one side of the cognitive cycle
– perception-to-knowledge on one side, and knowledge-to-action on the other. When used together, the
two kinds of models complete the cognitive cycle for each other.
      </p>
      <p>We argue that a framework that conceptually encompasses both kinds of modeling can offer the
more comprehensive requirements analysis that is needed for today’s complex sociotechnical realities.
We envision an agent-oriented requirements modeling framework based on an abstract
“knowledgelevel” cognitive agent covering the full cognitive cycle, so that each side of the cycle can potentially be
implemented at the physical level with a mix of human and machine intelligence and enactment.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Machine Learning Models Are Models Too</title>
      <p>
        In building software systems and applications, we are now routinely encountering two kinds of
models. Models have been core to the practice of software engineering for a long time. It has been said
that the critical skill of the software professional and computer scientist is the ability to work with
abstractions [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Models are used to conceive of systems to be built and to shape a new reality. They
serve as blueprints for analysis and communication with project sponsors and among project team
members. These models are handcrafted, relying on the knowledge of domain experts, business
stakeholders, and the creativity of system architects, strategist, and designers.
      </p>
      <p>Models of a different type are equally pervasive, in the empirical sciences. Models in this context
are derived from data collected from the world. Models aim to accurately capture the essence of reality
as it exists, so as to be able to build theories and understand phenomena, and to predict the future.</p>
      <p>In the past decade or so, this second kind of modeling has received a big boost from computational
advances - a convergence of advances in machine learning algorithms, raw computing power, and
network and mobile technologies. As modern life has gone digital, the massive amounts of data that is
the digital footprint of business transactions, social interactions, and physical activities are being tapped
to produce models that in turn reshape our lives.</p>
      <p>
        Although ML-generated models aim to faithfully reflect what is in the “raw data”, they are
nevertheless artifacts, produced through the deliberate actions and decisions of data scientists, ML
engineers, and project sponsors. While the specific activities in an ML project may be substantially
different from those in a conventional software development project [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and involving more
experimentation and iteration, the models must ultimately serve some purpose and meet the needs of
project sponsors and stakeholders. Goal-oriented RE techniques (employing modeling of the first kind)
that have been developed to support conventional systems development could potentially be adapted to
support the development of ML systems (which produces models of the second kind).
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposal #1: Requirements Modeling for Machine Learning (RE4ML)</title>
      <p>
        The recently developed GR4ML framework [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] follows a goal-oriented conceptual modeling based
approach to provide systematic guidance on developing ML applications. High-level strategic business
goals are progressively refined until the need for decisions are encountered. In order for decision makers
to benefit from data assets to make informed decisions, they need to be prompted to ask relevant
questions. These are formulated as “Question-Goals”. Based on the type of Question-Goals, candidate
classes of ML algorithms producing desired “Insights” in the form of models are suggested using
precompiled catalogs. The framework also includes conceptual modeling support for selecting among
candidate algorithms, as well as for determining data preparation workflow.
      </p>
      <p>As in other types of system development, goal-oriented requirements modeling can facilitate making
the connection between domain problem understanding and technology solution, clarifying and
justifying the step by step progression from problem to solution, all the while avoiding building
solutions that solve the wrong problem. Some initiatives may start top-down from business drivers,
while others may emphasize bottom-up leveraging of existing assets, such as readily available data
assets and reusable components. Ultimately the top-down and bottom-up elements must align for a
successful project. These are supported by connections across the three submodels in GR4ML – the
Business View, the Analytics/ML Design View, and the Data Prep View [16].</p>
      <p>
        The goal-oriented approach of GR4ML can potentially be extended to an agent-oriented approach
along the lines of i* [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], so as to reflect the differing interests of various parties and strategic
dependencies among them. The modeling concepts introduced in GR4ML specifically to address ML
needs suggests corresponding dependency types as extensions to i*. One actor might pose a
QuestionGoal to another actor and depend on the latter for Insight (an ML-model), which the first actor can use
to perform predictions. An alternate relationship might be that the second actor, instead of providing
the model, offers a prediction instead. This suggests a different type of dependency. In the first case,
the depender actor avoids exposing its query data to the model provider, and can further test the model
to determine its reliability. In the second case, the dependee actor need not disclose the model to the
depender, expecting the latter to trust the inference results. These distinctions can help the various
parties decide what relationships to adopt.
      </p>
      <p>By analyzing the network of dependencies among actors, one could determine whether the actor is
using appropriate data sources and other inputs, which may come with biases reflecting the strategic
interests of actors along the data supply chain. As in i*, quality (softgoal) dependencies among actors
can be identified and evaluated.</p>
      <p>
        Catalogues of design techniques for achieving common recurring softgoals can be compiled to assist
data scientists and system designers, as done for non-functional requirements (NFR’s) in conventional
software engineering [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. For ML and AI applications, NFR’s such as fairness, accuracy, trust, and
ethical principles are of particular concern [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]
      </p>
      <p>
        The actor abstraction in i* hides the details of internal operations of the actor. How an actor is able
to produce an ML-model can be revealed incrementally by treating that actor as composed of multiple
collaborating actors each responsible for some part of the ML process, e.g., data sourcing, cleaning,
feature engineering, algorithm selection, training, testing, tuning, etc. These responsibilities could be
treated as roles which may be performed by the same or different individuals (i* agents) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. The Two Sides of the Cognitive Cycle</title>
      <p>In treating an ML model as an artifact to be produced in a development project, as we did in the
preceding section, we have not given special consideration to the complementary nature of the two type
of models. Both types are models in that they are representations of reality, but each is arrived at and
used differently. ML modeling generates an abstraction of the world based on empirical data, whereas
conventional requirements and software engineering models are abstractions used by human
professionals mainly to produce software that act on the world when instantiated. From the viewpoint
of creating an intelligent agent or an effective information system, it is clearly equally important that
its models of the world be grounded in empirical data, and that its vision of the desired world as
expressed in models can be realized through action.</p>
      <p>
        By analogy to human cognition [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the two kinds of models and modeling correspond to the two
sides of a cognitive cycle – data-driven ML corresponds to the perception-to-model side, whereas
model-driven software development corresponds to the model-to-action side. A cognitive agent is
incomplete if either side is missing. In humans, the cognitive cycle is repeated frequently (estimated
cycle time of around 300 milliseconds [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]) to be able to sense changes in the environment and to act
accordingly.
      </p>
      <p>Conventional software constructed based on handcrafted models with knowledge elicited from
domain experts (e.g., through interviews, surveys, etc.) would typically have a long lag time on the
perception-to-model side. Today, with ML, many software systems are approaching near real-time
cognitive cycles in sensing the environment and responding to changes, by adopting, for example,
selfadaptive software architectures [11] or data-driven requirements engineering techniques [12].</p>
      <p>RE frameworks today need to encompass the full cognitive cycle, at least at a conceptual level
(regardless of the extent to which ML models or conceptual models are directly used in the automated
systems), so that requirements for both sides of the cycle will be identified, analyzed, and used to guide
development of a coherent system comparable to an intelligent cognitive agent.</p>
      <p>We briefly envision such a full cognitive cycle RE framework in the next section.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Proposal #2: Requirements Modeling for Actors-with-Learning (RE4AL)</title>
      <p>
        As automated systems are ultimately situated within a human social setting, requirements modeling
must cover the overall sociotechnical system [
        <xref ref-type="bibr" rid="ref8">17, 8</xref>
        ]. The i* agent-oriented modeling framework was
designed so that actors can be modelled at an abstract level without prejudging their internal
composition in terms of human or machine elements or some combination [13]. As ML becomes
prevalent in software systems, we nevertheless do not want to prejudge at the early requirements stage
what kinds of learning (the perception-to-model side) will be done by machine or by humans.
      </p>
      <p>We envision an agent-oriented requirements modeling framework based on an abstract
“knowledgelevel” cognitive agent [14] covering the full cognitive cycle, so that each side of the cycle can potentially
be a mix of human and machine intelligence and enactment. Early requirements analysis can thus probe
requirements about model accuracy, potential bias, concept drift without prior commitment to
implementation technology.</p>
      <p>i* was inspired by classical AI and multi-agent systems where actors are modelled as having ability
to achieve goals for each other [14, 15]. To accommodate the newly powerful machine-learning variety
of AI, new constructs such as the new types of dependencies suggested in Section 3 could be added so
that i* actors will cover the entire cognitive cycle. The actors in the extended i* will be actors with
learning ability, though not necessarily machine learning.</p>
      <p>The abstract actors will be mapped to physical agents for a more implementation-oriented analysis
taking into account the strengths and limitations of various classes of machines and humans with respect
to their learning and other abilities.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Acknowledgements</title>
      <p>We acknowledge the support of the Natural Sciences and Engineering Research Council of Canada
(NSERC). Constructive comments from anonymous reviewers and from S. Nalchigar, Z. Babar, and A.
Lapouchnian are gratefully acknowledged.</p>
    </sec>
    <sec id="sec-7">
      <title>7. References</title>
      <p>[11] K. Angelopoulos, A. V. Papadopoulos, V. E. S. Souza, &amp; J. Mylopoulos. Engineering self-adaptive
software systems: From requirements to model predictive control. ACM Transactions on
Autonomous and Adaptive Systems (TAAS), 13(1), (2018): 1-27.
[12] X. Franch, N. Seyff, M. Oriol, S. Fricker, I. Groher, M. Vierhauser, &amp; M. Wimmer. Towards
Integrating Data-Driven Requirements Engineering into the Software Development Process: A
Vision Paper. In International Working Conference on Requirements Engineering: Foundation for
Software Quality, Springer, Cham. 2020, pp. 135-142.
[13] E. S. Yu. Towards modelling and reasoning support for early-phase requirements engineering. In
Proceedings of 3rd IEEE International Symposium on Requirements Engineering, IEEE. 1997, pp.
226-235.
[14] A. Newell. The knowledge level. Artificial intelligence, 18(1), (1982):87-127.
[15] E. Yu. Agent orientation as a modelling paradigm. Wirtschaftsinformatik, 43(2), (2001):123-132.
[16] S. Nalchigar &amp; Yu, E. Designing business analytics solutions. Business &amp; Information Systems</p>
      <p>Engineering, 62(1), (2020): 61-75.
[17] L. Liu &amp; E. Yu. Designing information systems in social context: a goal and scenario modelling
approach. Information systems, 29(2), (2004): 187-203.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Laird</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lebiere</surname>
          </string-name>
          , &amp; P. S.
          <string-name>
            <surname>Rosenbloom</surname>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>A standard model of the mind: Toward a common computational framework across artificial intelligence, cognitive science, neuroscience, and robotics</article-title>
          .
          <source>AI Magazine</source>
          ,
          <volume>38</volume>
          .4 (
          <year>2017</year>
          ):
          <fpage>13</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kramer</surname>
          </string-name>
          .
          <article-title>Is abstraction the key to computing? Communications of the ACM</article-title>
          ,
          <volume>50</volume>
          .4 (
          <year>2017</year>
          ):
          <fpage>36</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Amershi</surname>
          </string-name>
          et al.
          <article-title>Software engineering for machine learning: A case study</article-title>
          .
          <source>In IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP) IEEE</source>
          <year>2019</year>
          , pp.
          <fpage>291</fpage>
          -
          <lpage>300</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Nalchigar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Yu</surname>
          </string-name>
          , &amp;
          <string-name>
            <surname>K. Keshavjee.</surname>
          </string-name>
          <article-title>Modeling machine learning requirements from three perspectives: a case report from the healthcare domain</article-title>
          .
          <source>Requirements Engineering</source>
          (
          <year>2021</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          . doi:
          <volume>10</volume>
          .1007/s00766-020-00343-z
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Giorgini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Maiden</surname>
          </string-name>
          , &amp; J.
          <string-name>
            <surname>Mylopoulos</surname>
          </string-name>
          . Social Modeling for Requirements Engineering: MIT Press,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Chung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.A</given-names>
            <surname>Nixon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Yu,</given-names>
            &amp; J.
            <surname>Mylopoulos</surname>
          </string-name>
          .
          <article-title>Non-functional requirements in software engineering</article-title>
          . Springer Science &amp; Business
          <string-name>
            <surname>Media</surname>
          </string-name>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Horkoff</surname>
          </string-name>
          .
          <article-title>Non-functional requirements for machine learning: Challenges and new directions</article-title>
          .
          <source>In IEEE 27th International Requirements Engineering Conference (RE) IEEE</source>
          .
          <year>2019</year>
          . pp.
          <fpage>386</fpage>
          -
          <lpage>391</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Selbst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Boyd</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Friedler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkatasubramanian</surname>
          </string-name>
          , &amp; J.
          <string-name>
            <surname>Vertesi</surname>
          </string-name>
          .
          <article-title>Fairness and abstraction in sociotechnical systems</article-title>
          .
          <source>In Proceedings of the conference on fairness, accountability, and transparency</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>68</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sothilingam</surname>
          </string-name>
          &amp;
          <string-name>
            <given-names>E. Yu. Modeling</given-names>
            <surname>Agents</surname>
          </string-name>
          , Roles, and
          <article-title>Positions in Machine Learning Project Organizations</article-title>
          .
          <source>In Proceedings of 13th Int. iStar workshop. CEUR workshop proceedings</source>
          Vol 2641.
          <year>2020</year>
          . pp.
          <fpage>61</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>T.</given-names>
            <surname>Madl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J.</given-names>
            <surname>Baars</surname>
          </string-name>
          , &amp;
          <string-name>
            <surname>S. Franklin.</surname>
          </string-name>
          <article-title>The timing of the cognitive cycle</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>6</volume>
          (
          <issue>4</issue>
          ),
          <year>2011</year>
          ,
          <year>e14803</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>