<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Castiglione della Pescaia, Italy, May</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Trade-offs in the Design of Multimodal Interaction for Older Adults</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gianluca Schiavo</string-name>
          <email>gschiavo@fbk.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michela Ferron</string-name>
          <email>ferron@fbk.eu</email>
          <email>mich@fbk.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ornella Mich</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nadia Mana</string-name>
          <email>mana@fbk.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FBK - Fondazione Bruno Kessler</institution>
          ,
          <addr-line>Trento</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <volume>29</volume>
      <issue>2018</issue>
      <fpage>15</fpage>
      <lpage>20</lpage>
      <abstract>
        <p>This position paper presents key aspects and trade-offs that designers, ICT and HCD practitioners might encounter when designing multimodal interaction for a specific target group: older adults. The paper draws together literature on multimodal interaction and assistive technology and presents a set of specific findings for older users. Building on these findings, we describe a number of challenges that should be taken into account when designing multimodal technology for this target group. Further reflections highlight the design trade-offs that such considerations might bring, presenting an overview of the design choices and their potential effects.</p>
      </abstract>
      <kwd-group>
        <kwd>Multimodality</kwd>
        <kwd>older adults</kwd>
        <kwd>design trade-offs</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Multimodal interfaces seek to combine multiple sensory input and output channels in
similar ways as in natural interaction. This similarity has led to the expectation that
multimodality in human-computer interaction can provide a more natural, robust and
flexible form of interaction with respect to more traditional input modalities such as
mouse and keyboard. [1, 2]. In this respect, multimodal human–computer interaction
has sought to provide not only more powerful and compelling interactive experiences,
but also more accessible interfaces to technological devices. Moreover, following the
principle of “design for all” and “inclusive design”, multimodal technology has been
proposed as a possible solution that allows users to use the modality matching their
preferences and/or needs, thus making the interaction more flexible [3]. However,
despite these potential advantages of multimodal interfaces, the literature reports
significant disadvantages. For example, different modalities may interfere with each
other and a synchronization problem might arise.</p>
      <p>Additionally, combining and coordinating more than one modality might also
require more effort for the users [4, 5] and a higher cognitive load [4]. Current research
provides findings supporting both assumptions reporting advantages [1, 2] as well as
disadvantages [4, 5]. This position paper has the goal to further advance the discussion
on this topic by presenting design trade-offs in multimodal technology when designing
technology for older adults, a large and heterogeneous user population.</p>
    </sec>
    <sec id="sec-2">
      <title>Multimodal Interaction for Older Users</title>
      <p>
        Multimodal interfaces have been considered to improve accessibility for a number of
users and usage contexts [3], including the diverse needs of older users [
        <xref ref-type="bibr" rid="ref7">6, 7</xref>
        ].
Multimodal systems can integrate a wider range of modalities (such as speech, writing,
gaze, touch or mid-air gestures) and better accommodate users’ preferences with
respect to unimodal interfaces. Furthermore, multimodal interfaces can be more
userfriendly for people who have little or no experience with common computer devices,
since they offer the possibility to use multiple interaction channels without relying on
a single source of input [3]. However, other studies point out that multimodality must
be carefully introduced since it might require more cognitive effort to coordinate
different input modalities (especially when more than two modalities are involved) and
additional physical demand [4]. This may become particularly relevant when
considering the cognitive and physical characteristics of older users [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        Numerous examples of multimodal technology for older adults can be found in the
research and on the market. For instance, social robots or telepresence technology are
two representative examples of multimodal systems believed to support older users [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Before listing the design trade-offs that these or similar interaction modalities might
bring in the technology, we summarize a list of design considerations.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Design Challenges and Recommendations</title>
      <p>
        Different recommendations for the design of multimodal interfaces for older adults
have been discussed in the literature (see for example, [
        <xref ref-type="bibr" rid="ref7 ref9">4, 7, 9</xref>
        ]) and are summarized
below.
      </p>
      <p>
        Diverse abilities. Multimodal systems provide users with the choice of using the most
efficient interaction modality among those proposed by the system. Moreover, users
can switch to another interaction modality after a recognition error has occurred in the
previous one [2]. However, this requires that the user knows which is the best modality
for her/him, or at least “intuitively” uses the best set of multimodal inputs.
Personalization. One-solution-fits-all models are inadequate as they do not consider
individual characteristics. The interaction and the interface should be made adaptable
and personalized to user preferences and device characteristics. Users should be able to
customize the multimodal channels they want to use for a given task in an application.
Interaction patterns. There are large individual differences in users’ multimodal
interaction patterns [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]: some individuals tend to integrate different modalities in a
simultaneous manner and overlap them temporally (simultaneous integrators), whereas
others tend to finish one mode before beginning the second (sequential integrators).
Older adults demonstrate either a simultaneous or sequential dominant integration
pattern [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ], however designers should be aware of such differences and multimodal
interfaces should accommodate individual interaction patterns.
Independence. Multimodal interfaces should empower older users to independently
interact with the technology, even when there is a specific impairment (for example
hearing loss or reduced sight). Multimodal interfaces can also contribute to seniors’
perceived independence [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], if they can empower the user to function independently.
Technology reliability. Users should be able to rely on the multimodal technology,
especially in the case of assistive technology. For this reason, multimodal processing
should be accurate and robust. However, since recognition algorithms are mainly
trained on data from non-older population, this might pose limitations on the
performance (e.g., due to vocal features characteristics of older adults for speech
recognition [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], or slower gesture speed for gesture recognition).
      </p>
      <p>
        Privacy and context of use. Context of use should be carefully considered when
designing multimodal technology [
        <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
        ]: older people have privacy and social
acceptability concerns about using some modalities in public spaces (as in the case of
speech commands or mid-air gestures [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]). However, one of the advantages of
multimodal interaction is the possibility of using one modality rather than the other
according to the specific context (e.g., gestures instead of speech commands in noisy
environments).
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Design Trade-offs</title>
      <p>When considering the recommendations mentioned above, practitioners and designers
might experience a number of design trade-offs, which are situations that involve losing
one quality or aspect of the design in return for gaining another quality or aspect. Table
1 presents a summary of design challenges and design trade-offs.</p>
      <p>
        Complexity vs Simplicity. Providing the user with the possibility to interact with more
than one modality might increase the interaction complexity. For example, it has been
showed that the oldest of older adults (80+) found some modalities or combination of
modalities too complex to use when using a multimodal tablet-based application [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
This trade-off might also affect the system usability: a technology that supports many
different modalities might increase in complexity and thus be less usable.
Personalization vs Customization. Multimodal interaction can be tailored to the
specific preferences or needs of the user. This process might end up in an
overpersonalization of the interaction, making it difficult to the user to discover or
experiment with alternative interaction modalities. There is indeed a trade-off between
personalization, where the system personalizes the interaction to the user, and
customization, where the user is in control of the customization process. The latter
allows the users to control the interaction, assuming that they know how and what
feature to control. The former gives control to the system without requiring an effort
from the user, but it heavily relies on system reliability and performance.
Independency vs Assistance. The cognitive effort required to older users for the
personalization and customization of system interaction may be avoided by allowing
other users to take care of the process. For instance, a multimodal technology could be
designed to be personalized by caregivers or therapists even though the final user is an
older adult (assistance). However, delegating actions to these users might further
increase the caregiver load and could be perceived as an additional demand or burden.
This might also decrease older adults independent use of the technology.
Automation vs Control. Multimodal sensors can be used as background controls, to
which the interface automatically adapts without any intentional and direct user
engagement. In this sense, a proactive system might come forward with suggestions, or
automatic responses, based on the sensed context and without engaging the user
(automation). On the other hand, a reactive system requires the user to initiate action
(control), which implies direct attention and control on the activity.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>This position paper has briefly presented some of the challenges and trade-offs that
designers, ICT and HCD practitioners might encounter when designing multimodal
interfaces. Even though the analysis focuses on older adults as target user group, most
of these challenges hold also for the wider user population.</p>
      <p>
        On one side, this paper presents a reflection on how to identify the design trade-offs
for multimodal interaction for older adults. On the other side, further discussion is
needed to provide practitioners with methods and processes for dealing with such
tradeoffs. In this direction, participatory design, co-design and value-centered design
approaches [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] can help designers to balance different (and sometime competing)
design choices. We think that further discussion on this topic can emerge from the
discussion during the workshop.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work is supported by the EU HORIZON project ECOMODE - Event-Driven
Compressive Vision for Multimodal Interaction with Mobile Devices
(http://www.ecomode-project.eu/), under Grant Agreement 644096.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Oviatt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Cohen,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <article-title>Perceptual User Interfaces: Multimodal Interfaces That Process What Comes Naturally</article-title>
          .
          <source>Commun. ACM</source>
          .
          <volume>43</volume>
          ,
          <fpage>45</fpage>
          -
          <lpage>53</lpage>
          (
          <year>2000</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Turk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Multimodal interaction: A review</article-title>
          .
          <source>Pattern Recognition Letters</source>
          .
          <volume>36</volume>
          ,
          <fpage>189</fpage>
          -
          <lpage>195</lpage>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Obrenovic</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abascal</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Starcevic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Universal Accessibility As a Multimodal Design Issue</article-title>
          .
          <source>Commun. ACM</source>
          .
          <volume>50</volume>
          ,
          <fpage>83</fpage>
          -
          <lpage>88</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Naumann</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wechsung</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hurtienne</surname>
          </string-name>
          , J.:
          <article-title>Multimodal Interaction: A Suitable Strategy for Including Older Users? Interact</article-title>
          . Comput.
          <volume>22</volume>
          ,
          <fpage>465</fpage>
          -
          <lpage>474</lpage>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Wechsung</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naumann</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          :
          <article-title>Evaluation Methods for Multimodal Systems: A Comparison of Standardized Usability Questionnaires</article-title>
          . In: André,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Dybkjaer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Minker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Neumann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Pieraccini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            , and
            <surname>Weber</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>(eds.) Perception in Multimodal Dialogue Systems</article-title>
          . pp.
          <fpage>276</fpage>
          -
          <lpage>284</lpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Himmelsbach</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garschall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steffek</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tscheligi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Enabling Accessibility Through Multimodality?: Interaction Modality Choices of Older Adults</article-title>
          .
          <source>In: Proceedings of the 14th International Conference on Mobile and Ubiquitous Multimedia</source>
          . pp.
          <fpage>195</fpage>
          -
          <lpage>199</lpage>
          . ACM, New York, NY, USA (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Munteanu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salah</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Multimodal technologies for seniors: challenges and opportunities</article-title>
          . In: Oviatt,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Schuller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.R.</given-names>
            ,
            <surname>Sonntag</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Potamianos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            , and
            <surname>Krüger</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (eds.)
          <article-title>The Handbook of Multimodal-Multisensor Interfaces</article-title>
          . pp.
          <fpage>319</fpage>
          -
          <lpage>362</lpage>
          .
          <article-title>Association for Computing Machinery</article-title>
          and Morgan &amp; Claypool, New York, NY, USA (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Fisk</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rogers</surname>
            ,
            <given-names>W.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charness</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Czaja</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharit</surname>
          </string-name>
          , J.:
          <article-title>Designing for Older Adults: Principles and Creative Human Factors Approaches, Second Edition</article-title>
          . CRC Press (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>McGee-Lennon</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolters</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brewster</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>User-centred Multimodal Reminders for Assistive Living</article-title>
          .
          <source>In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems</source>
          . pp.
          <fpage>2105</fpage>
          -
          <lpage>2114</lpage>
          . ACM, New York, NY, USA (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lunsford</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coulston</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wesson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oviatt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Modeling Multimodal Integration Patterns and Performance in Seniors: Toward Adaptive Processing of Individual Differences</article-title>
          .
          <source>In: Proceedings of the 5th International Conference on Multimodal Interfaces</source>
          . pp.
          <fpage>265</fpage>
          -
          <lpage>272</lpage>
          . ACM, New York, NY, USA (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Schiavo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mich</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferron</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mana</surname>
          </string-name>
          , N.:
          <article-title>Mobile Multimodal Interaction for Older and Younger Users: Exploring Differences and Similarities</article-title>
          .
          <source>In: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia</source>
          . pp.
          <fpage>407</fpage>
          -
          <lpage>414</lpage>
          . ACM, New York, NY, USA (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Vacher</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Portet</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rossato</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aman</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Golanski</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dugheanu</surname>
          </string-name>
          , R.:
          <article-title>Speechbased interaction in an AAL-context.</article-title>
          <issue>1-7</issue>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Neves</surname>
            ,
            <given-names>B.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Franz</surname>
            ,
            <given-names>R.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Munteanu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baecker</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : “
          <string-name>
            <surname>My Hand Doesn'T Listen</surname>
          </string-name>
          to Me!
          <article-title>”: Adoption and Evaluation of a Communication Technology for the 'Oldest Old</article-title>
          .'
          <source>In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems</source>
          . pp.
          <fpage>1593</fpage>
          -
          <lpage>1602</lpage>
          . ACM, New York, NY, USA (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Rico</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brewster</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Gesture and Voice Prototyping for Early Evaluations of Social Acceptability in Multimodal Interfaces</article-title>
          .
          <source>In: International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction</source>
          . pp.
          <volume>16</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          :
          <fpage>9</fpage>
          . ACM, New York, NY, USA (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Exploring design trade-offs for quality of life in human-centered design</article-title>
          .
          <source>interactions</source>
          .
          <volume>25</volume>
          ,
          <fpage>26</fpage>
          -
          <lpage>33</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>