<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>UTAssistant: a Web Platform Supporting Usability Testing in Italian Public Administrations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Giuseppe Desolda</string-name>
          <email>giuseppe.desolda@uniba.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rosa Lanzilotti</string-name>
          <email>rosa.lanzilotti@uniba.it</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giancarlo Gaudino</string-name>
          <email>antonello.cocco@mise.gov.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefano Federici</string-name>
          <email>stefano.federici@unipg.it</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>ACM Classification Keywords</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>H.5.2 Human-centered computing~Usability testing.</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>ISCOM-MISE</institution>
          ,
          <addr-line>Viale America 201, 00144 Roma</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Università di Bari</institution>
          ,
          <addr-line>via Orabona, 4, 70125 Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Università di Bari</institution>
          ,
          <addr-line>via Orabona, 4, 70125 Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Università di Perugia</institution>
          ,
          <addr-line>Piazza G. Ermini, 1, 06123 Perugia</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>18</fpage>
      <lpage>20</lpage>
      <abstract>
        <p>Even if the benefits of the usability testing are remarkable, it is scarcely adopted in the software development process. To foster its adoption, this paper presents a Web platform, UTAssistant, that supports people, also without skills in Human-Computer Interaction (HCI), in evaluating Web site usability.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        HCI literature proposes different methods for evaluating
usability of interactive systems. Among them, usability
testing is generally considered the most complete form
of evaluation, because it assesses usability through
samples of real users and permits to gather a wide
range of qualitative and quantitative data, whose
analysis allow to detect possible issues about the
system and the interface elements which cause
them [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        Although the benefits of this technique are
unquestionable, it is hardly adopted due to the required
effort and time needed to set up reliable user testing. A
frequently used technique is the thinking aloud in which
users are asked to verbalize their actions during the
interaction with the system to complete a set of critical
tasks [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. During the task execution, the evaluators can
collect quantitative data, e.g. task time/clicks, and
qualitative data, e.g. facial expressions and
externalized comments. In addition, questionnaires are
typically administered to evaluate the users’ perceived
usability. Then, all the collected data are analyzed to
evaluate the software usability with respect to the
administered tasks, thus producing statistics like the
task completion time or number of clicks to accomplish
tasks, task success rate, heat map of the user
behavior, questionnaire results and graphs, and so on.
To foster the adoption of usability testing technique, in
the past few years, there has been massive growth of
usability testing tools. For example, professional
platforms like Morae, Ovo Logger and User Testing
support HCI experts in most of the evaluation activities
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ][
        <xref ref-type="bibr" rid="ref11 ref9">9, 11</xref>
        ]. Despite their powerful features and
automatization of data gathering and analysis, their use
is limited by their price and/or the need to install them
on dedicate machines (e.g., Morae and Ovo Logger).
Simpler and cheaper tools are also available but they
typically cover part of the evaluation process, for
example the user behavior analysis obtained gathering
and elaborating the mouse movements/clicks (e.g.,
UsabilityTool) or the facial expressions and externalized
comments recording (e.g. Camtasia).
      </p>
      <p>This paper proposes a Web platform, called
UTAssistant, which assists evaluators, even without
skills in HCI, in performing usability testing.</p>
      <p>UTAssistant automatizes and simplifies the usability
testing process in all its phases, i.e. design, execution
and data analysis.</p>
    </sec>
    <sec id="sec-2">
      <title>UTAssistant</title>
      <p>
        UTAssistant is a Web platform designed and developed
within the PA++ Project, an initiative coordinated by
the High Institute of Communication and Information
Technology (ISCOM) scientific and technical body of the
Italian Ministry of Economic Development (MISE),
whose main goal is to promote the usability practices in
the Italian Public Administration (PA). In particular, the
PA++ (A Public Administration + mobile and + usable:
design and evaluation of Public Administration Web
sites) project involves a grouo of people that are
members of the GLU (Gruppo di Lavoro per l’Usabilità,
in English, Working Group on Usability). The GLU is
related to the Italian Ministry of Public Administration
and is working to support people of the PA website staff
in performing usability evaluation activities of their
websites and other e-government systems. Among the
different activities of the GLU, in May 2015, a new
version of the eGLU and eGLU-M Protocol 2.1 has been
published [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]; its aim is to guide web masters and web
editors, who do not have experience on usability and
UX evaluation, in performing usability testing of the
websites they work on, by committing limited resources
in terms of time and people.
      </p>
      <p>UTAssistant has been conceived to support remote
usability testing executed by using desktop and mobile
devices, from the test design to the analysis of the data
collected following the guidelines provided by the eGLU
2.1 and eGLU-M evaluation protocol. UTAssistant has
been entirely developed as Web platform. The idea is to
provide Italian PA with a lightweight and simple service
that does not require any installation on user devices or
special requirements. In fact, thanks to the use of the
recent evolutions of the HTML5 standard, the browser
gathers data from desktop and mobile peripherals like
webcam, microphone, mouse, keyboard, and so on.
From a technological point of view this is an important
step forward with respect to the state-of-the-art, which
can foster a wide adoption of this tool and,
consequently, of the usability testing technique.
Indeed, the existing software for usability testing,
require dedicated devices (e.g. Morae) or cover part of
the evaluation process and often do not support the
evaluation of web site used on mobile devices that
today represent the most common interaction device.
In the following sections, it is described how
UTAssistant supports evaluators in designing a usability
test and in analyzing the results, as well as how
participants are driven by UTAssistant in completing
usability tests.</p>
    </sec>
    <sec id="sec-3">
      <title>Test Design</title>
      <p>
        The first phase of a usability testing is its design. The
most important issue of this phase is the study creation
that mainly consists in defining a set of tasks users
have to complete during the test execution. After the
logging into the system, evaluators create a new study:
a wizard procedure guides them in designing the test
following three main steps. At the first step of the test
design, the evaluators have to define: a) general
information about the study (e.g., a title, a short
introduction), b) data UTAssistant has to gather while
participants carry out the tasks (e.g., mouse/keyboard
data log, audio/video from participant
webcam/microphone), and c) questionnaires
participants have to complete during/at the end of the
study. According to the eGLU protocol, UTAssistant
actually implements post-questionnaires like SUS [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
NPS [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], UMUX-Lite [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], USE [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Another
postquestionnaire, not mentioned by the eGLU protocols
but also implemented in UTAssinstant are: AttrakDiff,
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and the NASA-TLX questionnaire [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. After defining
the study general information, at the second step
evaluators define the task list. For each task, they have
to provide the title, the starting/ending URL, the goal,
the duration. Finally, the third step requires evaluators
to specify the study participants, by selecting them
from a list of users already registered in the platform or
by typing their email. Then the study has to be saved.
The invited participants receive an email describing all
the instructions to participate in the usability testing.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Test Execution</title>
      <p>A usability testing execution is typically managed by
one or more HCI experts, which assist participants in
carrying out the study tasks, gather data (e.g.,
externalized comments, task times/click) and
administer the questionnaires. UTAssistant aims to
automatize all the execution steps guiding participants
even without the physical presence of evaluators.
Once the participant receives an email, by clicking the
URL reported inside it, UTAssistant is open in a browser
on the desktop or mobile device. First, UTAssistant
shows general indications about the platform (e.g., a
short description of the toolbar with all the useful
commands) as well as the privacy policies indicating
that data like mouse/keyboard log data and
webcam/microphone stream could be captured. Then,
UTAssistant administers the tasks one at a time. For
each task, the platform shows the goal in a pop-up
window. A minimal toolbar is positioned on top of the
task web page reporting the current task goal and
description, the duration time, the task number, and
the buttons to go to the next task, or stop the study. If
the user is executing the last task, the “Next task”
button becomes “Fill in the questionnaire”. All the
steps, are completely assisted by UTAssistant that
simulates the presence of the HCI expert.</p>
    </sec>
    <sec id="sec-5">
      <title>Test Data Analysis</title>
      <p>One of the most time-consuming phase of a usability
testing is the data analysis. HCI experts have to store,
merge and analyze the mouse logs, the video/audio
recordings, the questionnaire results, and so on.
UTAssistant assists evaluators in performing these
activities, thus making easier and lighter the entire
analysis. UTAssistant provides different types of tools
for the automatic data analysis and to produce
information that facilitate the evaluators’ work in
detecting usability issues.</p>
      <p>
        One of the most important usability metric is the task
success rate, i.e. the percentage of tasks that users
complete correctly during the study [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. UTAssistant
creates a report showing the general success rate
(“Tasso di successo medio complessivo” in Italian as
shown at the bottom of Figure 1) and a table reporting
the details for each task (column)/user (row).
In addition, for each task UTAssistant shows: its title,
the number of pages the user visited to carry out it, the
number of notes evaluator made during the analysis,
the date and time in which the task starts and finishes,
the average time and the average number of clicks all
the users needed to complete the task.
      </p>
      <p>A different analysis of the mouse/keyboard logs is
provided by the heatmaps, i.e., graphical visualizations
overlapped to a web page that show where people
click/touch, move the mouse, scroll. It uses hotter
colors, e.g. orange or bright red, on areas that get the
most clicks/movements and colder colors, e.g. blues
and purples, on areas that get the less
clicks/movements. For each web page, UTAssistant
builds a heatmap that evaluators can analyze to
understand if some area of the web page wrongly
attracted the users’ attention due to some user
interface elements. Figure 2 shows an example of
mouse heatmap.</p>
      <p>UTAssistant automatically analyzes questionnaire data
and provides proper statistics and graphs to summarize
the results and to help evaluators in understanding the
perceived usability, cognitive load and so on.</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>This paper has presented a web-based platform, called
UTAssistant, developed in the project PA++, to support
people working in PA website staff in performing
usability tests. A preliminary evaluation of the
UTAssistant, involving 2 people acting as evaluators
and 6 people as participants, showed that evaluators
appreciated a lot the support to the evaluation
activities, while participants enjoyed the possibility to
participate in a user test, without being observed.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This work is supported by ISCOM under grant PA++ (A
Public Administration + mobile and + usable: design
and evaluation of Public Administration Web sites).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. eGLU and
          <string-name>
            <surname>eGLU-M Protocol</surname>
          </string-name>
          <article-title>2</article-title>
          .1. Retrieved from http://www.funzionepubblica.gov.it/glu.aspx Last Access June 4th,
          <year>2017</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Morae</surname>
          </string-name>
          . Retrieved from https://www.techsmith.com/morae.html Last Access June 10th,
          <year>2017</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Hart</surname>
            <given-names>S. G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Staveland</surname>
            <given-names>L. E.</given-names>
          </string-name>
          (
          <year>1988</year>
          ).
          <article-title>Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research</article-title>
          .
          <source>Advances in psychology 52(1)</source>
          ,
          <fpage>139</fpage>
          -
          <lpage>83</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Hassenzahl</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burmester</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>and Koller F.</surname>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>AttrakDiff: A questionnaire to measure perceived hedonic and pragmatic quality</article-title>
          .
          <source>In Proc. of the Mensch &amp; Computer</source>
          . 187-
          <fpage>96</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lewis</surname>
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Sauro</surname>
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>The Factor Structure of the System Usability Scale</article-title>
          .
          <source>In Human Centered Design - HCD</source>
          <year>2009</year>
          , Kurosu M.
          <source>Ed. Lecture Notes in Computer Science</source>
          , Vol.
          <volume>5619</volume>
          , Springer Berlin Heidelberg,
          <fpage>94</fpage>
          -
          <lpage>103</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lewis</surname>
            <given-names>J. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Utesch</surname>
            <given-names>B. S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Maher</surname>
            <given-names>D. E.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>UMUX-LITE: when there's no time for the SUS</article-title>
          .
          <source>In Proc. of the Conference on Human Factors in Computing Systems (CHI '13)</source>
          . Paris, France, ACM, New York, NY, USA,
          <year>2099</year>
          -
          <fpage>102</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Preece</surname>
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharp</surname>
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Rogers</surname>
            <given-names>Y.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Interaction Design-beyond human-computer interaction</article-title>
          . John Wiley &amp; Sons.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Reichheld</surname>
            <given-names>F. F.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>The one number you need to grow</article-title>
          .
          <source>Harvard business review</source>
          <volume>81</volume>
          (
          <issue>12</issue>
          ),
          <fpage>46</fpage>
          -
          <lpage>55</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Ovo</given-names>
            <surname>Logger</surname>
          </string-name>
          . Retrieved from http://www.ovostudios.com/ Last Access June 10th,
          <year>2017</year>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Terenzi</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Di</surname>
            Nocera F.,
            <given-names>and Ferlazzo F.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Firmitas, Utilitas, Venustas: Assessing the Validity of the Usability Evaluation (Us</article-title>
          . E.)
          <string-name>
            <surname>Questionnaire</surname>
          </string-name>
          .
          <article-title>Developments in Human Factors in Transportation,</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. UserTesting. Retrieved from https://www.usertesting.com/ Last Access June 10th,
          <year>2017</year>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>