=Paper=
{{Paper
|id=Vol-2450/paper6
|storemode=property
|title=Investigating Mechanisms for User Integration in the Activity Goal Recommendation Process by Interface Design
|pdfUrl=https://ceur-ws.org/Vol-2450/paper6.pdf
|volume=Vol-2450
|authors=Katja Herrmanny,Simone Löppenberg,Michael Schwarz
|dblpUrl=https://dblp.org/rec/conf/recsys/HerrmannyL019
}}
==Investigating Mechanisms for User Integration in the Activity Goal Recommendation Process by Interface Design==
Investigating Mechanisms for User Integration in the Activity
Goal Recommendation Process by Interface Design
Katja Herrmanny Simone Löppenberg Michael Schwarz
katja.herrmanny@uni-due.de michael.schwarz@uni-due.de
University of Duisburg-Essen University of Duisburg-Essen University of Duisburg-Essen
Duisburg, Germany Duisburg, Germany Duisburg, Germany
ABSTRACT interface alternatives and an evaluation regarding their potential
In the field of physical activity recommendation, we have to deal to integrate the user in the process in the way described above. We
with many confounding variables that lead to high result uncer- further investigated understanding, usability, and aesthetics which
tainty. Assuming that users’ competence is an essential factor for re- are relevant factors for user engagement.
duction of the problem of inaccurate recommendations, we present
and evaluate an approach on how to integrate users in the recom- 2 RELATED WORK
mendation process. We investigate if and how interface element Recommender systems intend to support users in the decision mak-
design can contribute to understanding, reflection and modification ing process based on their preferences and needs [5]. There has
of the recommendation result. In the work described here, we use been a lot of research focusing on the prediction accuracy [5]. How-
interface elements that allow for planning of physical activity goal ever, recently it has been shown that accuracy of the algorithm
striving. Results show that such interface elements can principally influences the user experience only partially [13] and that the key
empower users, support recommendation reflection and stimulate to success are the functions provided by the user interface of rec-
user interaction with the recommendation. ommender systems [10]. User interface design and dialogs affect
usability, acceptance, item rating behaviour, selection behaviour,
KEYWORDS trust, and willingness to buy and reuse the system [5]. In order to
user integration, recommendation, goal setting, user empowerment, improve recommender systems and user satisfaction, it is beneficial
user interface, activity tracking to provide users with the opportunity to interact with recommenda-
tions and to make adjustments if needed [12]. However, it is often
1 INTRODUCTION AND SCOPE OF THE not possible to provide feedback to the system, which is impor-
PAPER tant to adapt its assumptions about the user [12]. In the context
of rating based recommender systems (e.g. movies, music), it has
Besides algorithm accuracy, design of user interfaces is an impor- also been shown that interactive recommender systems are advan-
tant component of recommender systems, that has gained more tageous since they can factor in changed user interests over time
and more interest in the last years [10, 13, 21]. Especially health- or corrections to previously made mistaken ratings [11]. Yet most
related personalised recommendations have to deal with many recommender systems consider user ratings as always correct [11].
confounding variables, that are unknown to the algorithm or not The authors [11] therefore suggest to support user interaction with
quantifiable and thus difficult or even impossible to be considered in the recommendation by allowing an adjustment of previous rat-
the recommender’s reasoning [8]. Due to this and other aspects like ings. This would provide explicit feedback to the system, instead
autonomy issues, integrating the user in the process is an essential of implicit feedback, which is typically done by monitoring users’
part of health-related recommender systems [8]. However, in this behavior [3].
field little research has been done to investigate how the user could In some contexts, giving explicit user feedback requires an un-
be integrated. In this paper, we investigate how to design a user derstanding of the recommendation. This can be supported by
interface to integrate the user in the recommendation process of explanation. Explanation interfaces are used in different fields -
physical activity goals by: such as expert systems, medical decision support systems, intelli-
• empowering users to understand the recommendation and gent tutoring systems, data exploration systems, and recommender
its implications, systems [18]. By explaining the recommendation result, they aim
• reflecting and evaluating the recommendation, and at providing transparency and, in consequence, trust and user ac-
• providing the opportunity to actively manipulate it. ceptance [18]. Such explanation is also termed user empowerment.
In our approach, empowerment and reflection are mainly achieved Empowerment can also be found in the health sector, especially
by the planning of recommendation realisation, which helps the under the concept of patient empowerment [1, 7, 14]. In this do-
user to assess whether the recommended goal is realistic or not. It main, empowerment includes knowledge transfer and persuasion.
also supports the user in appropriate modifications. We present two Following Kondylakis et al. [14], patient empowerment is achieved
through the accessibility of information (e. g. the opportunity to get
Copyright ©2019 for this paper by its authors. Use permitted under Creative Commons information on the internet). This is in line with Alpay et al. [1] who
License Attribution 4.0 International (CC BY 4.0).
state that the term empowerment “is frequently used to describe a
IntRS ’19: Joint Workshop on Interfaces and Human Decision Making for Recommender
Systems, 19 Sept 2019, Copenhagen, DK situation where patients are encouraged to be active in their own
health management”. Regarding the effect of empowerment, it has
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
been shown that empowerment is provided by e-health applica- context data of the specific user. As described above, recommenda-
tions and can positively influence patients’ long-term health status tions in this field are very error-prone because of large variations
[7]. Specifically for goal recommendations in activity tracking ap- over time and a large number of confounding variables indeter-
plications, it has been shown that users wish to get explaining or minable for the algorithm. To meet this challenge we propose to
illustrating information [9]. One approach to providing such infor- integrate the user in the recommendation process. Therefore, it is
mation was to illustrate the impact of the activity goal by reference necessary (a) to empower users to understand the recommenda-
routes [9]. Doing so, users should get a better sense of the amount tion and its implications, (b) to reflect and evaluate it and (c) to
of activity of the selected goal. In summary, in the health sector, provide adequate opportunities to actively manipulate it. To reach
empowerment includes knowledge transfer and persuasion. [19] dis- this aim, we used two main strategies. The first one is transparency.
tinguish between empowerment and persuasion. They use the term By showing the uncertainty of the algorithm, we want to make
empowerment to describe explanation interfaces in health recom- the user aware of the necessity of his/her influence. The second
mender systems. In a literature review they found that "empowering strategy is implementation planning to illustrate the impact of the
the user by interactively guiding his decision, and creating trust, recommendation. The specific (exemplary) user interface elements
using [...] explanatory interfaces" are relevant concepts in health we designed, are described in the following.
recommender research. Following Schäfer et al. [19], in health rec- Firstly, we presented the recommendation - which is a numerical
ommender systems, empowerment is achieved through explanation value on a continuous scale, in contrast to discrete items like in
of the internal logic of the recommender system, which they claim conventional recommender systems - as a range on a modifiable
to be one of the key challenges for health-related recommender slider. The recommendation with the highest probability to be the
systems. Our understanding of empowerment goes even beyond. best fitting one, is used as default value. We further added a colour
Beside explanation of the recommender’s reasoning, empowerment gradient to the slider indicating uncertainty regarding the system’s
as we define it also includes transferring domain knowledge and recommendation (for different values of the range). Presenting a
explaining of the recommendation’s implications. range instead of a single value and indicating uncertainty of the
algorithm should support users in recommendation interpretation
3 CONTRIBUTION by understanding that the recommendation is a non-exact one that
needs to be reflected and probably to be adjusted. The slider is
To overcome the shortcomings of algorithmic recommendations,
intended to encourage and enable users to adjust it.
we (as well as other researchers mentioned in the previous section)
Kilocalories (kcal) were used to present the recommendation,
propose to integrate the user in the recommendation process to
which is, besides the number of steps, one of the most common units
achieve better results.
for measuring physical activity. The advantage of that approach
In contrast to previous approaches, in this work we don’t focus
is that all kind of physical activity can be subsumed in this unit.
on textual explaining elements. Instead, we investigate, if and how
However, it is problematic that the recommendation may be very
interface design can be used to integrate users in the recommenda-
abstract for the user, especially as kcal are more common in the
tion process.
field of nutrition. So secondly, we converted the recommendation
Our idea is to
to a unit that is more intuitive and better to interpret by the user.
• support user empowerment by interface elements that ex- Therefore, it was converted to the time needed to be active in three
plain the impact of the recommendation, different intensity levels (low, moderate, high) in order to achieve
• support recommendation reflection by interface elements the weekly goal. The reasons for dividing the goal into different
that indicate inaccuracy as well es elements that explain the activity levels are explained in section 5. The relation between the
impact of the recommendation and , three activity levels could be modified by the user (which leads to
• support user engagement by interface elements that allow for more time needed to achieve the goal if the user reduces the more
manipulation of the recommendation and its implementation intensive activity and increases the low-intensive one and vice
planning. versa). By giving the user a sense of the amount of time needed for
goal striving and thus for the difficulty of the goal, the user should be
In order to investigate the general potential of our idea to foster
empowered to understand the implications of the recommendation
user integration through interface design, we designed exemplary
and be encouraged to reflect it.
interface elements, which are described in detail in the following
In addition to this, we thirdly allowed for a more detailed plan-
section. This work does not focus on the evaluation of the interface
ning to give an even better sense of how difficult it is to achieve the
elements itself or the app they are framed in. They are just used as
chosen goal in daily life. The interface therefore provides the user
tools to investigate our research questions.
with the opportunity to plan, how to distribute the required time
In summary, this paper contributes by investigating the potential
to different activity units. This also aims at fostering reflection of
of interface design to support user integration in the recommenda-
the goal and, as a consequence, encouraging users to modify it.
tion process for the purpose of improving the recommendation.
4 APPROACH 5 MOCK-UPS
The scenario of our work is a physical activity tracking app, provid- We designed mock-ups with three main interface elements which
ing recommendations for an appropriate (i.e. challenging, but not we implemented for an android application. The targeted appli-
overburdening), numerical weekly activity goal based on user and cation should be an activity tracking app, which recommends a
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
physical activity goal considering user and context parameters. The regarding intensity level. Below there is a table in which
goal unit should be consumption of kilocalories per week. The three each row represents an activity unit (for each intensity level)
interface elements are: and each column represents 10 minutes. By tapping on the
buttons (+ -) on the left and on the right, the duration of the
(1) Goal selection element (see figure 1 (a)): The first element activity unit can be modified, whereby the respective cell is
is a slider. The slider range represents the recommendation coloured. By using the buttons (+ -) below the table, a new
range and the default slider position represents the most rec- activity unit (row) can be added.
ommended value, i.e. the value with the highest probability
to be the most suitable one. For the study, the goal is initially
equipped with a default value of 1400 kcal. As this study
focuses on the interface and not the algorithm, this recom- Table 1: Intensity categories with corresponding MET values
mendation is the same for all participants to exclude this
as a confounding variable. (The recommendation algorithm Intensity MET Example Activity
is not part of this paper and thus described elsewhere.) A
colour gradient at the slider range indicates the probabilities low 3 slow walking, walking downstairs, golf
resulting from the recommendation algorithm. The user has moderate 5 walking, weight lifting, dancing
the opportunity to modify the goal within the given range. high 7 running, playing soccer
(2) Conversion into time element (see figure 1 (b)): The sec-
ond element converts the kcal goal into amount of time,
which is less abstract. As time needed to consume a certain The separation of the activity into low, moderate and high inten-
amount of kilocalories strongly depends on the intensity of sity is done in order to convert the goal into time and thus better
the performed activity, it is necessary to select which ratio estimate the calorie consumption and check if the selected goal is
of different activity levels will be performed. Otherwise the realistic.
conversion into time would be far too inexact and thus not The distribution into activity units should help the users to get a
helpful. To get an approximation without cognitively over- feeling for their own activity behaviour and to integrate activities
burdening the user, we provided three intensity categories into their everyday life through precise planning. This also serves as
used in the health sector, which can be seen in table 1. For a supporting element to check whether the selected goal is realistic
calculation we used an average MET value of each category, or not.
which is also presented in table 1. MET (= metabolic equiva- The three elements of the page (weekly goal, conversion into
lent of task) is a unit to indicate the intensity of an activity time, activity units) are interdependent. When the weekly goal
and can be converted to calories and vice versa. Depending is increased or decreased, the amount of time to be distributed
on the activity category, more or less active time is needed among the intensity levels also increased or decreased. This has
to achieve the goal. Different exhausting activities or sports an immediate effect on the minutes of activity units to be planned,
fall into different categories (see table 1). We enable users to which are adapted to the intensity distribution. The application can
specify, which amount of each intensity level they plan to provide helping information for each area if required.
do to achieve the goal. This is done by an adjustable circular Especially for reasons of reflection, we assumed that it might be
seekbar (circle), which allows for modification of the inten- helpful to show all interface elements on one single page. When
sity types portions within the circle. The default state in our users modify their goal, they immediately see the corresponding im-
study is 1/3 for each part of the circle, i.e. for each intensity plications on the other planning steps, and thus can better evaluate
level. Within the circle, the selected goal is shown and how which amount of adjustment is adequate. On the other hand, space
much of the goal has already been distributed among the is limited on smart-phone displays. Layouts are quickly overloaded
intensities. If the circle is filled, all kilocalories of the goal are which could result in information overload for the users. This is
planned. Below the circle, the derived duration in minutes presumably not helpful in the reflection process. Thus, we decided
of each intensity level is shown. Three different icons repre- to design two interfaces with mainly the same elements. The first
sent the different intensity levels (walking for low intensity; one is scrollable and presents all described interface elements on
cycling for moderate intensity; running for high intensity). a single page. The second one presents them on successive pages
(3) Activity unit planning element (see figure 1 (c)): The third with the opportunity to navigate between the pages.
interface element consists of input elements for planning
units of activity for each intensity level. For example, if the
user has chosen to do 60 minutes of high-intense activity 5.1 Interface 1: Single Page
(in interface element 2), those could be distributed to one Figure 1 shows the single page interface. At the top of the page,
exercise or activity unit of 40 minutes and one of 20 minutes. the goal selection slider is presented. The conversion into time
10 minutes is the minimum duration. By interacting with is presented below the goal selection element with an adjustable
the circular seekbar or by tapping on the icons below, it is circle as described above. Below the circle, the resulting duration in
possible to switch between the activity units of low, moderate minutes of each intensity unit is shown. The third interface element,
and high intensity. Below the heading it is presented how which is the planning of specific activity units, is presented on the
much time remains to be distributed to activity units for the bottom of the page.
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
Figure 2: Multiple Pages Interface (translated from German)
We evaluated both interfaces with respect to our research ques-
Figure 1: Single Page Interface (translated from German) tions in a laboratory setting with a mixed-method design. A between-
with (a) goal selection, (b) conversion into time, (c) activity subject design was chosen, in which the participants were randomly
unit planning assigned to one of two groups. Depending on the group, one of
the two interfaces was presented to the participants and they were
given the following tasks:
5.2 Interface 2: Multiple Pages First, we asked the participants to imagine to use an activity
In addition to the single page interface a second one was designed tracking app and presented them the interface in order to explore
and implemented. This interface presents the three areas on succes- it. They were asked to select an activity goal for the next week
sive pages (Figure 2). The first page contains the weekly goal with based on the (simulated) system recommendation. Meanwhile, they
the slider for goal adjustment. The heading of this section has been should describe their thoughts and impressions with the think
changed and formulated as a request (“Choose your goal for the aloud method [6], which was recorded. Interaction behaviour was
next week”). In addition, a visualisation below the slider converts observed and documented. Afterwards, an online questionnaire
the calorie goal to minutes of activity for each intensity level. Above was presented to the participants to collect quantitative data re-
the button “Intensity Distribution”, which announces the next page, garding the perception of the interface. In the online questionnaire
there is also a short explanation for the next page’s content and why we assessed usability, user experience and aesthetics. Therefore,
it is needed. This page’s content is identical to the content of the the User-Experience questionnaire (UEQ) [15], the System Usabil-
intensity distribution area in the single page interface. The circle ity Scale (SUS) [4] , the Visual Aesthetics of Websites Inventory
can be used to increase and decrease the intensities. The kcal unit (VisAWI) [20] and the After-Scenario Questionnaire (ASQ) [16]
is converted into a temporal unit accordingly. Here, too, the next were used. Finally, a semi-structured interview was conducted to
step of the display is briefly explained before the button “Activity get the participants’ opinion about the interface and its supportive
Units”, which leads to the following page. The last page contains the potential. The study took approximately 30 - 45 minutes.
input elements for the planning of specific activity units. Due to the
increased amount of space compared to the single page interface, 7 RESULTS AND IMPLICATIONS
all intensity types can be listed at the same time, and don’t have to In total, 27 persons (group 1: n=13; group 2: n=14) participated in
be switched. Besides that, the interaction opportunities with the the study. They were aged 19 to 67 years (M=30.32, SD=17.19). 23 of
elements remain the same. them stated that they were interested in apps for physical activity
support and 12 already had experience in using them.
6 EVALUATION
The main aim of the evaluation was to investigate if the above 7.1 Behaviour Observation and Think Aloud
described mechanisms and interface elements help to: We used behaviour observation and the think aloud method to
• empower users to understand the recommendation and its objectively assess empowerment, reflection and appropriateness
implications, of manipulation elements. Therefore, interaction behaviour and
• reflect and evaluate the recommended goal, and the comments made during the interaction were documented, tran-
• provide the opportunity to actively manipulate it. scribed and analysed.
Moreover, we investigated if these supportive functions depend The think aloud results indicate empowerment and reflection
on the apps presentation mode (single or multiple pages). Further processes. When initially interacting with the goal selection ele-
aspects of interest concerned user experience, usability, and aesthet- ment, as expected, some participants (n = 9) mentioned problems
ics. All of these aspects can influence whether the user is willing to in estimating, which number of kcal would be an adequate goal.
interact with the interface or not. For example, one participant said:
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
"I have no idea. 1000 kcal sounds good." (participant participants (n = 19) seemed to understand the intensity distribu-
22; multiple pages interface) tion without help of the supervisor, eight asked for help. Regarding
Later, when seeing the conversion into a temporal unit, she the activity units (interface element 3), 15 participants asked for
commented: help. After hinting at the help texts integrated in the interface, help
"It’s quite convenient, that it is given how much [time] was provided, if still necessary. The goal selection element was
that would be." (participant 22; multiple pages inter- explained to two participants, the conversion into time with the
face) intensity distribution to seven participants and the activity units to
14 participants. Despite problems with usability and understanding,
Another participant first commented:
the elements were commended in their function and meaning.
"No idea. I don’t know what’s average." (participant
24; multiple pages interface)
7.2 Interview
Later on when interacting with the activity units, she reflected:
For the interviews, we used the qualitative content analysis [17].
"I guess I could have chosen more in total. As it is, I The interviews were transcribed, afterwards the statements were
have to do very little per day." (participant 24; multiple clustered and analysed. Due to recording problems, one data set
pages interface) is missing, so that the interview section is based on 26 data sets.
Transferring kcal into a temporal unit (intensity distribution In the following, the results are presented and enriched by a few
element), helped to understand the recommended goal and reflect exemplary translated statements. Since the two interface versions
whether it is realistic or not. The aspect of interpretation of the do primarily differ in the distribution of the elements on different
kcal unit and the supportive potential of the conversion into time pages, the results are mainly presented for both interfaces simulta-
is also explicitly addressed in the interview (see below). neously.
Also the order of interaction with the interface elements was
analysed to objectively assess reflection. A non-linear order of in- 7.2.1 Overall Satisfaction. The overall evaluation of the partici-
teraction and revision of the choices previously made indicates that pants was positive for both interfaces. In general, the interfaces
following elements fostered reflection of the selected goal planning. were rated positively.
A linear order does not provide information about whether the "It’s quiet good. I usually use a pedometer. There it
recommendation was reflected (but intendedly not modified) or [goal planning] is just implicit. Here I can exactly plan
not. The observed order of interaction with the interface elements how much I can make a day, [...] do I just want low
was the same with both interfaces. The majority of the participants intensity or modify and increase it." (participant 7;
completed the task in linear order (n=21). Seven persons (single single page interface)
page: n = 4; multiple pages: n = 3) operated with the elements in
non-linear order and made adjustments to elements already used 7.2.2 Understanding and Supportive Potential. As an important fac-
during processing. The order of interaction with the elements 1 tor for the supportive potential of the interface elements, we asked
(goal selection element), 2 (conversion into time element) and 3 participants if they understood the dependencies of the three inter-
(activity unit planning element) can be seen in table 2. face elements. In order to objectively evaluate the understanding,
we further asked them to explain these dependencies. In most cases
Table 2: Order of Interaction with Interface Elements (n = 25), the dependencies between weekly goal, conversion into
time and activity units were explained correctly. However, some
Participant Interface Interaction Order participants reported initial comprehension problems.
04 multiple 1→2→3→1→2→3→2→3 "At the beginning you are thrown in a bit, but that
07 single 1→2→3→2→3→2→3 is actually okay, because it is so clearly structured."
08 multiple 1→2→3→2→1→2→3 (participant 13; single page interface)
11 single 1→2→3→2→3 Although we did not explicitly ask for it, the main idea that the
12 multiple 1→2→3→2→3 goal could be adjusted was mentioned positively.
24 multiple 1→2→3→2→3 Regarding the first element, we asked the participants if they
25 single 1→2→1→2→3 found the unit kilocalories meaningful. As stated above, we ex-
pected that kilocalories might be too abstract for the users. But,
To evaluate if the provided opportunities to manipulate the rec- since it is the common unit for activity tracking, it was known by
ommendation are appropriate, we analysed correctness of handling all participants. However, eight of them (single page interface: n =
as well as reported and observed difficulties: During the interac- 5; multiple pages interface: n = 3) reported problems in interpreting
tion with both interfaces, the first interface element (goal selection it as they were not used to the unit. For instance, they asked for
through slider) was clear for most participants. Two participants reference values that would have been helpful. Although there were
had general questions of understanding and stated that it could not also many participants who did not report problems with the kilo-
be set precisely. The time unit was clear, also. The only interpre- calories unit, all except one participant agreed that the conversion
tation problem that occurred, was that one participant asked for a into time (interface element 2) helped to interpret and estimate the
conversion from minutes into hours. Some participants were con- goal. The separation of activities into different intensities was not
fused by the different activity levels. However, the majority of the easy to understand for 6 persons. Others liked this aspect:
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
"I liked that you could divide it into intensive, moder- for physical activity support and 50% also had experience in using
ate and low activity, because you could split it up even them.
if you were not an athletic kind of person. You can
see that you can achieve a lot even by just climbing Table 3: Descriptive Results of the Single Page Interface
stairs or going for a walk." (participant 20; multiple
pages interface) Single Page Multi Page
M SD M SD
With regard to the third interface element - the planning of
UEQ attractiveness 1.58 .66 1.80 .60
activity units - most participants (n = 20) found it helpful. For
UEQ perspicuity 1.91 .94 2.08 .49
example one participant said:
UEQ efficiency 1.92 .86 1.36 .56
"[...] It allows a bit of back and forth planning." (par- UEQ dependability 1.34 .71 1.52 .75
ticipant 11; single page interface) UEQ stimulation 1.11 .62 1.30 .44
Three of them even wished to have further functionalities: One UEQ novelty 1.11 .73 1.17 .59
participant wished to assign the selected units to concrete activ-
VisAWI 5.85 .38 5.78 .63
ities, two would like to assign them to specific days. A calender
VisAWI simplicity 6.00 .51 5.69 .63
function was also desired by further participants when directly
VisAWI diversity 5.07 .71 5.45 .77
asked for additional functionality in another interview question.
VisAWI colourfulness 6.41 .45 6.05 .99
Three of the participants who found the planning of activity units
VisAWI craftsmanship 6.09 .38 6.05 .66
helpful, reported (initial) problems in understanding. Four partic-
ipants (tendentially) did not find the activity units helpful. Three ASQ 3.00 1.38 2.45 1.66
of them reported problems in understanding. Two participants did SUS 77.27 13.80 80.69 10.43
not make a precise statement whether the unit planning element
was supportive for them or not.
7.3.1 UEQ. Both interfaces have a good UEQ score (see table 3). A
7.2.3 Usability. Despite positive usability ratings (see section 7.3) value higher than .80 is an indicator for a positive rating, while a
some usability problems that occurred when operating with the value lower than -.80 is an indicator for a negative rating. Results
interface elements, could be identified in the interview. In general, of the multiple pages interface are descriptively higher in all sub
difficulties in operation were often (n = 13) mentioned. scales, except for the result of efficiency. We tested for significance
"What bothered me was that sometimes it was hard with a t-test for the UEQ perspicuity scale and - due to missing
to select the things". (participant 11; single page inter- prerequisites (normal distribution or variance homogeneity) - a
face) Mann-Whitney-U-test for the remaining subscales. There were no
Although there were many positive responses to the circular seek- significant results.
bar that was used for intensity distribution, four participants had
7.3.2 VisAWI. For the VisAWI there are no reference values for
operating problems when moving the pointers. Regarding the plan-
interpretation given by the authors. They are stating, that lower
ning of activity units (third interface element), it was still desired
values imply a negative rating and higher values a positive one. In
that the addition and removal of activity units should not only be
this case, ratings from 1 to 3.5 are interpreted negative and ratings
possible by using the buttons, but also by touching the respective
from 3.5 to 7 positive, since the scale is from 1 to 7. In general, the
units.
single page interface (M=5.85, SD=.38) as well as the multiple page
7.2.4 Aesthetics. Also the aesthetics were rated predominantly interface (M=5.78, SD=.63) were rated positive (table 3) and do not
positive for both interfaces. The colour and graphical presentation differ significantly from each other (Mann-Whitney-U-Test: U =
of the page was particularly rated as outstanding. 57.700, p = .847). The same applies for the subscales.
"The colours fit well together. It is not obtrusive or 7.3.3 ASQ. For the ASQ questionnaire, there were also no refer-
boring". (participant 19; single page interface) ence values given for interpretation. The scale ranges from 1 (very
However, other participants found the colour design to be too positive) to 7 (very negative). As before, for interpretation we split
uniform (n = 3). Especially the circular seekbar was criticised for the scale. Values from 1 to 3.5 are interpreted positively and values
being too similar or boring. from 3.5 to 7 negatively. As presented in table 3, the single page
"The colours were too similar that you had to look interface has an average of 3.0 (SD=1.38) and the multiple pages
exactly what you had just selected." (participant 12; interface of 2.45 (SD=1.66). Both values are therefore evaluated
multiple pages interface) positive, whereby the multiple pages interface performed better.
However, the differences are not significant (Mann-Whitney-U-Test:
7.3 Online Questionnaire U = 41.000, p = .217).
With an online questionnaire we assessed user experience, aes- 7.3.4 SUS. The SUS score can have a range from 0 to 100. Accord-
thetics and usability. Due to extreme values, identified by box plot ing to Bangor et al. [2] the single page interface was rated “good”
diagrams, some participants were excluded from the analysis of and the multiple pages interface “excellent” (see table 3). Therefore,
the questionnaires. 22 participants (group 1: n=11; group 2: n=11) the multiple pages interface had a better result, what however did
remained. 91% of them stated that they were interested in apps not appear significant (Mann-Whitney-U-Test: U = 54.500, p=.699).
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
8 DISCUSSION Some understanding problems occurred. As explanation was
Three main interface elements were designed and evaluated in two given during the study, this should not confound the results. Prob-
slightly different interfaces. The three elements are 1) an adjustable ably, these problems would partially be solved through extended,
slider showing a recommendation range together with an indication unobserved exploration in real-world settings. As already suggested
of uncertainty, 2) conversion into time depending on activity level, above, functionality and exploration of the interface could be en-
and 3) activity unit planning. They were designed to empower hanced week by week. Additionally, we addressed understanding
users to understand the recommendation and its implications, to problems in the re-design (see section 9) as far as possible.
reflect and evaluate it, and to provide the opportunity to actively The questionnaires UEQ and VisAWI addressed the aesthetics
manipulate it. In the following, we discuss if and how the interface of the interfaces and showed positive ratings. Regarding usability,
elements reached those aims. ASQ ratings were positive in tendency and very positive values
were achieved in the SUS. In the interview, the colours were also pre-
8.1 Comparison between the Interfaces dominantly commended, whereas a few people felt that the colour
design was too uniform. However, the interview and interaction
Results do not show any significant differences between the two showed that some participants had problems with the operation, e.g.
interfaces in interaction behaviour, feedback regarding the per- with the exact setting of the goal selection element or with the selec-
ceived support of the interface elements, usability, user experience tion of elements, which can be corrected by technical adjustments.
or aesthetics. Returning to previous interface elements to make In total, overall ratings for usability, user experience and aesthetics
adjustments, was assumed to be more difficult for navigation on are good. This is an important precondition for whether the user is
different pages in the multiple pages interface. However, since the willing to interact with the interface or not. This precondition can
order of interaction does not vary between the interfaces, both be seen as fulfilled for the study and the interfaces.
arrangements of elements (single page interface: on one page, mul-
tiple pages interface: on more consecutive pages) seem possible.
Consequently, whether those interrelated elements are arranged on 8.3 Interface Element 1: Goal Selection
single or on multiple pages doesn’t seem to be an influencing factor All participants made use of the slider, which indicates that a modi-
for recommendation reflection and modification. As all results are fiable value within a range seems more appropriate than one single
independent from presentation mode, we will subsequently discuss recommended value. Think aloud comments show, that they re-
them without separation between the two interface variants. flected, what would be an appropriate goal. However, as expected
and in line with the interview results, the limited range and the
8.2 Overall Interface default value alone do not seem to sufficiently support users in
evaluating, what an appropriate value would be. Additional em-
All participants interacted with all interface elements. This indi-
powerment is needed here (see 8.4). One participant suggested a
cates that they were perceived as helpful and have the potential to
reference value to better interpret the recommended goal. This is
integrate users in the process. As a limitation, it can not surely be
surprising as we intended the goal range and the default value (and
said, which amount of interaction was fostered by the study situa-
the colour gradient) to serve as such a reference. At least this one
tion and which by the interface design itself. However, participants
participant does not seem to interpret it in the intended way. As
were not explicitly told to interact with each interface element. On
participants did not seem to pay attention to the colour gradient
the one hand, the study situation could have fostered interaction
indicating the certainty of the result, this could not help in terms
with the elements. On the other hand, it is likely, that users of an
of empowerment and reflection, which contradicts former research
activity tracking app in a real-world setting (compared to a study
[8]. One reason might be that the interface contained more ele-
setting) are more intrinsically motivated to interact with the system
ments than the interfaces used in the cited literature and therefore
and choose the most adequate goal. Moreover, in real-world settings
the focus of participants was different. Another reason might be,
there is much more time to get used to the system and its interface
that in the presented work the colour gradient was from light blue
elements. Exploration of and interaction with the interface could
to dark blue. In the cited study, the gradient had a colour coding
be enhanced week by week.
with the colours red, yellow and green.
It has been shown that most people interacted with the elements
in a sequential order. For the single page interface this means that
the elements were operated from top to bottom and for the multiple 8.4 Interface Element 2: Conversion into Time
pages interface that the elements were operated page by page. Pos- Think aloud, behaviour observation and interview results all in-
sible reason are (1) that they were satisfied with the initially chosen dicate, that conversion into a temporal unit strongly empowered
value, or (2) that they considered their study task to be completed users to better estimate and reflect what is an appropriate goal
when testing and understanding all elements and did not see the value on the recommended range. For some participants this re-
need to actually find an appropriate goal in the study setting, or (3) flection lead to revision of the initially chosen goal. Conversion
that for those people reflection was not stimulated sufficiently to into time goes along with separation into different activity levels,
lead to interaction. However, more than 20 percent of the partici- which was difficult to understand for some participants and thus
pants returned to previous elements during the interaction process lowered the intended empowerment. Unfortunately, this separation
to make adjustments. Stepping back from one interface element is unavoidable, as the conversion otherwise would have been far too
to another, indicates that exploration and interaction has not only inexact and not meaningful anymore. However, results show that
been done for the study, but actually stimulated reflection. nevertheless the interface element can have the intended supportive
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
potential. Although, there were some problems in handling, which There are some limitations and potential for improvement. Al-
need to be resolved, all participants interacted with the interface though the work described focuses on investigating the research
element. questions, it is part of the user centered development of a broader
behavior change application. Therefore, we re-designed the inter-
face (Figure 3). We changed the slider’s colour gradient and added
8.5 Interface Element 3: Activity Unit Planning colour coding from green to orange. As some few participants
Also the activity unit was used by all participants and the majority encountered problems in distinguishing the intensities in the cir-
found it helpful. Some wished for modified elements or opportuni- cular display, we increased the contrast of the colours so that they
ties that allow for an even more detailed planning. Reconsidering are easier to distinguish. Addressing the understanding problems,
the initially selected goal and stepping back to the first interface we revised the help texts for all elements. Regarding the activity
element demonstrates that such a planning element can in princi- unit planning element, we redesigned the element to make it more
pal empower users to reflect the recommendation. The usability understandable and meet the user demand for a more detailed plan-
and understanding problems reported by some participants are ning, such as a calendar function. Instead of the fields of 10 minute
addressed in the re-design of the interface presented below. intervals per activity unit, we now provide one field per day of the
week. Duration of the daily activity time can be modified via plus
or minus buttons. We further modified the label.
9 SUMMARY, RE-DESIGN AND CONCLUSION
We investigated if user interface design can in principal support
users of an activity tracking system in understanding and reflect-
ing the system’s goal recommendation as a basis for appropriately
exerting influence on the recommendation result. This kind of user
integration is important as recommendations in this field are very
error-prone because of large variations over time and a large num-
ber of confounding variables that are indeterminable for the system.
In our presented approach, we pursue three main aims, which are
(a) to empower users to understand the recommendation and its
implications, (b) to reflect and evaluate it and (c) to provide the
opportunity to actively manipulate it. In our approach, these aims
are pursued through transparency of the algorithm uncertainty
and by providing activity planning elements, which are intended
to have explanatory function regarding the impact of the recom-
mendation and to stimulate reflection of the recommended goal.
Further, these elements should enable and support users in ap-
propriate modifications of the recommended goal. We designed
two different interfaces with three elements: A modifiable slider
for goal selection showing a recommendation range, default value
and colour indicator for probability of suitability of the recommen-
dation; a conversion of the goal unit (kcal) into a temporal unit
(minutes) in conjunction with different activity levels; an element Figure 3: Interface Re-Design (translated from German)
to plan concrete activity units, i.e. how often and how long users
plan to be active to achieve their goal. The two interface variants
differed in the arrangement of the elements (single page or multiple
pages). We evaluated the interface with regard to the three main Those improvements refer to the specific design of the specific
aims presented above. Results were the same for both interface vari- interface elements used in this study. As they are exemplary imple-
ants. They show that there is a need of user empowerment and that mentations for interface elements, the revealed limitations do not
empowerment can be reached by interface elements that explain affect the gain of knowledge regarding the research question. It can
the impact of the recommendation. In this case, the second interface be concluded that implementation planning elements in particular
element achieved this by converting the goal from an abstract unit and interface elements in general have the potential to empower
to a unit, participants are more used to and which is more conceiv- users, support recommendation reflection and foster user interac-
able. The third interface element works by further illustrating what tion with the recommendation.
is necessary in the daily life to achieve the goal. The study shows,
that both of these interface elements can support reflection of the
recommendation. Exerting active influence on the recommendation ACKNOWLEDGMENTS
was initially stimulated by just providing the opportunity to do so, This work is part of the research project Personal Analytics, funded
with the first interface element. Additionally, results show that as a by the Federal Ministry of Education and Research (Bundesmin-
consequence of reflection, stimulated by interface elements 2 and isterium für Bildung und Forschung, BMBF), reference number:
3, further manipulation of the recommended goal was fostered. 16SV7110, aquired and headed by Aysegül Dogangün.
IntRS Workshop, September 2019, Copenhagen, DK Herrmanny, Löppenberg, and Schwarz
REFERENCES Carlos Castillo, Yelena Mejova, and Arnold Bosman (Eds.). ACM Press, New York,
[1] Laurence L. Alpay, Olivier Blanson Henkemans, Wilma Otten, Ton A. J. M. New York, USA, 157–161. https://doi.org/10.1145/3079452.3079499
Rövekamp, and Adrie C. M. Dumay. 2010. E-health applications and services [20] Meinald T Thielsch and Morten Moshagen. 2011. Erfassung visueller Ästhetik
for patient empowerment: directions for best practices in The Netherlands. mit dem VisAWI. Tagungsband UP11 (2011).
Telemedicine journal and e-health : the official journal of the American Telemedicine [21] Bo Xiao and Izak Benbasat. 2007. E-commerce product recommendation agents:
Association 16, 7 (2010), 787–791. https://doi.org/10.1089/tmj.2009.0156 use, characteristics, and impact. MIS quarterly 31, 1 (2007), 137–209.
[2] Aaron Bangor, Philip Kortum, and James Miller. 2009. Determining what indi-
vidual SUS scores mean: Adding an adjective rating scale. Journal of usability
studies 4, 3 (2009), 114–123.
[3] Jesús Bobadilla, Fernando Ortega, Antonio Hernando, and A. Gutiérrez. 2013.
Recommender systems survey. Knowledge-Based Systems 46 (2013), 109–132.
https://doi.org/10.1016/j.knosys.2013.03.012
[4] John Brooke. 1996. SUS-A quick and dirty usability scale. Usability evaluation in
industry 189, 194 (1996), 4–7.
[5] Alexander Felfernig, Robin Burke, and Pearl Pu. 2012. Preface to the special issue
on user interfaces for recommender systems. User Modeling and User-Adapted
Interaction 22, 4-5 (2012), 313–316. https://doi.org/10.1007/s11257-012-9120-5
[6] Marsha E Fonteyn, Benjamin Kuipers, and Susan J Grobe. 1993. A description of
think aloud method and protocol analysis. Qualitative health research 3, 4 (1993),
430–441.
[7] Juan M. García-Gómez, Isabel de La Torre-Díez, Javier Vicente, Montserrat Robles,
Miguel López-Coronado, and Joel J. Rodrigues. 2014. Analysis of mobile health ap-
plications for a broad spectrum of consumers: a user experience approach. Health
informatics journal 20, 1 (2014), 74–84. https://doi.org/10.1177/1460458213479598
[8] Katja Herrmanny and Aysegül Dogangün. 2017. The Impact of Prediction Uncer-
tainty in Recommendations for Health-Related Behavior. In Proceedings of the
2nd International Workshop on Health Recommender Systems co-located with the
11th International Conference on Recommender Systems (RecSys 2017), David El-
sweiler, Santiago Hors-Fraile, Bernd Ludwig, Alan Said, Hanna Schäfer, Christoph
Trattner, Helma Torkamaan, and André Calero Valdez (Eds.). RWTH, 14–17.
http://ceur-ws.org/Vol-1953/healthRecSys17_paper_8.pdf
[9] Katja Herrmanny, Jürgen Ziegler, and Aysegül Dogangün. 2016. Supporting
Users in Setting Effective Goals in Activity Tracking. In Persuasive Technology,
Alexander Meschtscherjakov, Boris de Ruyter, Verena Fuchsberger, Martin Murer,
and Manfred Tscheligi (Eds.). Springer International Publishing, Cham, 15–26.
[10] Jamil Hussain, Wajahat Ali Khan, Muhammad Afzal, Maqbool Hussain,
Byeong Ho Kang, and Sungyoung Lee. 2014. Adaptive User Interface and User
Experience Based Authoring Tool for Recommendation Systems. In Ubiquitous
Computing and Ambient Intelligence. Personalisation and User Adapted Services,
Ramón Hervás, Sungyoung Lee, Chris Nugent, and José Bravo (Eds.). Lecture
Notes in Computer Science, Vol. 8867. Springer International Publishing, Cham,
136–142. https://doi.org/10.1007/978-3-319-13102-3{_}24
[11] Dosam Hwang, Xuan Hau Pham, and Jason J. Jung. 2011. Preference-based user
rate correction process for interactive recommendation systems. In iiWAS 2011,
David Taniar, Eric Pardede, Hong-Quang Nguyen, Wenny Rhayu, and Ismail
Khall (Eds.). ACM Press, New York, New York, USA, 412. https://doi.org/10.1145/
2095536.2095619
[12] Dietmar Jannach, Ingrid Nunes, and Michael Jugovac. 2017. Interacting with
Recommender Systems. In Proceedings of the 22nd International Conference on
Intelligent User Interfaces Companion - IUI ’17 Companion, George A. Papadopou-
los, Tsvi Kuflik, Fang Chen, Carlos Duarte, and Wai-Tat Fu (Eds.). ACM Press,
New York, New York, USA, 25–27. https://doi.org/10.1145/3030024.3030027
[13] Bart P. Knijnenburg, Martijn C. Willemsen, Zeno Gantner, Hakan Soncu, and
Chris Newell. 2012. Explaining the user experience of recommender systems.
User Modeling and User-Adapted Interaction 22, 4-5 (2012), 441–504. https://doi.
org/10.1007/s11257-011-9118-4
[14] Haridimos Kondylakis, Lefteris Koumakis, Eleni Kazantzaki, Maria Chatzimina,
Maria Psaraki, Kostas Marias, and Manolis Tsiknakis. 2015. Patient Empowerment
through Personal Medical Recommendations. In MEDINFO 2015: eHealth-enabled
health, Indra Neil Sarkar, Andrew Georgiou, and Paulo Mazzoncini de Azevedo
Marques (Eds.). IOS Press, Amsterdam and Berlin and Tokyo and Waschington,
DC, 1117. https://doi.org/10.3233/978-1-61499-564-7-1117
[15] Bettina Laugwitz, Theo Held, and Martin Schrepp. 2008. Construction and
Evaluation of a User Experience Questionnaire. In HCI and Usability for Education
and Work, Andreas Holzinger (Ed.). Lecture Notes in Computer Science, Vol. 5298.
Springer Berlin Heidelberg, Berlin, Heidelberg, 63–76. https://doi.org/10.1007/
978-3-540-89350-9{_}6
[16] James R. Lewis. 1991. Psychometric evaluation of an after-scenario questionnaire
for computer usability studies: the ASQ. ACM Sigchi Bulletin 23, 1 (1991), 78–81.
[17] Philipp Mayring. 2010. Qualitative Inhaltsanalyse: Grundlagen und Techniken.
Beltz. https://books.google.de/books?id=BJlxSQAACAAJ
[18] Pearl Pu and Li Chen. 2007. Trust-inspiring explanation interfaces for recom-
mender systems. Knowledge-Based Systems 20, 6 (2007), 542–556.
[19] Hanna Schäfer, Santiago Hors-Fraile, Raghav Pavan Karumur, André Calero
Valdez, Alan Said, Helma Torkamaan, Tom Ulmer, and Christoph Trattner. 2017.
Towards Health (Aware) Recommender Systems. In Proceedings of the 2017 Inter-
national Conference on Digital Health - DH ’17, Patty Kostkova, Floriana Grasso,