<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An empirical study on the persuasiveness of fact-based explanations for recommender systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Markus Zanker</string-name>
          <email>mzanker@acm.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Schoberegger</string-name>
          <email>m3schobe@edu.uni-klu.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Alpen-Adria-Universitaet Klagenfurt</institution>
          ,
          <addr-line>9020 Klagenfurt</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recommender Systems (RS) help users to orientate themselves in large product assortments and provide decision support. Explanations help recommender systems to enhance their impact on users by, for instance, justifying made recommendations. Arguments provide reason in a more structured way, by denoting a conclusion that follows from one or more premises. While expert systems' explanation have a long tradition in using argumentative patterns, argumentative explanations for recommendations have not yet been systematically researched. This paper compares therefore the persuasion potential of different explanation styles (sentences, facts or argument style) by comparing the robustness of subjects' preferences when employing an additive utility model from conjoint analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>Recommender Systems</kwd>
        <kwd>Explanation styles</kwd>
        <kwd>Persuasion potential of explanations</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Recommender Systems (RS) support online customers in
their decision making and should help them to avoid poor
decisions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Persuasive systems [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] are focusing on
changing a user’s belief or actions in an intended way. In this
context recommender systems need to be also seen as
persuasive systems, as their purpose lies in pointing users
towards unknown items that presumably match their interest,
i.e. making serendipitous propositions. This clearly
differentiates a recommendation system (RS) from an information
retrieval (IR) system that assumes an objective information
need of a user that can satisfied. In general explanations can
be seen as an attempt to fit a particular phenomenon into
a general pattern in order to increase understanding and
remove bewilderment or surprise [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In the context of product
recommendation scenarios explanations can be seen as
additional information about recommendations [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] that serves
the purpose of justifying why a specific item is part of a
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. To copy otherwise, to
republish, to post on servers or to redistribute to lists, requires prior specific
permission and/or a fee.
      </p>
      <p>
        Workshop IntRS at RecSys ’14 Foster City, CA USA
Copyright 20XX ACM X-XXXXX-XX-X/XX/XX ...$15.00.
recommendation list and promote objectives such as users’
trust in the system and confidence in decision making. In
the domain of expert systems explanations have already a
long tradition, where formal argumentation traces can serve
as explanations that justify the output of a system [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
According to [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] an argument is (a) a series of sentences,
statements, or propositions (b) where some are premises (c) and
one is the conclusion (d) where the premises are intended
to give a reason for the conclusion. As we believe that
research on explanations in general and comparative studies
on competing explanation styles are rare (a few pointers to
more recent exceptions [
        <xref ref-type="bibr" rid="ref3 ref6 ref7">7, 6, 3</xref>
        ]), we conducted a supervised
lab study that had the purpose to research the impact of
different explanation styles of knowledgeable explanations
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In particular we are interested in effects on the
robustness of users’ preferences when confronted with additional
explanations, i.e. exploring the persuasion potential of
explanations. More concretely we compared fact-based
explanations, that presented keywords as explanations to users,
such as A, B, C, with a basic argument style with A and B
as premises and C as a consequent, i.e. A, B therefore C.
Furthermore, we compared these fact-based explanations to
sentence-based explanations requiring more cognitive effort
to understand them. We selected three different item
domains that typically trigger high involvement of users, i.e.
hiking routes from the tourism and leisure domain (hiking
routes), energy plans and mobile phone plans, and controlled
for user preferences, item portfolio and the semantics of the
explanations themselves. We would like to note that the
study was conducted in the scope of the O-STAR project
that researches techniques for personalized route planning
for hikers in alpine regions. Next we will provide details on
our study design and finally discuss results and conclusions.
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>STUDY DESIGN</title>
      <p>We researched the question if the introduction of an
argumentbased writing style, i.e. use of the keyword therefore to
denote the conclusion of the preceding premises, has an
impact on the robustness of users’ preferences in face of
additional explanations. As already mentioned we asked users
to disclose their preferences for three different item domains
(hiking routes, mobile phone plans and energy plans) in a
supervised offline questionnaire. Figures 1 and 2 depict two
exemplary items from the hiking domain. Subjects were
invited to participate in a seminar room, where they had to
answer a paper &amp; pencil survey with two parts. The first
part included for each of the three domains exactly 6 items,
that are described by either 4 or 5 characteristics.
Table 1 depicts the three item domains and the artificial
design space of the item portfolios. To avoid confusion the
semantics of the domain attributes were defined in a sidebar
(e.g. Smartphone: denotes a device in the range of HTC
Desire X or Nokia Lumia 625). Participants had to rank the
6 options according to their general preference with respect
to the particular item domain. After disclosing their
preferences in the first part of the questionnaire (see Figure 1
for a translated excerpt of the questionnaire) users had to
solve a picture puzzle, where 10 different errors were hidden.
The purpose of this task is twofold: first, it distracts users
from their thoughts on the ranking tasks and, second, we
could use the numerical measure of correctly marked errors
to assess how concentrated participants followed the
questionnaire. Once participants had finished the first part they
handed it in and received the second part of the survey. This
way we were able to avoid that participants could have taken
a look on their first-round ranking when answering the
second part. In the second part participants had again to
rank sets of five items from the three item domains.
However, in addition to the item characteristics already used in
the first-round, additional explanations were given for each
item. The explanation style acts as the manipulated variable
(solely fact-based, argumentative facts and argumentative
sentences). Explanation style is permuted within subjects,
i.e. participants are confronted with all three explanation
styles for a different item domain and in different orders,
while the combination of item domain and explanation style
is varied between subjects. For each item exactly two
arguments, each with two premises and one conclusion, are
added as additional information (see examples in Table 2).
See Figure 2 for a depiction of two exemplary items from
the hiking domain with explanations following the style of
argumentative facts.</p>
      <p>Finally, the questionnaire controlled for demographic
characteristics and checked if participants noticed the
intervention, i.e. one question asked what was relevant for
ranking the items with multiple answering options. For analysis
we selected only participants that considered the additional
explanations provided in the second part in their ranking
decision.</p>
      <p>
        In Figure 3 we sketch the big picture of the study design.
Thus, participants rank sets of items from three different
domains twice, where item sets in the first and second part
of the questionnaire do not overlap. Due to measuring user
preferences twice for each domain (without and with
intervention of a specific explanation style), we can control for
the participants’ preferences on item sets and their
presentation. We employ an additive model from conjoint analysis,
that allows us to estimate the utilities for each item
characteristic [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], i.e. the overall utility of an item yi is
computed as the sum + ∑Z Z , where is a basic utility and
      </p>
      <sec id="sec-2-1">
        <title>Solely facts</title>
      </sec>
      <sec id="sec-2-2">
        <title>Argumentative facts</title>
      </sec>
      <sec id="sec-2-3">
        <title>Argumentative sentences</title>
      </sec>
      <sec id="sec-2-4">
        <title>Energy plans</title>
      </sec>
      <sec id="sec-2-5">
        <title>Solely facts</title>
      </sec>
      <sec id="sec-2-6">
        <title>Argumentative facts</title>
      </sec>
      <sec id="sec-2-7">
        <title>Argumentative sentences</title>
      </sec>
      <sec id="sec-2-8">
        <title>Mobile phone plans</title>
      </sec>
      <sec id="sec-2-9">
        <title>Solely facts</title>
      </sec>
      <sec id="sec-2-10">
        <title>Argumentative facts Argumentative sentences</title>
        <p>low altitude
easy distance
very family-friendly
low altitude
easy distance
therefore very family-friendly
This route is of low altitude
and easy distance, therefore
it is very family-friendly.
100% renewable energy
low environmental impact
high sustainability
100% renewable energy
low environmental impact
therefore high sustainability
This energy plan offers 100%
renewable energy with a low
environmental impact, therefore
its sustainability is high.
low basic fee
many anytime minutes
ideal for heavy use
low monthly basic fee
many anytime minutes
therefore ideal for heavy use
This mobile phone plan
offers a low monthly basic fee
with many anytime minutes,
therefore it is ideal for
heavy use.</p>
        <p>Z denotes the positive or negative utility contributed by
a specific item characteristic Z (for instance, the possibility
to have your meal on route in the hiking domain). Having
estimated the individual utilities of each item characteristic
we computed an a priori ranking for the unseen item sets
in the survey’s second part that is then compared with the
observed ranks for each user.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>RESULTS AND DISCUSSION</title>
      <p>In total 136 subjects, mostly students from
Alpen-AdriaUniversta¨t Klagenfurt, participated in our survey. From
each participant we received three rankings in the second
part of the survey (one for each domain), i.e. a total of
408 computed rank correlations before cleaning. More than
80% of all participants were young people aged between 18
and 25. Two thirds of our participants were females. All
respondents had a high-school degree and a few of them had
already a graduation degree from a university. Before
analysis we rigorously excluded participants whose answers might
be unreliable due to the following criteria:
1. Only respondents who demonstrated a thorough
attitude by identifying at least 50% of all hidden errors in
the picture puzzle.
2. We asked participants what they considered to be
relevant for making their decisions on the rankings. Based
on the answers to this multiple choice question we
included only respondents who had noticed the
additional information (explanations) and excluded all
respondents who answered that they relied on their gut
feelings.
3. We also asked participants how they experienced this
survey with the answering options interesting,
challenging, boring, unclear and useless. For further
consideration we only kept respondents that answered
challenging and were thus captivated by the ranking tasks.
We assumed that the option interesting is a polite way
of saying boring or useless.
4. Finally we cleaned records from the dataset, where the
estimation of individual utilities for product
characteristics was not reliable, i.e. rank correlation between
the a priori rankings based on estimated utility weights
and the actual a priori ranking of participants had to
be above 0.7.</p>
      <p>After applying this extremely restrictive selection procedure
we derived at the following size of the dataset (see Table
3). In order to check for the robustness of preferences
af</p>
      <sec id="sec-3-1">
        <title>Hiking Energy Mobile Solely facts</title>
        <p>styled facts that included the keyword therefore to denote
a conclusion reduced the robustness of participants’
preferences more than the pure fact-based explanations, i.e.
supporting our hypothesis that an argumentative explanation
style would influence users more. Argumentative sentences
preserved user preferences more than the fact-based
explanation styles. Obviously, sentences need more cognitive effort
from users to be understood and the effect of the keyword
therefore was seemingly lost in the sentence structure. The
difference between Spearman’s in all three categories is
statistically significant according to Kruskall-Wallis test (p
= 0.037).</p>
        <p>In addition we checked for interaction effects between
explanation style and product domain. As can be seen from
Table 5 fact-based explanation styles lead to less robust
preferences than sentence-based explanation styles.
Furthermore, argumentative facts seem to reduce participant’s
robustness of preferences even more than a pure facts based
explanation style. The only exception is the hiking domain,
where the order between facts and argumentative facts is
inverted. However, in this product domain preference
robustness is generally lower and it might have been harder
for respondents to determine own preferences in the hiking
domain than in the other two domains.
able features of recommender systems. Limitations or
possible lines of future research include varying the complexity
of arguments (i.e the number of premises) or its number as
well as additional item domains.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>Authors acknowledge the financial support from the
European Union (EU), the European Regional Development
Fund (ERDF), the Austrian Federal Government and the
State of Carinthia in the Interreg IV Italien- O¨sterreich
programme (project acronym O-STAR).
5.</p>
      <sec id="sec-4-1">
        <title>Solely facts</title>
        <p>Argumentative facts
Argumentative sentences
0.27
0.38
0.58</p>
        <p>per domain and expl. style</p>
        <p>
          This study therefore showed, that fact-based explanations
and an argumentative explanation style impacted
participants’ preferences stronger than full sentence explanations.
Objections against these conclusions might be the lack of
a control group and the paper &amp; pencil design without a
real recommendation situation. A control group would
allow us to estimate the natural stability of preferences
between both rounds and without any intervention. However,
in this study we were not interested in absolute rank
correlation measures, but only in the comparison of robustness
of respondents’ preferences between different conditions and
assumed that some natural instability would affect all
explanation styles the same way. In order to assess the impact
of an argumentative explanation style we wanted to control
for other effects and biases as good as possible. The
supervised paper &amp; pencil approach allowed us to control for user
preferences, the item portfolio and the persuasiveness of the
explanation content itself as well as insisting on a high
reliability of the measurements by excluding participants, who
made arbitrary rankings or did not notice the additional
explanations. In a previous study [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] we already compared
the sentence-based explanations with a no-explanations
control group and observed their positive impact on the
perception of the recommender system as a whole. However, one
could not isolate the impact on the robustness of
preferences by controlling for the different recommendation lists,
the different explanation content that would apply to
different recommendations or the differing appreciation of the
recommendation results themselves by participants.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSIONS</title>
      <p>This short paper presented an innovative study design for
measuring the impact of different explanation styles on
participants’ robustness of preferences in face of additional
explanations. The results indicate that fact-based
explanations have a stronger impact on participants preference
stability than sentence-based explanations. Furthermore, the
use of the keyword therefore indicating a conclusion drawn
from premises and an argumentative explanation style had
already a measurable impact on participants. Thus
arguments and fact-based explanations make users change their
minds about the item portfolio and can therefore be
valu</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Klaus</given-names>
            <surname>Backhaus</surname>
          </string-name>
          , Bernd Erichson, Wulff Plinke, and
          <string-name>
            <given-names>Rolf</given-names>
            <surname>Weiber</surname>
          </string-name>
          .
          <article-title>Multivariate Analysemethoden: Eine anwendungsorientierte Einfuhrung</article-title>
          . Springer, Berlin,
          <volume>12</volume>
          .,
          <article-title>vollsta¨ndig u¨berarbeitete auflage</article-title>
          .
          <source>edition</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Gerhard</given-names>
            <surname>Friedrich</surname>
          </string-name>
          and
          <string-name>
            <given-names>Markus</given-names>
            <surname>Zanker</surname>
          </string-name>
          .
          <article-title>A taxonomy for generating explanations in recommender systems</article-title>
          .
          <source>AI Magazine</source>
          ,
          <volume>32</volume>
          (
          <issue>3</issue>
          ):
          <fpage>90</fpage>
          -
          <lpage>98</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Fatih</given-names>
            <surname>Gedikli</surname>
          </string-name>
          , Dietmar Jannach, and
          <string-name>
            <given-names>Mouzhi</given-names>
            <surname>Ge</surname>
          </string-name>
          .
          <article-title>How should i explain? a comparison of different explanation types for recommender systems</article-title>
          .
          <source>Int. J. Hum.-Comput</source>
          . Stud.,
          <volume>72</volume>
          (
          <issue>4</issue>
          ):
          <fpage>367</fpage>
          -
          <lpage>382</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          , Markus Zanker, Alexander Felfernig, and
          <string-name>
            <given-names>Gerhard</given-names>
            <surname>Friedrich</surname>
          </string-name>
          .
          <source>Recommender Systems: An Introduction</source>
          . Cambridge Univ Pr,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>W.</given-names>
            <surname>Sinnott-Armstrong</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Fogelin</surname>
          </string-name>
          .
          <source>Cengage Advantage Books: Understanding Arguments. Wadsworth</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Nava</given-names>
            <surname>Tintarev</surname>
          </string-name>
          and
          <string-name>
            <given-names>Judith</given-names>
            <surname>Masthoff</surname>
          </string-name>
          .
          <article-title>Evaluating the effectiveness of explanations for recommender systems</article-title>
          .
          <source>User Modeling</source>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>22</volume>
          (
          <issue>4-5</issue>
          ):
          <fpage>399</fpage>
          -
          <lpage>439</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Jesse</given-names>
            <surname>Vig</surname>
          </string-name>
          , Shilad Sen,
          <string-name>
            <given-names>and John</given-names>
            <surname>Riedl</surname>
          </string-name>
          . Tagsplanations:
          <article-title>Explaining recommendations using tags</article-title>
          .
          <source>In Proceedings of the 14th International Conference on Intelligent User Interfaces</source>
          ,
          <source>IUI '09</source>
          , pages
          <fpage>47</fpage>
          -
          <lpage>56</lpage>
          , New York, NY, USA,
          <year>2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L. R.</given-names>
            <surname>Ye</surname>
          </string-name>
          and
          <string-name>
            <surname>P. E. Johnson.</surname>
          </string-name>
          <article-title>The impact of explanation facilities in user acceptance of expert system advice</article-title>
          .
          <source>MIS Quarterly</source>
          ,
          <volume>19</volume>
          (
          <issue>2</issue>
          ):
          <fpage>157</fpage>
          -
          <lpage>172</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Kyung</given-names>
            <surname>Hyan</surname>
          </string-name>
          <string-name>
            <surname>Yoo</surname>
          </string-name>
          , Ulrike Gretzel, and
          <string-name>
            <given-names>Markus</given-names>
            <surname>Zanker</surname>
          </string-name>
          .
          <source>Persuasive Recommender Systems - Conceptual Background and Implications</source>
          . Springer Briefs in Electrical and Computer Engineering. Springer,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Markus</given-names>
            <surname>Zanker</surname>
          </string-name>
          .
          <article-title>The influence of knowledgeable explanations on users' perception of a recommender system</article-title>
          .
          <source>In Padraig Cunningham</source>
          ,
          <string-name>
            <given-names>Neil J.</given-names>
            <surname>Hurley</surname>
          </string-name>
          , Ido Guy, and Sarabjot Singh Anand, editors,
          <source>RecSys</source>
          , pages
          <fpage>269</fpage>
          -
          <lpage>272</lpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Markus</given-names>
            <surname>Zanker</surname>
          </string-name>
          and
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Ninaus</surname>
          </string-name>
          .
          <article-title>Knowledgeable explanations for recommender systems</article-title>
          . In Jimmy Xiangji Huang, Irwin King,
          <string-name>
            <given-names>Vijay V.</given-names>
            <surname>Raghavan</surname>
          </string-name>
          , and Stefan Rueger, editors,
          <source>Web Intelligence</source>
          , pages
          <fpage>657</fpage>
          -
          <lpage>660</lpage>
          . IEEE,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>