<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Exploiting Distributional Semantics Models for Natural Language Context-aware Justifications for Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Giuseppe Spillo</string-name>
          <email>giuseppe.spillo@studenti.uniba.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cataldo Musto</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco de Gemmis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pasquale Lops</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Semeraro</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Bari - Dip. di Informatica</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper1 we present a methodology to generate context-aware natural language justifications supporting the suggestions produced by a recommendation algorithm. Our approach relies on a natural language processing pipeline that exploits distributional semantics models to identify the most relevant aspects for each different context of consumption of the item. Next, these aspects are used to identify the most suitable pieces of information to be combined in a natural language justification. As information source, we used a corpus of reviews. Accordingly, our justifications are based on a combination of reviews' excerpts that discuss the aspects that are particularly relevant for a certain context. In the experimental evaluation, we carried out a user study in the movies domain in order to investigate the validity of the idea of adapting the justifications to the different contexts of usage. As shown by the results, all these claims were supported by the data we collected.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Recommender Systems (RSs)
        <xref ref-type="bibr" rid="ref21 ref22">(Resnick and
Varian, 1997)</xref>
        are now recognised as a very effective
mean to support the users in decision-making tasks
        <xref ref-type="bibr" rid="ref20 ref23">(Ricci et al., 2015)</xref>
        . However, as the importance
of such technology in our everyday lives grows, it
is fundamental that these algorithms support each
suggestion through a justification that allows the
user to understand the internal mechanisms of the
recommendation process and to more easily
discern among the available alternatives.
      </p>
      <p>1Copyright ©2020 for this paper by its authors. Use
permitted under Creative Commons License Attribution 4.0
International (CC BY 4.0).</p>
      <p>
        To this end, several attempts have been
recently devoted to investigate how to introduce
explanation facilities in RSs
        <xref ref-type="bibr" rid="ref19 ref6">(Nunes and Jannach,
2017)</xref>
        and to identify the most suitable
explanation styles
        <xref ref-type="bibr" rid="ref15 ref8">(Gedikli et al., 2014)</xref>
        . Despite such a
huge research effort, none of the methodologies
currently presented in literature diversifies the
justifications based on the different contextual
situations in which the item will be consumed. This is a
clear issue, since context plays a key role in every
decision-making task, and RSs are no exception.
Indeed, as the mood or the company (friends,
family, children) can direct the choice of the movie to
be watched, so a justification that aims to convince
a user to enjoy a recommendation should contain
different concepts depending on whether the user
is planning to watch a movie with her friends or
with her children.
      </p>
      <p>
        In this paper we fill in this gap by proposing
an approach to generate a context-aware
justification that supports a recommendation. Our
methodology exploits distributional semantics models
        <xref ref-type="bibr" rid="ref3">(Lenci, 2008)</xref>
        to build a term-context matrix that
encodes the importance of terms and concepts in
each context of consumption. Such a matrix is
used to obtain a vector space representation of
each context, which is in turn used to identify the
most suitable pieces of information to be
combined in a justification. As information source, we
used a corpus of reviews. Accordingly, our
justifications are based on a combination of reviews’
excerpts that discuss with a positive sentiment the
aspects that are particularly relevant for a certain
context. Beyond its context-aware nature, another
distinctive trait of our methodology is the fact that
we generate post-hoc justifications that are
completely independent from the underlying
recommendation models and completely separated from
the step of generating the recommendations.
      </p>
      <p>To sum up, we can summarize the contributions
of the article as follows: (i) we propose a
methodology based on distributional semantics models
and natural language processing to automatically
learn a vector space representation of the different
contexts in which an item can be consumed; (ii)
We design a pipeline that exploits distributional
semantics models to generate context-aware
natural language justifications supporting the
suggestions returned by any recommendation algorithm;</p>
      <p>The rest of the paper is organized as follows:
first, in Section 2 we provide an overview of
related work. Next, Section 3 describes the main
components of our workflow and Section 4
discusses the outcomes of the experimental
evaluation. Finally, conclusions and future work of the
current research are provided in Section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>The current research borrows concepts from
review-based explanation strategies and
distributional semantics models. In the following, we will
try to discuss relevant related work and to
emphasize the hallmarks of our methodology.</p>
      <p>
        Review-based Explanations. According to
the taxonomy discussed in
        <xref ref-type="bibr" rid="ref7">(Friedrich and Zanker,
2011)</xref>
        , our approach can be classified as a
contentbased explanation strategy, since the justifications
we generate are based on descriptive features of
the item. Early attempts in the area rely on the
exploitation of tags
        <xref ref-type="bibr" rid="ref26 ref5">(Vig et al., 2009)</xref>
        and
features gathered from knowledge graphs
        <xref ref-type="bibr" rid="ref1 ref16">(Musto et
al., 2016)</xref>
        . With respect to classic content-based
strategies, the novelty of the current work lies
in the use of review data to build a natural
language justification. In this research line,
        <xref ref-type="bibr" rid="ref19 ref6">(Chen
and Wang, 2017)</xref>
        Chen et al. analyze users’
reviews to identify relevant features of the items,
which are presented on an explanation interface.
Differently from this work, we did not bound on
a fixed set of static aspects and we left the
explanation algorithm deciding and identifying the
most relevant concepts and aspects for each
contextual setting. A similar attempt was also
proposed in
        <xref ref-type="bibr" rid="ref1 ref16">(Chang et al., 2016)</xref>
        . Moreover, as
previously emphasized, a trait that distinguishes our
approach with respect to such literature is the
adaptation of the justification based on the different
setting in which the item is consumed. The only
work exploiting context in the justification
process has been proposed by Misztal et al. in
        <xref ref-type="bibr" rid="ref11">(Misztal and Indurkhya, 2015)</xref>
        . However, differently
from our work, they did not diversify the
justifications of the same items on varying of different
contextual settings in which the item is consumed,
since they just adopt features inspired by context
(e.g., "I suggest you this movie since you like this
genre in rainy days") to explain a
recommendation. Distributional Semantics Models. Another
distinctive trait of the current work is the
adoption of distributional semantics models (DMSs) to
build a vector space representation of the different
contextual situations in which an item can be
consumed. Typically, DSMs rely on a term-context
matrix, where rows represent the terms in the
corpus and columns represents contexts of usage. For
the sake of simplicity, we can imagine a context as
a fragment of text in which the term appears, as a
sentence, a paragraph or a document. Every time a
particular term is used in a particular context, such
an information is encoded in this matrix. One of
the advantages that follows the adoption of DSMs
is that they can learn a vector space
representation of terms in a totally unsupervised way. These
methods, recently inspired methods in the area of
word embeddings, such as WORD2VEC
        <xref ref-type="bibr" rid="ref10">(Mikolov
et al., 2013)</xref>
        and contextual word representations
        <xref ref-type="bibr" rid="ref24">(Smith, 2020)</xref>
        . Even if some attempts
evaluating RSs based on DSMs already exists
        <xref ref-type="bibr" rid="ref12 ref14 ref15 ref26 ref5 ref8">(Lops et
al., 2009; Musto et al., 2011; Musto et al., 2012;
Musto et al., 2014)</xref>
        , in our attempt we used DSMs
to build a vector-space representation of the
different contextual dimensions. Up to our knowledge,
the usage of DSMs for justification purposes this
is a completely new research direction in the area
of explanation.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>Our workflow to generate context-aware
justifications based on users’ reviews is shown in Figure 1.
In the following, we will describe all the modules
that compose the workflow.</p>
      <p>Context Learner. The first step is carried out
by the CONTEXT LEARNER module, which
exploits DSMs to learn a vector space representation
of the contexts. Formally, given a set reviews R
and a set of k contextual settings C = fc1 : : : ckg,
this module generates as output a matrix Cn;k that
encodes the importance of each term ti in each
contextual setting c j. In order to build such a
representation, we first split all the reviews r 2 R
in sentences. Next, let S be the set of
previously obtained sentences, we manually annotated
a subset of these sentences in order to obtain a
set S0 = fs1 : : : smg, where each si is labeled with
one or more contextual settings, based on the
concepts mentioned in the review. Of course, each
si can be annotated with more than one context.
As an example, a review including the sentence ’a
very romantic movie’ is annotated with the
contexts company=partner, while the sentence
’perfect for a night at home’ is annotated with the
contexts day=weekday. After the annotation step, a
sentence-context matrix Am;k is built, where each
asi;cj is equal to 1 if the sentence si is annotated
with the context c j (that is to say, it mentions
concepts that are relevant for that context), 0
otherwise.</p>
      <p>
        Next, we run tokenization and lemmatization
algorithms (Manning et al., 1999) over the sentences
in S to obtain a lemma-sentence matrix Vn;m. In
this case, vti;sj is equal to the TF/IDF of the term
ti in the sentence s j. Of course, IDF is
calculated over all the annotated sentences. In order
to filter out non-relevant lemmas, we maintained
in the matrix V just nouns and adjectives. Nouns
were chosen due to previous research
        <xref ref-type="bibr" rid="ref19">(Nakagawa
and Mori, 2002)</xref>
        , which showed that descriptive
features of an item are usually represented using
nouns (e.g., service, meal, location, etc.).
Similarly, adjectives were included since they play a
key role in the task of catching the
characteristics of the different contextual situations (e.g.,
romantic, quick, etc.). Moreover, we also decided
to take into account and extract combinations of
nouns and adjectives (bigrams) such as romantic
location, since they can be very useful to highlight
specific characteristics of the item.
      </p>
      <p>In the last step of the process annotation matrix
An;k and vocabulary matrix Vm;n are multiplied to
obtain our lemma-context matrix Cn;k, which
represents the final output returned by the CONTEXT
LEARNER module. Of course, each ci; j encodes
the importance of term ti in the context c j. The
whole process carried out by this component is
described in Figure 2.</p>
      <p>Given such a representation, two different
outputs are obtained. First, we can directly extract
column vectors ~cj from matrix C, which
represents the vector space representation of the
context c j based on DSMs. It should be pointed out
that such a representation perfectly fits the
principles of DSMs since contexts discussed through
the same lemmas will share a very similar
vector space representation. Conversely, a poor
overlap will result in very different vectors.
Moreover, for each column, lemmas may be ranked and
those having the highest TF-IDF scores may be
extracted. In this way, we obtain a lexicon of
lemmas that are relevant for a particular contextual
setting, and this can be useful to empirically
validate the effectiveness of the approach. In Table
1, we anticipate some details of our experimental
session and we report the top-3 lemmas for two
different contextual settings starting from a set of
movie reviews.</p>
      <p>
        Ranker. Given a recommended item (along
with its reviews) and given the context in which
the item will be consumed (from now on, defined
as ’current context’), this module has to identify
the most relevant review excerpts to be included
in the justification. To this end, we designed a
ranking strategy that exploits DSMs and similarity
measures in vector spaces to identify suitable
excerpts: given a set of n reviews discussing the item
i, Ri = fri;1 : : : ri;ng, we first split each ri in
sentences. Next, we processed the sentences through
a sentiment analysis algorithm
        <xref ref-type="bibr" rid="ref20 ref23 ref4">(Liu, 2012; Petz
et al., 2015)</xref>
        in order to filter out those
expressing a negative or neutral opinions about the item.
The choice is justified by our focus on review
excerpts discussing positive characteristics of the
item. Next, let c j be the current contextual
situation (e.g., company=partner), we calculate the
cosine similarity between the context vector ~cj
returned by the CONTEXT LEARNER and a vector
0v1;1 v1;2 : : : v1;m1 0a1;1
Bv2;1 v2;2 : : : v2;mC Ba2;1
B@B ::: ::: ::: ::: CACx@BB :::
vn;1 vn;2 : : : vn;m
      </p>
      <p>Vn;m
a1;2 : : : a1;k 1
a2;2 : : :</p>
      <p>::: :::
am;1 am;2 : : : am;k</p>
      <p>Am;k</p>
      <p>0c1;1 c1;2 : : : c1;k1
a2:::;k CCCA = BB@ ::: ::: ::: ::: CCA</p>
      <p>Bc2;1 c2;2 : : : c2;kC
cn;1 cn;2 : : : cn;k</p>
      <p>Cn;k
space representation of each sentence ~si. The
sentences having the highest cosine similarity w.r.t.
to the context of usage c j are selected as the most
suitable excerpts and are passed to the
GENERATOR.</p>
      <p>
        Generator. Finally, the goal of GENERATOR is
to put together the compliant excerpts in a single
natural language justification. In particular, we
defined a slot-filling strategy based on the
principles of Natural Language Generation
        <xref ref-type="bibr" rid="ref21 ref22">(Reiter and
Dale, 1997)</xref>
        . Such a strategy is based on the
combination of a fixed part, which is common to all
the justifications, and a dynamic part that depends
on the outputs returned by the previous steps. In
our case, the top-1 sentence for each current
contextual dimension is selected, and the different
excerpts are merged by exploiting simple
connectives, such as adverbs and conjunctions. An
example of the resulting justifications is provided in
Table 2.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Experimental Evaluation</title>
      <p>
        The experimental evaluation was designed to
identify the best-performing configuration of our
strategy, on varying of different combinations of the
parameters of the workflow (Research Question
1), and to assess how our approach performs
in comparison to other methods (both
contextaware and non-contextual) to generate post-hoc
justifications (Research Question 2). To this
end, we designed a user study involving 273
subjects (male=50%, degree or PhD=26.04%,
age 35=49,48%, already used a RS=85.4%) in
the movies domain. Interest in movies was
indicated as medium or high by 62.78% of the
sample. Our sample was obtained through the
availability sampling strategy, and it includes students,
researchers in the area and people not skilled with
computer science and recommender systems. As
in
        <xref ref-type="bibr" rid="ref25">(Tintarev and Masthoff, 2012)</xref>
        , whose
protocol was took as a reference in several subsequent
research in the area of explanation
        <xref ref-type="bibr" rid="ref17">(Musto et al.,
2019)</xref>
        , we evaluated the following metrics:
transparency, persuasiveness, engagement and trust
through a post-usage questionnaire.
      </p>
      <p>Experimental Design. To run the experiment,
we deployed a web application2 implementing
the methodology described in Section 3. Next,
as a first step, we identified the relevant
contextual dimensions for each domain. Contexts were
selected by carrying out an analysis of related
work of context-aware recommender systems in
the MOVIE domain. In total, we defined 3
contextual dimensions, that is to say, mood (great,
normal), company (family, friends, partner) and
level of attention (high, low). To collect the data
necessary to feed our web application, we
selected a subset of 300 popular movies
(according to IMDB data) discussed in more than 50
reviews in the Amazon Reviews dataset 3. This
choice is motivated by our need of a large set of
sentences discussing the item in each contextual
setting. These data were processed by exploiting
lemmatization, POS-tagging and sentiment
analysis algorithms available in CoreNLP4 and Stanford
Sentiment Analysis algorithm5. tool. Some
statistics about the final dataset are provided in Table
2http://193.204.187.192:8080/filmando-eng
3http://jmcauley.ucsd.edu/data/amazon/
links.html - Only the reviews available in the ’Movies and
TV’ category were downloaded.</p>
      <p>4https://stanfordnlp.github.io/CoreNLP/
5https://nlp.stanford.edu/sentiment/
3.</p>
      <p>
        In order to compare different configurations of
the workflow, we designed several variant
obtained by varying the vocabulary of lemmas. In
particular, we compared the effectiveness of
simple unigrams, of bigrams and their merge. In the
first case, we encoded in our matrix just single
lemmas (e.g., service, meal, romantic, etc.) while
in the second we stored combination of nouns and
adjectives (e.g., romantic location). Due to space
reasons, we can’t provide more details about the
lexicons we learnt, and we suggest to refer again
to Table 1 for a qualitative evaluation of some of
the resulting representations. Our representations
based on DSMs were obtained by starting from a
set of 1,905 annotations for the movie domain,
annotated by three annotators by adopting a majority
vote strategy. To conclude, each user involved in
the experiment carried out the following steps:
1. Training, Context Selection and Generation
of the Recommendation. First, we asked
the users to provide some basic demographic
data and to indicate their interest in movies.
Next, each user indicated the context of
consumption of the recommendation, by
selecting a context among the different
contextual settings we previously indicated (see
Figure 3-a). Given the current context, a
suitable recommendation was identified and
presented to the user. As recommendation
algorithm we used a content-based
recommendation strategy exploiting users’ reviews.
2. Generation of the Justification. Given the
recommendation and the current context of
consumption, we run our pipeline to
generate a context-aware justification of the item
adapted to that context. In this case, we
designed a between-subject protocol. In
particular, each user was randomly assigned to one
of the three configurations of our pipeline and
the output was presented to the user along
with the recommendation (see Figure 3-b).
Clearly, the user was not aware of the specific
configuration he was interacting with.
3. Evaluation through Questionnaires. Once
the justification was shown, we asked the
users to fill in a post-usage
questionnaire. Each user was asked to evaluate
transparency, persuasiveness, engagement
and trust of the recommendation process
through a five-point scale (1=strongly
disagree, 5=strongly agree). The questions the
users had to answer follow those proposed in
        <xref ref-type="bibr" rid="ref25">(Tintarev and Masthoff, 2012)</xref>
        . Due to space
reasons, we can’t report the questions and we
suggest to interact with the web application
to fill in the missing details.
4. Comparison to baselines. Finally, we
compared our method to two different baselines
in a within-subject experiment. In this case,
all the users were provided with two different
justifications styles (i.e., our context-aware
justifications and a baseline) and we asked
the users to choose the one they preferred.
As for the baselines, we focused on other
methodologies to generate post-hoc
justifications and we selected (i) a context-aware
strategy to generate justifications, which is
based on a set of manually defined relevant
terms for each context; (ii) a method to
generate non-contextual review-based
justifications that relies on the automatic
identification of relevant aspects and on the
selection of compliant reviews excerpts
containing such terms. Such approach partially
replicates that presented in
        <xref ref-type="bibr" rid="ref18">(Musto et al., 2020)</xref>
        .
      </p>
    </sec>
    <sec id="sec-5">
      <title>Discussions of the Results Results of the first</title>
      <p>experiment, that allows to answer to Research
Question 1, are presented in Table 4. The values
in the tables represent the average scores provided
by the users for each of the previously mentioned
questions. As for the movie domain, results show
that the overall best results are obtained by
us#Items #Reviews #Sentences #Positive Sent. Avg. Sent./Item
307 153,398 1,464,593 560,817 4,770.66
Avg. Pos. Sent./Item
1,826.76
ing a vocabulary based on unigrams and bigrams.</p>
      <p>This first finding provides us with an interesting
outcome, since most of the strategies to
generate explanations are currently based on single
keywords and aspects. Conversely, our experiment
showed that both adjectives as well as couples
of co-occurring terms are worth to be encoded,
since they catch more fine-grained characteristics
of the item that are relevant in a particular
contextual setting. Overall, the results we obtained
confirmed the validity of the approach. Beyond the
increase in TRANSPARENCY, high evaluations were
also noted for PERSUASION and ENGAGEMENT
metrics. This outcome confirms how the
identification of relevant reviews’ excerpts can lead to
satisfying justifications. Indeed, differently from
feature-based justifications, that typically rely on
very popular and well-known characteristics of the
movie, as the actors or the director, more specific
aspects of the items emerge from users’ reviews.</p>
      <p>Next, in order to answer to Research
Question 2, we compared the best-performing
configurations emerging from Experiment 1 to two
different baselines. The results of these experiments
are reported in Table 5 which show the percentage
of users who preferred our context-aware
methodology based on DSMs to both the baselines. In
particular, the first comparison allowed us to
assess the effectiveness of a vector space
representation of contexts based on DSMs with respect to
a simple context-aware justification method based
on a fixed lexicon of relevant terms, while the
second comparison investigated how valid was the
idea of diversifying the justifications based on the
different contextual settings in which the items is
consumed. As shown in the table, our approach
was the preferred one in both the comparisons. It
should be pointed out that the gaps are particularly
large when our methodology is compared to a
noncontextual baseline. In this case, we noted a
statistically significant gap (p 0:05) for all the
metrics, with the exception of trust. This suggests that
diversifying the justifications based on the context
of consumption is particularly appreciated by the
users. This confirms the validity of our intuition,
which led to a completely new research direction
in the area of justifications for recommender
systems.
5</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusions and Future Work</title>
      <p>In this paper we presented a methodology that
exploits DSMs to build post-hoc context-aware
natural language justifications supporting the
suggestions generated by a RS. The hallmark of this work
is the diversification of the justifications based on
the different contextual settings in which the items
will be consumed, which is a new research
direction in the area. As shown in our experiments,
our justifications were largely preferred by users.
This confirms the effectiveness of our approach
and paves the way to several future research
directions, such as the definition of personalized
justiMetrics / Configuration</p>
      <p>Transparency
Persuasion
Engagement</p>
      <p>
        Trust
fication as well as the generation of hybrid
justifications that combine elements gathered from
usergenerated content (as the reviews) with descriptive
characteristics of the items. Finally, we will also
evaluate the use of ontologies and rules
        <xref ref-type="bibr" rid="ref9">(Laera et
al., 2004)</xref>
        in order to implement reasoning
mechanisms to better identify the most relevant aspects
in the reviews.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Chang et al.2016]
          <string-name>
            <given-names>Shuo</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F Maxwell</given-names>
            <surname>Harper</surname>
          </string-name>
          , and Loren Gilbert Terveen.
          <year>2016</year>
          .
          <article-title>Crowd-based Personalized Natural Language Explanations for Recommendations</article-title>
          .
          <source>In Proceedings of the 10th ACM Conference on Recommender Systems</source>
          , pages
          <fpage>175</fpage>
          -
          <lpage>182</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2004.
          <article-title>Sweetprolog: A system to integrate ontologies and rules</article-title>
          .
          <source>In International Workshop on Rules and Rule Markup Languages for the Semantic Web</source>
          , pages
          <fpage>188</fpage>
          -
          <lpage>193</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [Lenci2008]
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Lenci</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Distributional semantics in linguistic and cognitive research</article-title>
          .
          <source>Italian journal of linguistics</source>
          ,
          <volume>20</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Liu2012]
          <string-name>
            <given-names>Bing</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Sentiment analysis and opinion mining</article-title>
          .
          <source>Synthesis lectures on human language technologies</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>167</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [Lops et al.2009]
          <string-name>
            <given-names>Pasquale</given-names>
            <surname>Lops</surname>
          </string-name>
          , Marco de Gemmis, Giovanni Semeraro, Cataldo Musto, Fedelucio Narducci, and
          <string-name>
            <given-names>Massimo</given-names>
            <surname>Bux</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>A semantic content-based recommender system integrating folksonomies for personalized access</article-title>
          .
          <source>In Web Personalization in Intelligent Environments</source>
          , pages
          <fpage>27</fpage>
          -
          <lpage>47</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>[Chen and Wang2017] Li Chen and Feng Wang</source>
          .
          <year>2017</year>
          . [Manning et al.1999]
          <article-title>Christopher D Manning, ChristoExplaining Recommendations based on Feature pher D Manning,</article-title>
          and
          <string-name>
            <given-names>Hinrich</given-names>
            <surname>Schütze</surname>
          </string-name>
          .
          <year>1999</year>
          .
          <article-title>FounSentiments in Product Reviews</article-title>
          .
          <source>In Proceedings dations of statistical natural language processing. of the 22nd International Conference on Intelligent MIT press. User Interfaces</source>
          , pages
          <fpage>17</fpage>
          -
          <lpage>28</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[Friedrich and Zanker2011] Gerhard Friedrich and Markus Zanker</source>
          .
          <year>2011</year>
          .
          <article-title>A taxonomy for generating explanations in recommender systems</article-title>
          .
          <source>AI Magazine</source>
          ,
          <volume>32</volume>
          (
          <issue>3</issue>
          ):
          <fpage>90</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [Gedikli et al.2014]
          <string-name>
            <given-names>Fatih</given-names>
            <surname>Gedikli</surname>
          </string-name>
          , Dietmar Jannach, and
          <string-name>
            <given-names>Mouzhi</given-names>
            <surname>Ge</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>How should i explain? a comparison of different explanation types for recommender systems</article-title>
          .
          <source>International Journal of Human-Computer Studies</source>
          ,
          <volume>72</volume>
          (
          <issue>4</issue>
          ):
          <fpage>367</fpage>
          -
          <lpage>382</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [Laera et al.2004]
          <string-name>
            <given-names>Loredana</given-names>
            <surname>Laera</surname>
          </string-name>
          , Valentina Tamma, Trevor Bench-Capon, and Giovanni Semeraro.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [Mikolov et al.2013]
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Ilya Sutskever, Kai Chen, Greg S Corrado, and
          <string-name>
            <given-names>Jeff</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          , pages
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[Misztal and Indurkhya2015] Joanna Misztal and Bipin Indurkhya</source>
          .
          <year>2015</year>
          .
          <article-title>Explaining contextual recommendations: Interaction design study and prototype implementation</article-title>
          .
          <source>In IntRS@ RecSys</source>
          , pages
          <fpage>13</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [Musto et al.2011]
          <string-name>
            <given-names>C.</given-names>
            <surname>Musto</surname>
          </string-name>
          , G. Semeraro,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          , and M. de Gemmis.
          <year>2011</year>
          .
          <article-title>Random indexing and negative user preferences for enhancing content-based Recommender Systems</article-title>
          .
          <source>In EC-Web</source>
          <year>2011</year>
          , volume
          <volume>85</volume>
          <source>of Lecture Notes in Business Inf. Processing</source>
          , pages
          <fpage>270</fpage>
          -
          <lpage>281</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>workshop on computational terminology-</source>
          Volume
          <volume>14</volume>
          , pages
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [Musto et al.2012]
          <string-name>
            <given-names>C.</given-names>
            <surname>Musto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Narducci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          , G. Semeraro,
          <string-name>
            <surname>M. De Gemmis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Barbieri</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Korst</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Pronk</surname>
            , and
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Clout</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Enhanced semantic tv-show representation for personalized electronic program guides</article-title>
          .
          <source>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          , 7379 LNCS:
          <fpage>188</fpage>
          -
          <lpage>199</lpage>
          . cited By 19.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [Musto et al.2014]
          <string-name>
            <given-names>C.</given-names>
            <surname>Musto</surname>
          </string-name>
          , G. Semeraro,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          , and M. de Gemmis.
          <year>2014</year>
          .
          <article-title>Combining distributional semantics and entity linking for context-aware content-based recommendation</article-title>
          .
          <source>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          ,
          <volume>8538</volume>
          :
          <fpage>381</fpage>
          -
          <lpage>392</lpage>
          . cited By 19.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Musto et al.2016]
          <string-name>
            <given-names>Cataldo</given-names>
            <surname>Musto</surname>
          </string-name>
          , Fedelucio Narducci, Pasquale Lops, Marco De Gemmis, and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Explod: A framework for explaining recommendations based on the linked open data cloud</article-title>
          .
          <source>In Proceedings of the 10th ACM Conference on Recommender Systems, RecSys '16</source>
          , pages
          <fpage>151</fpage>
          -
          <lpage>154</lpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [Musto et al.2019]
          <string-name>
            <given-names>Cataldo</given-names>
            <surname>Musto</surname>
          </string-name>
          , Pasquale Lops, Marco de Gemmis, and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Justifying recommendations through aspect-based sentiment analysis of users reviews</article-title>
          .
          <source>In Proceedings of the 27th ACM Conference on User Modeling, Adaptation and Personalization</source>
          , pages
          <fpage>4</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [Musto et al.2020]
          <string-name>
            <given-names>Cataldo</given-names>
            <surname>Musto</surname>
          </string-name>
          , Marco de Gemmis, Pasquale Lops, and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Generating post hoc review-based natural language justifications for recommender systems. User Modeling and User-Adapted Interaction</article-title>
          , pages
          <fpage>1</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <source>[Nakagawa and Mori2002] Hiroshi Nakagawa and Tatsunori Mori</source>
          .
          <year>2002</year>
          .
          <article-title>A simple but powerful automatic term extraction method</article-title>
          .
          <source>In COLING02 on COMPUTERM</source>
          <year>2002</year>
          <article-title>: second international [Nunes and Jannach2017] Ingrid Nunes</article-title>
          and
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>A systematic review and taxonomy of explanations in decision support and recommender systems</article-title>
          .
          <source>User Modeling</source>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>27</volume>
          (
          <issue>3-5</issue>
          ):
          <fpage>393</fpage>
          -
          <lpage>444</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [Petz et al.2015]
          <string-name>
            <given-names>Gerald</given-names>
            <surname>Petz</surname>
          </string-name>
          , Michał Karpowicz, Harald Fürschuß, Andreas Auinger, Václav Strˇítesky`, and
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Holzinger</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Reprint of: Computational approaches for mining user's opinions on the web 2.0</article-title>
          .
          <string-name>
            <given-names>Information</given-names>
            <surname>Processing</surname>
          </string-name>
          &amp; Management,
          <volume>51</volume>
          (
          <issue>4</issue>
          ):
          <fpage>510</fpage>
          -
          <lpage>519</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <source>[Reiter and Dale1997] Ehud Reiter and Robert Dale</source>
          .
          <year>1997</year>
          .
          <article-title>Building applied natural language generation systems</article-title>
          .
          <source>Natural Language Engineering</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          ):
          <fpage>57</fpage>
          -
          <lpage>87</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <source>[Resnick and Varian1997] Paul Resnick and Hal R Varian</source>
          .
          <year>1997</year>
          .
          <article-title>Recommender systems</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>40</volume>
          (
          <issue>3</issue>
          ):
          <fpage>56</fpage>
          -
          <lpage>58</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [Ricci et al.2015]
          <string-name>
            <given-names>Francesco</given-names>
            <surname>Ricci</surname>
          </string-name>
          , Lior Rokach, and
          <string-name>
            <given-names>Bracha</given-names>
            <surname>Shapira</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Recommender systems: introduction and challenges</article-title>
          .
          <source>In Recommender systems handbook</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>34</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <source>[Smith2020] Noah A Smith</source>
          .
          <year>2020</year>
          .
          <article-title>Contextual word representations: putting words into computers</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>63</volume>
          (
          <issue>6</issue>
          ):
          <fpage>66</fpage>
          -
          <lpage>74</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <source>[Tintarev and Masthoff2012] Nava Tintarev and Judith Masthoff</source>
          .
          <year>2012</year>
          .
          <article-title>Evaluating the Effectiveness of Explanations for Recommender Systems</article-title>
          . UMUAI,
          <volume>22</volume>
          (
          <issue>4-5</issue>
          ):
          <fpage>399</fpage>
          -
          <lpage>439</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [Vig et al.2009]
          <string-name>
            <given-names>Jesse</given-names>
            <surname>Vig</surname>
          </string-name>
          , Shilad Sen,
          <string-name>
            <given-names>and John</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Tagsplanations: explaining recommendations using tags</article-title>
          .
          <source>In Proceedings of the 14th international conference on Intelligent user interfaces</source>
          , pages
          <fpage>47</fpage>
          -
          <lpage>56</lpage>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>