<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Context-Aware Recommender Comparison Of Three Approaches Systems: A</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Umberto Panniello</string-name>
          <email>u.panniello@poliba.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michele Gorgoglione</string-name>
          <email>m.gorgoglione@poliba.it</email>
        </contrib>
      </contrib-group>
      <abstract>
        <p>Methods for generating context-aware recommendations were classified into the pre-filtering, post-filtering and contextual modeling approaches. This paper proposes a novel type of contextual modeling, that is called contextual neighbors, based on the idea of using context to compute the neighborhood in a collaborative filtering approach, and introduces four variants of this method. In addition, the paper presents the results of the comparison among these four approaches and among the contextual neighbors approach to the other contextual approaches and to the un-contextual one. While some of these methods have been studied independently, few prior research has compared their performance to determine which of them is better.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Most traditional Recommender Systems (RSs) provide recommendations of items
to users and vice versa and do not take into consideration the circumstances and
other contextual information when recommendations take place. For example,
when an online travel agency recommends a vacation package, it is important to
know when the person plans to go on vacation. Recently, some companies started
taking into account the contextual information (i.e., Sourcetone). In academia,
several studies, such as [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], demonstrated that context induces important changes
in a customer purchasing behavior. Experimental research on customer modeling
suggests that including context in a customer behavior model improves the ability
to predict her behavior in some cases because it allows the identification of more
homogeneous patterns in the data describing the purchasing history of a customer
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Therefore, the accuracy of predicting consumer preferences should depend on
the degree to which we have incorporated the relevant contextual in-formation.
The usage of contextual information in context-aware recommender systems
(CARS) can be broadly classified into two groups: (1) recommendation via
context-driven querying and search, and (2) recommendation via contextual
preference elicitation and estimation. The approach based on the context-driven
querying and search has been used by several mobile and tourist recommender systems,
e.g., [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], that typically use contextual information to query or search a certain
repository of resources (e.g., restaurants) and present the best matching resources
(e.g., nearby restaurants that are currently open) to the user.
      </p>
      <p>
        The other approach to CARS, that we follow in this paper, is based on contextual
preference elicitation and estimation, e.g., [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ], including elicitation and
estimation of ratings of various items provided by different users. This approach
can be traced back to the work in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] hypothesized that the inclusion
of knowledge about the user’s task into the recommendation algorithm in certain
applications can lead to better recommendations. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] described a way to
incorporate the contextual information into recommender systems by using a
multidimensional approach in which the traditional 2-dimensional (2D) User/Item paradigm
was extended to support additional contextual dimensions, such as Time, Location
and Company. Since then, several contextual preference elicitation and estimation
approaches to CARS have been proposed, all of them emphasizing the need to
model and learn user’s context-sensitive preferences. Many of these methods are
reviewed in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Once the context-sensitive preferences of users are
learned, recommendations are generated by either adapting the existing
collaborative filtering, content-based, or hybrid recommendation methods to context-aware
recommendation settings or by developing novel intelligent data analysis
techniques from data mining and machine learning. Furthermore, several scholars [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
[
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ] have shown that adding contextual information helps to improve estimations
of unknown ratings in this approach. For example, [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] described a way to
include the contextual information by using a multidimensional approach, as it
was mentioned above. Also, it was shown in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] that the multidimensional
contextual information does matter and can lead to better recommendations in
comparison to the traditional un-contextual 2D recommender systems.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2 Pre-filtering, post-filtering and contextual modeling</title>
      <p>
        In this paper, we use implicit ratings following the approach outlined by [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. We measure the utility of product j for user i with the purchasing
frequency xij specifying how often user i purchased product j. Unlike the traditional
two-dimensional (2D) recommender systems that try to estimate unknown ratings
in the Users × Items matrix, where Users and Items are the sets of users and items
respectively, context-aware recommender systems (CARS) also take into account
contextual information in a Users × Items × Context matrix. Context is a set of
contextual attributes C each having a hierarchical structure defined by a set of q
atomic attributes, i.e., C = (C1,…, Cq) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Further, the values taken by attribute
Cq define finer (more granular) levels, while C1 coarser (less granular) levels of
(a)
(b)
(c)
contextual knowledge [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. For example, Fig. 2 presents the hierarchy for the
contextual attributes used in our analysis (see Section 3).
      </p>
      <p>
        In this paper we compare the three contextual approaches proposed in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] (also
shown in Errore. L'origine riferimento non è stata trovata.), that start with
“Data” on users, items, ratings and contextual information (“Context”) and results
in generating context-specific recommendations, are presented. In the contextual
pre-filtering (PreF) the contextual information is used to filter out irrelevant
ratings before they are used for computing recommendations with standard (2D)
methods. In the contextual post-filtering (PoF) context is used after the classical (2D)
recommendation methods are applied to the standard recommendation data. In the
contextual modeling (CM) context is used inside the recommendation-generating
algorithms. The work presented in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] helped researchers to understand different
aspects of using the contextual information in the recommendation process.
However, [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] did not examine which of these methods are more effective for
providing contextual recommendations, and therefore, left this important topic
unaddressed and without any prescriptive recommendations for which method to
use. In this paper we empirically compare the three contextual approaches, i.e.,
pre-filtering, post-filtering and contextual modeling among themselves to
determine which one is better, and also with the un-contextual approach when
contextual information is ignored and not used in the recommendation methods. We also
present several types of contextual post-filtering and contextual modeling methods
and evaluate their performance in order to identify the best-performing methods in
these two categories. We show that one particular post-filtering method dominates
other approaches. However, this dominance comes with a certain complexity
“cost” that prevents that method to be a clear winner in all the practical settings.
(a)
(b)
(c)
C1
C2
Fig. 2 Hierarchical structure of the contextual attributes (a) Time Of The Year, (b) Intent
Of Purchase and (c) Store
      </p>
      <sec id="sec-2-1">
        <title>2.1 The pre- and the post-filtering approaches</title>
        <p>
          According to the pre-filtering approach (Errore. L'origine riferimento non è
stata trovata.(a) and [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]), the contextual information is used as a label for
filtering out those ratings that do not correspond to the specified contextual
information. This filtering is done before the main recommendation method is launched
on the remaining data that passed the filter. In other words, if a particular context
of interest is C, then the pre-filtering method selects from the initial set of all the
ratings only those corresponding to the specified context C. As a result, it
generates the Users × Items matrix containing only the data pertaining to context C.
Then the 2D recommendation method (e.g., collaborative filtering) is launched on
this remaining dataset that passed the filter to generate recommendations for
context C. We call this approach Exact Pre-Filtering (EPF). According to the
postfiltering (PoF) approach (see Errore. L'origine riferimento non è stata
trovata.(b) and [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]), we first ignore all the contextual information in the data and
apply a traditional 2D recommendation method, such as collaborative filtering, on
the whole un-contextual data set, where the contextual information is dropped.
Once the unknown ratings are estimated using the 2D method on this data and the
un-contextual recommendations are produced, we “contextualize” these
recommendations as follows. Although there exist various methods for contextualizing
the 2D recommendations [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], in this paper we consider the following two
approaches, called “Weight” and “Filter”. Both approaches analyze data for a given
user in a given context to calculate the probability with which the user chooses a
certain item in the given context. After that, the recommendations obtained using
this 2D method are “contextualized” by using the contextual probabilities. For
detailed information on the methods, see [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2 The contextual modeling approach</title>
        <p>
          According to the contextual modeling approach, we present a new method called
contextual-neighbors CM. This approach is based on user-based collaborative
filtering (CF) and works as follows. First, for each user i and context c, we define
the user profile in context c, i.e. the contextual profile Prof(i, c). For example, if
the contextual variable c has two values (e.g., Winter and Summer), then we have
two contextual profiles for each user, one for the Winter and the other for the
Summer. Note that these contextual profiles can be defined in many different
ways, some of which are presented in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], and our approach does not depend on
any particular choice of a profiling method. However, in the experimental study
described in Section 4 we use the following specific contextual profiling
technique. As explained in Section 2, we follow the transaction-based approach to RSs
and measure the utility rijc of product j for user i in context c with the purchasing
frequency xijc specifying how often user i purchased product j in context c. Then
we use this measure to define contextual profile as Prof(i, c) = (ri1c , …, rikc).
We use these profiles to define similarity among users and also to define and find
N nearest “neighbors” of user i in context c, where “neighbors” are determined
using contextual profiles Prof(i’, c’) and similarity measures between the profiles.
Although this similarity can be defined in various ways, we use a popular CF
approach and define the similarity measure d using the cosine measure as:
∑
d(Prof(i' ,c' ), Prof(i, c)) = PrPorf(oif,(ci,)c2)×• PPrroof(fi('i', c,c' ')) 2 = ∑s∈rSiis2'ccrisc ri∑'sc' ri'2sc' (1)
s∈Si 'c s∈Si 'c
where risc’ and ri’sc’ are the ratings of item s assigned by user i and user i’
respectively in context c and c’. Sii’c={s ∈ Items | risc ≠ ∅ ∧ ri’sc’ ≠ ∅} is the set of all
items co-rated by both user i and user i’ in context c.
        </p>
        <p>Then we find N nearest neighbors for the (i, c) pair by identifying pairs (i’, c’)
such that d(Prof(i’, c’), Prof(i, c)) is the largest among all the candidate pairs (i’,
c’) subject to the following constraints:</p>
        <p>• Mdl1: there are no constraints on the set of (i’, c’) pairs, and we select N pairs
that are the most similar to (i, c).</p>
        <p>• Mdl2: we select an equal proportion of pairs (i’, c’) corresponding to each
context c (e.g., if the contextual variable has only two values, Winter and Summer
respectively, and the neighborhood size is 80, we select 40 neighbors from Winter
and 40 from Summer).</p>
        <p>• Mdl3: we select N pairs (i’, c’) that are the most similar to (i, c) corresponding
to each context c at the same level of the context of interest (e.g., if the context of
interest is “Winter Holiday” in Fig. 2(a), we select the neighborhood by using only
profiles referred to level C2 of that contextual variable).</p>
        <p>• Mdl4: we select an equal proportion of pairs (i’, c’) corresponding to each
context c at the same level of the context of interest (e.g., if the context of interest
is “Winter Holiday” in Fig. 2(a) and the neighborhood size is 80, we define the
neighborhood by using 20 users from the context “Winter Holiday”, 20 users from
the context “Winter Not Holiday”, 20 users from the context “Summer Holiday”
and 20 users from the context “Summer Not Holiday”).</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3 Experimental Setup</title>
      <p>We compared the pre-filtering, post-filtering, contextual modeling and
uncontextual recommendations across a wide range of experimental settings. First,
we selected three different data sets. The first dataset (DB1) comes from an
ecommerce website commercially operating in an European country which sells
electronic products to approximately 120,000 users and contains about 220,000
purchasing transactions during an observation period of three years. For this
dataset, we selected the time of the year as a contextual variable. Its hierarchical
structure is presented in Fig. 2(a). The classification into Summer or Winter and
Holiday or Not Holiday is based on the experiences of the CEO of the e-commerce
website that we used in our study. The data was pre-processed by excluding
customers who made only one single transaction or had abnormal behavior or had
transactions either only in the first two years or only in the third year. The
resulting dataset contained about 1,500 users and about 10,000 transactions thus having
about 7 transactions per user.</p>
      <p>
        The second dataset (DB2) is taken from the study described in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. First, a special
purpose browser was developed to help users navigate Amazon.com website and
purchase products on its site. This browser was made available to a group of
students who were asked to navigate and simulate purchases on Amazon.com during
a period of four months. Once a product was selected by a student to be
purchased, the browser recorded the selected item, the purchasing price and other
useful characteristics of the transaction and this information was stored in the
database. In addition, the student was asked at the beginning of each browsing session
to specify its context, which was the intent of a purchase in our case. The structure
of this contextual variable IntentOfPurchase is presented in Fig. 2(b). Further, the
data was pre-processed by excluding the students who made less than 40
transactions and eliminating the students who had any kind of misleading or abnormal
behavior. The resulting number of students was 556, and the total number of
purchasing transactions for the students was 31,925 thus having about 57 transactions
per user.
      </p>
      <p>The third dataset (DB3) comes from an e-commerce website which sells comics
and comics-related products, such as T-shirts, DVDs and gadgets. It contains
about 50,000 transactions and 5,000 users thus having about 10 transactions per
user. We used the store (i.e., the section in the Web site where products are
bought), as a contextual variable (see Fig. 2(c)). The importance of this contextual
variable comes from the expectation that customers’ behavior changes when
navigating and buying products in different sections of the Web site.</p>
      <p>
        In our study, we recommend product categories instead of individual items
because the e-commerce applications that we consider have very large numbers of
items (hundreds of thousands or even millions). We tried different item
aggregation strategies and found that the best results are for 14 categories for DB1, 24
categories for DB2 and 136 for DB3. We aggregated items into categories according
to the classification provided by the Web sites product catalogue. Estimations of
unknown utilities were done by using a standard user-based collaborative filtering
(CF) method [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. According to the CF approach, the neighborhood was formed
using the cosine similarity [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The neighborhood size N was set to N = 80 users,
which proved to be the optimal size in our experiments.
      </p>
      <p>
        The experiments were performed for datasets DB1, DB2 and DB3 for all the
levels of contextual knowledge (un-contextual, C1 and C2), as presented in Fig. 2.
We have performed t-tests in order to determine if the chosen contextual variables
matter. The results of these tests were statistically significant (at 95% level) and
demonstrated that the contextual variables TimeOfTheYear, IntentOfPurchase and
Store matter. We used Precision, Recall, F-Measure, Mean Absolute Error (MAE)
and Root Mean Square Error (RMSE) [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] as the performance measures in our
experiments, as done in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. We computed MAE and RSME in a standard way by
taking absolute and squared differences between the estimated and actual values
of estimated utilities since the utilities are measured using discrete variables [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ],
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Finally, we divided each dataset into the training and the validation
sets, the training set containing 2/3 and the validation set 1/3 of the whole dataset.
For the DB1 dataset, the first two years were the training set and the third year
was the validation set. For the DB2 dataset, we randomly split it in 2/3 for the
training set and the remaining 1/3 for the validation set (in this case, it was
impossible to make a good temporal split because all the transactions were made
within a couple of months). For the DB3 dataset, the first nine months were the
training set and the last three months were the validation set.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4 Results</title>
      <p>
        Firstly, we compare the traditional un-contextual 2D collaborative filtering and the
exact pre-filtering (EPF) methods. This type of comparison has been studied in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
before, where it was shown that in certain cases the un-contextual approach
dominates, while in other cases the contextual one dominates the un-contextual method.
However, the work reported in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] was done in the context of multi-dimensional
recommendations and only for one small dataset. In this study, we wanted to
provide a more extensive comparison of the un-contextual and the pre-filtering
methods across several and bigger datasets and across numerous other experimental
settings. Furthermore, we also needed to conduct this comparison for the
uniformity reasons because we also do the comparison of the post-filtering and the
contextual modeling approaches to the un-contextual approach in this paper.
We compared the un-contextual and the EPF contextual methods for the case of
user-based collaborative filtering, across the datasets DB1, DB2 and DB3, across
multiple levels of classification hierarchy (C1 and C2) and different performance
measures. We split the data into the training and testing sets as described in
Section 4. For the sake of brevity we only present a summary of these results in Table
1 where only the F-measure, averaged across the experimental setting, is reported
and Table 2 where only the MAE, averaged across the experiment settings, is
reported. Table 1 and Table 2 report all the accuracy gains (in terms of F-measure
and MAE, namely) across each recommender systems for DB1, DB2 and DB3
(negative values mean performance reduction). For example, the first row of Table
1 shows the performance gains (reductions), in terms of F-measure, for the
uncontextual RS vis-à-vis the EPF, Filter PoF, Weight PoF and Mdl1 methods. The
matrixes in Table 1 and Table 2 are anti-symmetric, as should be the case when
two methods are compared in terms of their relative performance. However, the
comparison among the other performance measures used in the experiments is
discussed throughout this section.
      </p>
      <p>
        The EPF dominates the un-contextual case in some cases in terms of the
Fmeasure and is dominated in other cases, consistently with [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The intuitive
explanation is that when the contextual information is very granular, this results in a
more homogeneous rating prediction model but also leads to data sparsity issues
(only few context-specific ratings are used to build the model). In contrast, when
the contextual information is too coarse, we can have enough data to build the
model, but it is more heterogeneous. This conflict between homogeneity of the
model and rating sparsity produces mixed results when comparing EPF and the
un-contextual model. Furthermore, EPF outperforms the un-contextual model in
terms of MAE and RMSE (see Table 2).
      </p>
      <p>Secondly, we compare the un-contextual and the post-filtering methods. Unlike
exact pre-filtering, there exist many post-filtering methods. The comparison can
depend very significantly on the choice. We compared the Weight PoF and Filter
PoF and the un-contextual method across various experimental conditions
described in Section 3. The Filter PoF dominates the un-contextual case across all
the levels of context for the F-measure: for DB1, the difference between
contextual and un-contextual models is 20% on average, for DB2 it is 27% on average
and for the DB3 it is 90%. On the contrary, Weight PoF is always dominated by
the un-contextual model. All this makes us to conclude that the comparison of the
post-filtering and un-contextual methods depends very significantly on the type of
the post-filtering method being used. This observation is not surprising because
the post-filtering method takes the results of the same 2D un-contextual method
and reorders them based on the context-based post-filtering heuristic. If the
heuristic is “good” (such as Filter), then this re-ordering improves recommendation
quality; otherwise, it can make the results even worse (as for heuristic Weight).
Thirdly, we compare the un-contextual and the contextual modeling methods. We
consider four methods Mdl1, Mdl2, Mdl3, Mdl4 in our study and compare them to
the un-contextual method across the experimental conditions. The experiments
proved that the performances of the four CM approaches are very similar and the
differences are not statistically significant. For this reason we only present the
performance of Mdl1 because it slightly dominates other CM methods in some cases.
This makes sense because the N neighbors are selected for Mdl1 in an
unconstrained manner, whereas they are selected based on various constraints for the
other three approaches. As Table 1 and Table 2 show, Mdl1 always outperforms
the un-contextual model.</p>
      <p>Finally, we compare the performance of the pre-filtering, post-filtering and
contextual modeling methods. Table 1 shows that Weight PoF is the worst
recommender system (it never outperforms any other RS) followed by the un-contextual
one (it outperforms only the Weight PoF while it is dominated by all the other
context-aware RSs). On the other side, the Filter PoF is the best recommender
system (it outperforms all the other systems and it is dominated by the Mdl1 only
for the DB1) followed by the Mdl1 (it generally outperforms all the other RSs with
the exception of Filter PoF). The EPF always outperforms the un-contextual
recommender system, while in some cases dominates the other context-aware RSs
and in other cases is dominated by them. Table 2 shows similar results. In
summary, the answer to the question of which contextual approach provides the best
performance depends on the type of the post-filtering method used. In fact,
although the Filter PoF does not outperform all the other approaches in every
setting, it does in most settings, while the Weight PoF is often outperformed by all
the other context-based approaches. Overall, the Filter PoF is the best approach in
78% of comparisons in terms of F-measure (i.e., seven times out of 9), 67% in
terms of Precision, 100% in terms of MAE and RMSE (i.e., three times out of
three). In terms of F-measure, the CM approach (Mdl1) is the best in 44% of
comparisons (percentage values do not sum to 100% because in some cases
approaches are equivalent), while EPF only 11%. The alternative post-filtering method,
Weight PoF, has never the highest performance in terms of F-measure. In contrast,
it has the lowest F-measure and Precision in 78% of cases, the lowest Recall in
33% of comparisons, and it is always the worst in terms of MAE and Recall.
Although we demonstrated that the post-filtering method Filter PoF dominates
alternative contextual approaches considered in the paper in terms of better
recommendation performance, it does not mean that it should always be used in practice
for the following reasons. First, using Filter PoF entails finding the right
parameters for the method, such as the size of the neighborhood N and the threshold
value. Determining good values of these parameters in a specific application can be
time consuming and expensive. Also, different post-filtering methods, such as
Weight vs. Filter, have different types of parameters, and this complicates the
selection of the best post-filtering method even further.</p>
      <p>Table 1 F-measure gains (reductions) across recommender systems.</p>
      <p>DB1
DB2
DB3</p>
      <p>Un-contextual
EPF
Filter PoF
Weight PoF
Mdl1
Un-contextual
EPF
Filter PoF
Weight PoF
Mdl1
Un-contextual
EPF
Filter PoF
Weight PoF
Mdl1</p>
      <p>Un-contextual EPF
0%
2%
20%
-27%
22%
0%
2%
27%
-14%
7%
Finally, post-filtering methods are computationally more expensive than the
prefiltering method EPF because only a relatively small subset of data is used in
estimations of unknown ratings for EPF. Therefore, the selection of the best
contextual modeling method in a given application is more complicated in practice than a
simple rule “always use Filter PoF in all the settings.” The CM approach is a good
alternative in practice, because it proved to be the second-best and the
performance is stable across various possible CM methods unlike the post-filtering
approach.</p>
    </sec>
    <sec id="sec-5">
      <title>5 Conclusions</title>
      <p>In this paper, we compared the un-contextual and contextual recommender
systems, for which we considered the pre-filtering, the post-filtering and the
contextual modeling methods of generating contextual recommendations. In particular,
we used the exact pre-filtering (EPF) and the Weight and the Filter post-filtering
methods for the first two approaches. Moreover, we proposed a new type of
contextual modeling, that we called contextual neighbors CM, and four specific types
of contextual neighbors methods, called Mdl1, Mdl2, Mdl3 and Mdl4, each of them
selecting contextual neighborhoods in a somewhat different way.</p>
      <p>We compared the un-contextual with the contextual methods across various
experimental settings, including three datasets, different level of item aggregation,
different neighborhood sizes, seven recommendation engines (un-contextual, EPF,
Filter PoF, Weight PoF, Mdl1, Mdl2, Mdl3 and Mdl4), different contextual levels
(C1 and C2) and several performance measures (Precision, Recall, F-Measure,
MAE and RMSE). We showed that the contextual Filter method dominates the
uncontextual one and that the un-contextual method dominates the Weight method.
We also showed that EPF dominates the un-contextual method in some cases and
is inferior in other cases on the datasets (and the corresponding applications) used
in our study. We also compared the contextual neighbors methods to identify the
best performing one. Although Mdl1 slightly outperforms the others, there are no
significant performance differences among them. This result is not surprising
because different ways of selecting contextual neighborhood do not fundamentally
change recommendation results. We have then selected Mdl1 as the best-of-breed
contextual modeling method and compared it with the pre-, post- filtering and
uncontextual methods. We showed that Mdl1 dominates the traditional un-contextual
approach and is comparable to the pre-filtering method (EPF). We have also
shown that Mdl1 dominates some of the less advanced post-filtering methods (such
as Weight PoF) but is inferior to the best post-filtering methods (such as Filter
PoF). This implies, among other things, that, in order to decide which approach
should be used in a particular recommendation application, various post-filtering
methods should be carefully compared which is a laborious and a time-consuming
strategy. However, the CM approach is a good alternative in practice, because it
proved to be the second-best and the performance is stable across various possible
CM methods unlike the post-filtering approach.</p>
      <p>As future work we intend to analyze deeper different aspects of the comparisons
between pre-filtering, post-filtering and contextual modeling approaches. In
particular, we plan to analyze the impact of different approaches in terms of
generated profit and sales, diversity of the recommended items and trust of the users on
the different recommender systems. Moreover, we will compare the contextual
neighbor approach to other contextual modeling approaches.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bettman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luce</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Payne</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>Constructive Consumer Choice Processes</article-title>
          .
          <source>Journal of Consumer Research</source>
          ,
          <volume>25</volume>
          (
          <issue>3</issue>
          ),
          <fpage>187</fpage>
          -
          <lpage>217</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Palmisano</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Gorgoglione</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Using Context to Improve Predictive Modeling of Customers in Personalization Applications</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>20</volume>
          (
          <issue>11</issue>
          ),
          <fpage>1535</fpage>
          -
          <lpage>1549</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cena</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Console</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gena</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Modeo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al. (
          <year>2006</year>
          ).
          <article-title>Integrating heterogeneous adaptation techniques to build a flexible and usable mobile tourist guide</article-title>
          .
          <source>AI Commun</source>
          .,
          <volume>19</volume>
          (
          <issue>4</issue>
          ),
          <fpage>369</fpage>
          -
          <lpage>384</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. van Setten,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Pokraev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            , &amp;
            <surname>Koolwaaij</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Context-Aware Recommendations in the Mobile Tourist Application COMPASS</article-title>
          .
          <source>In Adaptive Hypermedia and Adaptive Web-Based Systems</source>
          (pp.
          <fpage>235</fpage>
          -
          <lpage>244</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Adomavicius</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sankaranarayanan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Incorporating Contextual Information in Recommender Systems Using a Multidimensional Approach</article-title>
          .
          <source>ACM Transactions on Information Systems</source>
          ,
          <volume>23</volume>
          ,
          <fpage>103</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Adomavicius</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>Multidimensional Recommender Systems: A Data Warehousing Approach</article-title>
          . In Electronic Commerce (pp.
          <fpage>180</fpage>
          -
          <lpage>192</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Anand</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mobasher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berendt</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hotho</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mladenic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Semeraro</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Contextual Recommendation</article-title>
          . In From Web to Social Web:
          <article-title>Discovering and Deploying User and Content Profiles</article-title>
          (Vol.
          <volume>4737</volume>
          , pp.
          <fpage>142</fpage>
          -
          <lpage>160</lpage>
          ): Springer Berlin / Heidelberg.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chin</surname>
          </string-name>
          , C.-Y.,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Men</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Supporting Context-Aware Media Recommendations for Smart Phones</article-title>
          . Pervasive Computing, IEEE,
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <fpage>68</fpage>
          -
          <lpage>75</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Herlocker</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>Content-Independent Task-Focused Recommendation</article-title>
          .
          <source>IEEE Internet Computing</source>
          ,
          <volume>5</volume>
          (
          <issue>6</issue>
          ),
          <fpage>40</fpage>
          -
          <lpage>47</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Adomavicius</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Context-Aware Recommender Systems</article-title>
          . In Springer (Ed.),
          <source>Handbook on Recommender Systems.</source>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>L. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hsu</surname>
            ,
            <given-names>F. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>M. C.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hsu</surname>
            ,
            <given-names>Y. C.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Developing recommender systems with the consideration of product profitability for sellers</article-title>
          .
          <source>Information Sciences</source>
          ,
          <volume>178</volume>
          (
          <issue>4</issue>
          ),
          <fpage>1032</fpage>
          -
          <lpage>1048</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chung</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>A graph model for e-commerce recommender systems</article-title>
          .
          <source>In Journal of the American Society for Information Science and Technology</source>
          ,
          <source>2004</source>
          (Vol.
          <volume>55</volume>
          , pp.
          <fpage>259</fpage>
          -
          <lpage>274</lpage>
          , Vol.
          <volume>3</volume>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>Link prediction approach to collaborative filtering</article-title>
          .
          <source>In JCDL '05: Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries</source>
          ,
          <source>2005</source>
          (pp.
          <fpage>141</fpage>
          -
          <lpage>142</lpage>
          ): ACM Press
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Kwon</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>Concept lattices for visualizing and generating user profiles for context-aware service recommendations</article-title>
          .
          <source>Expert Systems with Applications</source>
          ,
          <volume>36</volume>
          (
          <issue>2</issue>
          ),
          <fpage>1893</fpage>
          -
          <lpage>1902</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Panniello</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gorgoglione</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmisano</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pedone</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <article-title>Experimental comparison of pre- vs. post-filtering approaches in context-aware recommender systems</article-title>
          .
          <source>In RecSys '09: Proceedings of the third ACM conference on Recommender systems</source>
          , New York, New York, USA,
          <year>2009</year>
          (pp.
          <fpage>265</fpage>
          -
          <lpage>268</lpage>
          ): ACM
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Resnick</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Varian</surname>
            ,
            <given-names>H. R.</given-names>
          </string-name>
          (
          <year>1997</year>
          ).
          <article-title>Recommender systems</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>40</volume>
          (
          <issue>3</issue>
          ),
          <fpage>56</fpage>
          -
          <lpage>58</lpage>
          , doi:10.1145/245108.245121.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Breese</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heckerman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Kadie</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>Empirical analysis of predictive algorithms for collaborative filtering</article-title>
          .
          <source>In Proceedings of the 14th conference on Uncertainty in Artificial Intelligence 1998</source>
          (Vol.
          <volume>461</volume>
          , pp.
          <fpage>43</fpage>
          -
          <lpage>52</lpage>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Herlocker</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Terveen</surname>
            ,
            <given-names>L. G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Riedl</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Evaluating collaborative filtering recommender systems</article-title>
          .
          <source>ACM Transactions on Information Systems</source>
          ,
          <volume>22</volume>
          (
          <issue>1</issue>
          ),
          <fpage>5</fpage>
          -
          <lpage>53</lpage>
          , doi:10.1145/963770.963772.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Nichols</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <article-title>Implicit rating and filtering</article-title>
          .
          <source>In Proceedings of the Fifth DELOS Workshop on Filtering and Collaborative Filtering</source>
          ,
          <year>1998</year>
          (pp.
          <fpage>31</fpage>
          -
          <lpage>36</lpage>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Oard</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Kim</surname>
          </string-name>
          , J.
          <source>Implicit Feedback for Recommender Systems In Proceedings of the AAAI Workshop on Recommender Systems</source>
          ,
          <year>1998</year>
          (pp.
          <fpage>81</fpage>
          -
          <lpage>83</lpage>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Adomavicius</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tuzhilin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Personalization and Recommender Systems</article-title>
          . In Z.
          <string-name>
            <surname>-L. C.</surname>
          </string-name>
          <article-title>a</article-title>
          . S. Raghavan (Ed.),
          <article-title>State-of-the-Art Decision Making Tools in the Information-Intensive Age</article-title>
          (pp.
          <fpage>55</fpage>
          -
          <lpage>100</lpage>
          ): Tutorials in Operations Research.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>