<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Eval u at i o n o f Co l l ab o r at i ve Fi l t er i n g Al g o r i t h m s f o r Rec o m m en d i n g Ar t i c l es o n Ci t eUL i k e</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Denis Parra</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Brusilovsky</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Information Sciences, University of Pittsburgh</institution>
          ,
          <addr-line>135 North Bellefield Avenue, Pittsburgh, PA 15260</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Motivated by the potential use of collaborative tagging systems to develop new recommender systems, we have implemented and compared three variants of user-based collaborative filtering algorithms to provide recommendations of articles on CiteULike. On our first approach, Classic Collaborative filtering (CCF), we use Pearson correlation to calculate similarity between users and a classic adjusted ratings formula to rank the recommendations. Our second approach, Neighbor-weighted Collaborative Filtering (NwCF), incorporates the amount of raters in the ranking formula of the recommendations. A modified version of the Okapi BM25 IR model over users' tags is implemented on our third approach to form the user neighborhood. Our results suggest that incorporating the number of raters into the algorithms leads to an improvement of precision, and they also support that tags can be considered as an alternative to Pearson correlation to calculate the similarity between users and their neighbors in a collaborative tagging system.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Collaborative-filtering</kwd>
        <kwd>recommender systems</kwd>
        <kwd>tagging</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        The new generation of collaborative tagging systems such as
Delicious or CiteULike presented a new challenge to researchers
and practitioners in the area of recommender systems. While both
content-based [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and collaborative filtering recommender
systems [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] achieved a remarkable success in traditional
information repositories, social tagging systems may need some
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that
copies bear this notice and the full citation on the first page. To copy
otherwise, or republish, to post on servers or to redistribute to lists,
requires prior specific permission and/or a fee.
      </p>
      <p>
        Web 3.0: Merging Semantic Web and Social Web at
HyperText ’09 June 29th, 2009, Torino, Italy.
different recommendation approaches. First of all,
usercontributed content is more diverse in its nature and quality than
centrally created and structured content of traditional repositories.
Second, traditional 5-10 point ratings are typically not available –
only the fact that an item was contributed or bookmarked by the
user is present in the system. At the same time, the loss of quality
control and fine-grained ratings in collaborative tagging systems
is compensated by the presence of tags and (in most systems)
explicit connections between users. It looks evident that
recommendation approaches for collaborative tagging systems
should capitalize on the success of classic recommender system,
while trying to harness the new power provided by tags and social
links. However, there is no shared understanding of how these
features have to be taken into account to improve the quality of
personalization. A few pioneer projects explored different ways to
integrate social links or social tags into collaborative
recommendation [
        <xref ref-type="bibr" rid="ref3 ref4 ref6">3, 4, 6</xref>
        ], and content-based recommendation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
approaches. To some extent, the results are encouraging -- both
social links and tags do indeed improve the personalization
quality. At the same time, the overall recommendation quality is
unusually low – the precision for both content based and
collaborative “tag-aware” recommendation reported in [
        <xref ref-type="bibr" rid="ref4 ref6">4, 6</xref>
        ] stays
in the range of 0.1-0.3. The lack of reliable success calls for
further research on recommendation in social tagging systems.
This paper contributes to this stream of research by exploring two
extensions of the traditional collaborative filtering approaches.
First, we argue that the diverse user-contributed nature of content
in collaborative tagging systems requires more evidence of
relevance and quality than in traditional systems where the
content is co-rated by the site developers. In this context,
recommender algorithms should favor items bookmarked by more
users. However, classic algorithms do not take the number of
raters into account. Second, we argue that due to the large volume
of items and low overlap between user bookmarks traditional
approach for neighborhood calculation may be not most efficient.
Two users who are very similar in their interests may still have too
few common items bookmarked. In this context, tags applied by
users can provide a more reliable approach to find similar users
and this to get better recommendation. To assess our hypotheses
we developed variants of user-based collaborative filtering, which
take into account the number of users who bookmarked an item
and one approach use tags-level similarity instead of traditional
Pearson correlation to form user neighborhood.
      </p>
      <p>The rest of the paper is addressed as follows. Section 2 describes
the characteristics of the data and how it was collected. Section 3
describes the three recommender approaches developed: Classic
Collaborative Filtering (CCF), Neighbor-weighted Collaborative
Filtering (NwCF) and BM25-based similarity (BM25). In Section
4 we describe the study conducted and present the results. Section
5 introduces relevant related work, in Section 6 we address the
discussion and in Section 7 we summarize conclusions and future
work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. DATASET</title>
      <p>We performed our study based on data that we crawled from
CiteULike1. The daily datasets provided by CiteULike lack a lot
of relevant information necessary to develop our algorithms, as
the title and the authors of each article.</p>
      <p>We selected a group of users to be our center users, i.e., those
who would receive the recommendations. For each one of these
center users, we crawled her posted articles (id, title, authors, post
timestamp, and tags associated), the neighborhood of users who
posted her same articles, and the neighborhood of users who share
her same tags. To avoid limiting the neighborhood due to tag
variations as hyphens, underscores and plurals, we enhanced the
spreading of tags by adding stemmed tags using Krovetz
algorithm, and modified tags changing hyphens and underscores
to eventually be added to the set of tags to be crawled.
The details of the final dataset are described in Table 1. We chose
7 center users and we crawled all their articles and tags. We chose
100 neighbors for each center user, selecting those neighbors with
more shared tags in amount and frequency. There was an overlap
between these neighbors, so we finally crawled 358 users,
including center users and neighbors. For each of these neighbors
we also crawled all their articles and tags. In Table 1, annotations
correspond to tuples of the style {user, article, tag}</p>
      <p>Table 1. Description of the dataset</p>
      <p>Item
users
articles</p>
      <p>tags
annotations
# of unique instances</p>
      <p>358
186,122
51,903
902,711</p>
    </sec>
    <sec id="sec-3">
      <title>3. ALGORITHMS</title>
      <p>To create user-based recommendations using collaborative
filtering, two processes are necessary. The first one is finding the
neighborhood of the center user, i.e., her most similar users. Once
the most similar users are identified, the second process is to rank
the articles to be recommended. These articles will be taken from
the set of articles which the neighbors have rated as their
favorites, yet discounting those articles that the center user already
has posted.</p>
      <p>We implemented three user-based collaborative filtering
approaches: Classic Collaborative Filtering (CCF),
Neighborweighted Collaborative Filtering (NwCF) and BM25-based
similarity (BM25).</p>
    </sec>
    <sec id="sec-4">
      <title>3.1 Classic Collaborative Filtering (CCF)</title>
      <p>
        This approach is described in detail in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In the CCF model, the
similarity between two users is calculated using the Pearson
correlation over the ratings of their common items. The formula
for the Pearson correlation, as stated in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], is:
(1)
(2)
(3)
userSim(u, n) =
      </p>
      <p>
        ∑i⊂CRu,n (rui − ru )(rni − rn )
∑i⊂CRu,n (rui − ru )2
∑i⊂CRu,n (rni − rn )2
In the formula, r stands for rating, u denotes the center user and n
a neighbor. CRu,n denotes the set of co-rated items between u and
n. After performing this calculation, we select the top ten most
similar users. Next, we rank the articles of these users to
recommend to the center user, using the formula of predicted
rating for user u with average adjusts described in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
pred (u, i) = ru +
∑
n⊂neighbors(u)
∑
n⊂neighbors(u)
userSim(u, n) ⋅ (rni − rn )
userSim(u, n)
      </p>
    </sec>
    <sec id="sec-5">
      <title>3.2 Neighbor-weighted Collaborative</title>
    </sec>
    <sec id="sec-6">
      <title>Filtering (NwCF)</title>
      <p>This method is an enhancement of our CCF implementation. The
neighborhood of ten users is obtained in exactly the same way,
using the Pearson correlation. However, we have incorporated the
number of raters in the calculation of the ranking of the articles.
We do it due to a large amount of the articles have been rated by
only one or at most two users. In this way, we push up in the
recommendation list those articles rated by a larger number of
neighbors. The new predicted rating is given by
pred ′(u, i) = log10 (1 + nbr(i)) ⋅ pred (u, i)</p>
    </sec>
    <sec id="sec-7">
      <title>3.3 BM25-based Similarity (BM25)</title>
      <p>
        BM25, also known as Okapi BM25, is a non-binary probabilistic
model used in information retrieval [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. It calculates the relevance
that the documents of one collection have, given a query. As we
try to take advantage of the set of tags of each user, we made two
analogies: comparing the tags of the center user with a query, and
the set of tags of each neighbor with a document. Based on this
idea, we performed a similarity calculation based on the BM25
model and thus we obtained her neighborhood. Our proposed
BM25-based similarity model is taken from the calculation of the
Retrieval Status Value of a document (RSVd) of a collection
given a query [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]:
RSVd = ∑ IDF ⋅
t∈q
      </p>
      <p>
        (k1 + 1)tftd
k1 ((1 − b) + b × (Ld / Lave )) + tftd
⋅
(k3 + 1)tftq (4)
k3 + tftq
In our model RSVd represents the similarity score between the
center user (the terms of the query q) and one neighbor (the terms
of the document d). This similarity is calculated as a sum over
every tag t posted by the center user. The neighbor d is
represented as her set of tags with their respective frequencies. Ld
is the document length, in our case is the sum of the frequencies
of each tag of the neighbor d. Lave is the average of the Ld of
every neighbor. The term tftd is the frequency of the tag t into the
set of tags of the neighbor d. tftq represents the frequency of the
tag t into the query, i.e., the set of tags of the center user. Finally,
k1, k3 and b are parameters that we have been set in 1.2, 1.2 and
0.8 respectively, values slightly different from those suggested by
default in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>After calculating the similarity between the center user and each
neighbor, we choose the top N similar neighbors, and then we
calculate the ranking of the recommended articles using the
formula (3).</p>
    </sec>
    <sec id="sec-8">
      <title>4. THE STUDY</title>
    </sec>
    <sec id="sec-9">
      <title>4.1 Experiments</title>
      <p>To perform our study, we selected seven active CiteULike users
which had posted at least 50 articles each. Four of the subjects are
part of the Personalized Adaptive Web Systems (PAWS) lab of
the School of Information Sciences at the University of
Pittsburgh. Three additional subjects were selected randomly from
a list of active CiteULike users.</p>
      <p>
        For each subject we generated 4 sets with 10 ranked articles each
one. The first three lists were generated using the methods CCF,
NwCF and BM25, considering 10 neighbors for each center user.
The fourth list was generated using BM25, yet considering 20
neighbors. To avoid pitfalls in the evaluation [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], for each subject
we combined the 4 sets of recommendations into one set, we
changed the order of the articles randomly and we ask them to
evaluate each article relevancy (relevant, somewhat relevant, and
not relevant), and novelty (novel, somewhat novel, and not novel)
using a 3-point scale. For example, one article can be evaluated as
relevant but not novel (because it was already known), and
another article can be judged to be relevant and also novel,
because the user just discovered and found it to be important to
her interests.
      </p>
      <p>Another aspect considered to control the evaluation was to
provide the URL on CiteULike of each article. We requested each
subject to evaluate the articles based on that information or
looking for the abstract on the internet, but don’t going further
than the abstract.</p>
    </sec>
    <sec id="sec-10">
      <title>4.2 Results</title>
      <p>
        For each subject, we calculated normalized Discounted
Cumulative Gain (nDCG) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], Precision_2 @ 5, Precision_2 @
10, Precision_2_1 @ 5 and Precision_2_1 @ 10 over the different
initial four lists of recommendations. In Precision_2_1, we
consider relevant those articles evaluated as Relevant and
Somewhat Relevant. In Precision_2, we only consider relevant the
articles evaluated as Relevant. Besides, we calculated the average
Novelty for each user on each method.
      </p>
      <p>Figure 1 (a) shows us smooth results on different subjects and not
so different results on the values of nDCG between different
algorithms. However, if we compare them further, we can see that
CCF performed the worst and is not so clear which one,
BM25_10, BM25_20 or NwCF are significantly the best. This
result suggests us that the ranking order of the recommendations,
in general, is very close to the optimal one, where the most
relevant articles are at the top and the less ones at the bottom. On
the other hand, CCF shows in general a better level of novelty.
The results on Precision_2 and Precision_2_1 do not let us infer
easily some ideas, but we can see some trends. In general, CCF
has the worst results, suggesting that including the amount of
raters in the ranking formula is an important factor to consider in
the success of these recommendations. In addition, the dissimilar
results of BM25 using 10 and 20 neighbors, suggests that we
should have taken a threshold to select the size of the
neighborhood instead of choosing a fixed number such as 10 or
20. For example, CiteULike shows a neighborhood for each user,
including just those who share at least the median number of
articles of the center user.</p>
    </sec>
    <sec id="sec-11">
      <title>5. RELATED WORK</title>
      <p>
        A few pioneer projects explored different ways to integrate social
links or social tags. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], the authors incorporate social tags and
also the concept of web of trust for the issue of quality assessment
into a collaborative recommendation approach. The study in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
investigates the effect of incorporating tags to different CF
algorithms, testing their algorithms on last.fm, a musical social
tagging system, obtaining promising results. The approach
presented in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] compared a pure content-based with a
tagenhanced recommender, showing an improvement in predicted
accuracy in the context of cultural heritage personalization.
user_1 user_2 user_3 user_4 user_5 user_6 user_7
(a) nDCG
user_1 user_2 user_3 user_4 user_5 user_6 user_7
(b) Average Novelty score
user_1 user_2 user_3 user_4 user_5 user_6 user_7
(c) Precision_2 @ 5
user_1 user_2 user_3 user_4 user_5 user_6 user_7
user_1 user_2 user_3 user_4 user_5 user_6 user_7
user_1 user_2 user_3 user_4 user_5 user_6 user_7
0
CCF
NwCF
BM25_10
BM25_20
CCF
NwCF
BM25_10
BM25_20
1.2
1
0.2
0
1.2
0.2
0
CDG0.6
n
1.2
1
0.8
0.4
0.2
0
1.2
1
0.2
0
CCF
NwCF
BM25_10
BM25_20
CCF
NwCF
BM25_10
BM25_20
The study presented in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] describes the use of CiteULike for
recommending scientific articles to users. They compared three
different collaborative filtering algorithms, two item-based and
one user-based, and they found that the user-based performed the
best. They evaluated their algorithms using accuracy metrics as
MAP, MMR and Precision @ 10, with low accuracy levels, in the
range 0.1-0.3.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] McNee et al. developed three algorithms to recommend
articles to users, and they assessed them with a detailed survey on
real users. In some algorithms, the subjects provided strong
negative results, and the authors describe in their conclusion that
when evaluating a recommender system “the evaluation must be
done with real users, as current accuracy metrics cannot detect
these problems”. Based on this study we decided to ask the
subjects to evaluate the novelty in addition to the relevance of the
recommended articles. Four of our seven subjects commented at
the end of the survey that they found very interesting articles in
their recommendation list.
      </p>
    </sec>
    <sec id="sec-12">
      <title>6. DISCUSSION</title>
      <p>During the development of our approaches, we were stack for a
while on CCF and NwCF for the low quality of the preliminary
recommendations. We were using the ratings given by the users to
obtain their neighborhood, which are given by 5-star scale and an
“I've already read it” description. Since many users post articles
without taking care of the ratings (by default it is 2 stars), and
their evaluation criteria can vary a lot among different users, we
decided to change the scale for a 3-point one. Afterwards, the
results showed a significant improvement. We suggest paying
attention to the rating scale used in recommender algorithms for
social bookmarking systems, in order to diminish the impact of
noise and users’ criteria.</p>
      <p>We consider that the inclusion of the amount of raters in the
ranking formula is an important contribution. The Figure 1 shows
clearly that both nDCG and Precision metrics had better results
for NwCF than for CCF. This result supports our claim that the
“social knowledge” provided by the amount of raters helps to
decrease the uncertainty implicit on items with too few ratings.
However, this approach should be considered carefully depending
on the user information need. CCF shows, in general, the best
novelty values among the subjects, but this idea should be tested
with more users to be claimed as true.</p>
      <p>Regarding BM25-based similarity, in most cases it performs better
than CCF, but with no predictable results between using 10 or 20
neighbors, which implies that a threshold based on each user
characteristics should result better than a fixed number of
neighbors.
uncertainty produced by items with too few ratings. Third, a
tagbased approach to obtain the neighborhood of a user on social
tagging systems can be a suitable alternative to classical Pearson
correlation. Our survey to seven users was a preliminary study and
on eventual investigations we will consider more subjects to
support our findings.</p>
      <p>For our future research, we have already discussed two ideas.
Firstly, we want to incorporate tags on the ranking model. On this
study we used tags only to obtain the neighborhood, i.e., to
perform the user-similarity calculations. We believe extending the
use of tags can improve the results of precision of our BM25
approach. Secondly, we will cluster the users’ tags. Users can
have more than one interest of research, which is easy to observe
while examining their tags. We will implement clustering
algorithms to identify the different interests of the users and we
expect to provide more topic-oriented recommendations.</p>
    </sec>
    <sec id="sec-13">
      <title>8. ACKNOWLEDGMENTS</title>
      <p>This material is based upon work supported by the National
Science Foundation under Grant No. 0840597.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Pazzani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Billsus</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2007</year>
          <article-title>Content-Based Recommendation Systems</article-title>
          .
          <source>The Adaptive Web. (May</source>
          <year>2007</year>
          ),
          <fpage>325</fpage>
          -
          <lpage>341</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Schafer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frankowski</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herlocker</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Sen</surname>
            ,
            <given-names>S. 2007</given-names>
          </string-name>
          <article-title>Collaborative Filtering Recommender Systems</article-title>
          .
          <source>The Adaptive Web. (May</source>
          <year>2007</year>
          ),
          <fpage>291</fpage>
          -
          <lpage>324</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Massa</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Avesani</surname>
            ,
            <given-names>P. 2004</given-names>
          </string-name>
          <article-title>Trust-Aware Collaborative Filtering for Recommender Systems</article-title>
          .
          <source>In Proceedings of OTM Confederated International Conferences</source>
          , CoopIS, DOA, and
          <string-name>
            <surname>ODBASE (Agia</surname>
            <given-names>Napa</given-names>
          </string-name>
          , Cyprus, Oct.
          <volume>25</volume>
          -
          <fpage>29</fpage>
          ,
          <year>2004</year>
          ).
          <fpage>492</fpage>
          -
          <lpage>508</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Tso-Sutter</surname>
            ,
            <given-names>K. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marinho</surname>
            ,
            <given-names>L. B.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Schmidt-Thieme</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Tag-aware recommender systems by fusion of collaborative filtering algorithms</article-title>
          .
          <source>In Proceedings of the 2008 ACM Symposium on Applied Computing (Fortaleza, Brazil, March</source>
          <volume>16</volume>
          - 20,
          <year>2008</year>
          ).
          <source>SAC '08. ACM</source>
          , New York, NY,
          <fpage>1995</fpage>
          -
          <lpage>1999</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>de Gemmis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lops</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Semeraro</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Basile</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Integrating tags in a semantic content-based recommender</article-title>
          .
          <source>In Proceedings of the 2008 ACM Conference on Recommender Systems (Lausanne, Switzerland, October 23 - 25</source>
          ,
          <year>2008</year>
          ).
          <source>RecSys '08. ACM</source>
          , New York, NY,
          <fpage>163</fpage>
          -
          <lpage>170</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Bogers</surname>
            , T. and van den Bosch,
            <given-names>A.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Recommending scientific articles using citeulike</article-title>
          .
          <source>In Proceedings of the 2008 ACM Conference on Recommender Systems (Lausanne, Switzerland, October 23 - 25</source>
          ,
          <year>2008</year>
          ).
          <source>RecSys '08. ACM</source>
          , New York, NY,
          <fpage>287</fpage>
          -
          <lpage>290</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raghavan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Schutze</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <year>2008</year>
          Introduction to Information Retrieval. Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>McNee</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kapoor</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          <year>2006</year>
          .
          <article-title>Don't look stupid: avoiding pitfalls when recommending research papers</article-title>
          .
          <source>In Proceedings of the 20th Anniversary Conference on Computer Supported Cooperative Work (Banff</source>
          , Alberta, Canada,
          <source>November 04 - 08</source>
          ,
          <year>2006</year>
          ).
          <source>CSCW '06. ACM</source>
          , New York, NY,
          <fpage>171</fpage>
          -
          <lpage>180</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>