<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Real-World Recommender Systems for Academia: The Pain and Gain in Building, Operating, and Researching them</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Joeran Beel</string-name>
          <email>beel@nii.ac.jp</email>
          <email>joeran.beel@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Siddharth Dinesh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Birla Institute of Technology and Science</institution>
          ,
          <addr-line>Pilani</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Institute of Informatics, Digital Content and Media Sciences Research Division</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Trinity College Dublin, Department of Computer Science, ADAPT Centre</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>6</fpage>
      <lpage>17</lpage>
      <abstract>
        <p>Research on recommender systems is a challenging task, as is building and operating such systems. Major challenges include non-reproducible research results, dealing with noisy data, and answering many questions such as how many recommendations to display, how often, and, of course, how to generate recommendations most effectively. In the past six years, we built three research-article recommender systems for digital libraries and reference managers, and conducted research on these systems. In this paper, we share some experiences we made during that time. Among others, we discuss the required skills to build recommender systems, and why the literature provides little help in identifying promising recommendation approaches. We explain the challenge in creating a randomization engine to run A/B tests, and how low data quality impacts the calculation of bibliometrics. We further discuss why several of our experiments delivered disappointing results, and provide statistics on how many researchers showed interest in our recommendation dataset.</p>
      </abstract>
      <kwd-group>
        <kwd>recommender system</kwd>
        <kwd>digital library</kwd>
        <kwd>reference management</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Recommender systems is a fascinating topic for both researchers and industry.
Researchers find in recommender systems a topic that is relevant for many disciplines:
machine learning, text mining, artificial intelligence, network analysis, bibliometrics,
databases, cloud computing, scalability, data science, visualization, human computer
interaction, and many more. That makes research in recommender systems interesting
and creates many opportunities to cooperate with other researchers. For industry,
recommender systems offer an opportunity to provide additional value to customers by
helping them finding relevant items. Recommender systems may also provide a
justification to store user-related data, which may be used for generating additional revenue.
A long version of this article is available online https://arxiv.org/abs/1704.00156
In addition, recommender systems may even become a major part of the business
model, as companies such as Netflix, Spotify, and Amazon demonstrate.</p>
      <p>Over the past six years, we built, operated, and researched three research-article
recommender systems in the context of digital libraries and reference management. The
work was often rewarding, but also challenging and occasionally even painful. We
share some of our experiences in this article. This article is not a research article but a
mixture of a project report, lessons learned, text-book, and summary of our previous
research, enriched with some novel research results.1</p>
      <p>The primary audience of this article are researchers and developers who think about
developing a real-world recommender system for research purposes, or for integrating
the recommender system on-top of a real product. While our focus lies on recommender
system in the context of digital libraries and reference managers, researchers and
developers from other disciplines may also find some relevant information in our article.
As this paper is an invited paper for the “5th International Workshop on
Bibliometricenhanced Information Retrieval”, we particularly discuss our work in the context of
bibliometric-enhanced recommender systems and information retrieval.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Our Recommender Systems</title>
      <sec id="sec-2-1">
        <title>SciPlore MindMapping</title>
        <p>
          In 2009, we introduced SciPlore MindMapping
          <xref ref-type="bibr" rid="ref7">(Beel, Gipp, &amp; Mueller, 2009)</xref>
          . The
software enabled researchers to manage their references, annotations, and PDF files in
mind-maps. In these mind-maps, users could create categories that reflect their research
interests or that represent sections of a new manuscript. Users could then sort their PDF
files, annotations, and references in these categories. In 2011, we integrated a
recommender system in SciPlore MindMapping
          <xref ref-type="bibr" rid="ref1 ref6">(Beel, 2011)</xref>
          . The system was rather simple.
Whenever users selected a node in a mind-map, the text of the node was sent as search
query to Google Scholar, and the first three results of Google Scholar were shown as
recommendations.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Docear</title>
        <p>
          Docear2 is the successor of SciPlore MindMapping, pursuing the same goal, i.e.
enabling researchers to manage their references, annotations, and PDF files in mind maps
          <xref ref-type="bibr" rid="ref1 ref6">(Beel, Gipp, Langer, &amp; Genzmehr, 2011)</xref>
          . 3 In contrast to SciPlore MindMapping,
Docear has more features, a neater interface, and a more sophisticated recommender
system
          <xref ref-type="bibr" rid="ref10 ref26 ref8 ref9">(Beel, Langer, Genzmehr, &amp; Nürnberger, 2013; Beel, Langer, Gipp, &amp;
Nürnberger, 2014)</xref>
          . The recommender system features a comprehensive user modeling
1 The data and scripts we used for the novel analyses is available at http://data.mr-dlib.org
2 http://docear.org
3 Currently, Docear’s recommender system is offline because we focus on the development of Mr. DLib.
engine, and uses Apache Lucene/Solr for content-based filtering as well as some
proprietary implementations of other recommendation approaches. Docear is a desktop
software but transfers users' mind maps to Docear's servers. On the servers, Docear's
recommender system calculates user specific recommendations. Recommendations are
shown every couple of days to users, and users may also request recommendations
explicitly. Docear has a corpus of around 2 million full-text documents freely available
on the Web.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Mr. DLib</title>
        <p>
          Our latest recommender system is Mr. DLib4, a machine-readable digital library that
provides recommendations as-a-service to third parties
          <xref ref-type="bibr" rid="ref11 ref27 ref3 ref4">(Beel, Gipp, &amp; Aizawa, 2017)</xref>
          .
This means, third parties such as such as digital libraries and reference managers can
easily integrate a recommender system into their product via Mr. DLib. The
recommender system is hosted and operated by Mr. DLib and partners only need to request
recommendations for a specific item via a REST API (Figure 1).
Our first pilot partner is Sowiport
          <xref ref-type="bibr" rid="ref22">(Hienert, Sawitzki, &amp; Mayr, 2015)</xref>
          . Sowiport is
operated by ‘GESIS – Leibniz-Institute for the Social Sciences’, which is the largest
infrastructure institution for the Social Sciences in Germany. Sowiport contains about 9.6
million literature references and 50,000 research projects from 18 different databases,
mostly relating to the social and political sciences. Literature references usually cover
keywords, classifications, author(s) and journal or conference information and if
available: citations, references and links to full texts. We additionally integrated Mr. DLib
into the reference manager JabRef
          <xref ref-type="bibr" rid="ref11 ref18 ref27 ref3 ref35 ref4">(Feyer, Siebert, Gipp, Aizawa, &amp; Beel, 2017)</xref>
          , and
currently discuss the integration with two more organizations.
Recommendations in Mr. DLib are primarily calculated with Apache Lucene / Solr, i.e.
based on the metadata of the documents in Sowiports' corpus. We further experiment
with stereotype and most-popular recommendations
          <xref ref-type="bibr" rid="ref27 ref3 ref35 ref4">(Beel, Dinesh, Mayr, Carevic, &amp;
Raghvendra, 2017)</xref>
          , as well as with enhanced content-based filtering based on
keyphrase extraction and re-ranking with Mendeley readership statistics
          <xref ref-type="bibr" rid="ref18 ref3 ref35">(Siebert, Dinesh,
&amp; Feyer, 2017)</xref>
          . In the future, we plan to add further recommendation approaches;
especially link/citation-based approaches seem promising
          <xref ref-type="bibr" rid="ref33">(Schwarzer et al., 2016)</xref>
          .
3
3.1
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Recommender-System Development</title>
      <sec id="sec-3-1">
        <title>Required skills</title>
        <p>
          To build and operate a recommender system, more than just knowledge about
recommender systems and related disciplines such as text mining and machine learning is
required. Server administration, databases, web technologies, data formats (e.g. XML
or JSON), and data processing (crawling, parsing, transforming) are probably the most
important ones, but also knowledge about software engineering in general (e.g. agile
development, unit testing, etc.), scalability, data privacy laws, and project management
is helpful. Niche knowledge that does not directly relate to recommender systems may
also be beneficial and lead to novel recommendation approaches. For instance,
knowledge in bibliometrics could help to develop novel re-ranking algorithms for
content-based research-paper recommender systems
          <xref ref-type="bibr" rid="ref13">(Cabanac, 2011)</xref>
          . In such systems, a
list of recommendation candidates, would be re-ranked based on e.g. how many
citations the candidate papers have, or based on the h-index of the candidate papers’
authors, which, however, then might strengthen the Mathew effect
          <xref ref-type="bibr" rid="ref14">(Cabanac &amp; Preuss,
2013)</xref>
          . We have experimented with such algorithms but were only partly successful –
maybe because we lack the expert-knowledge in bibliometrics
          <xref ref-type="bibr" rid="ref18 ref35">(Siebert et al., 2017)</xref>
          .
3.2
        </p>
        <p>
          (No) help from the literature
There are hundreds of research articles about recommender systems in Academia
          <xref ref-type="bibr" rid="ref2 ref5">(Beel,
Gipp, Langer, &amp; Breitinger, 2016)</xref>
          , and probably thousands of articles about
recommender systems in other domains. One might expect that such a large corpus of
literature would provide advice on how to build a recommender system, and which
recommendation approaches to use. Unfortunately, this is not the case, at least in the domain
of research-paper recommender systems. The reasons are manifold. Many
recommendation approaches were not evaluated at all, compared against too simple baselines,
evaluated with too few users, or evaluated with highly tweaked datasets
          <xref ref-type="bibr" rid="ref2 ref5">(Beel, Gipp, et
al., 2016)</xref>
          . Consequently, the meaningfulness of the results is questionable.
        </p>
        <p>
          Even if evaluations were sound, recommendation effectiveness may vary a lot. In
other words, only because a recommendation approach performed well in one scenario,
does not mean it will perform well in another scenario. For instance, the TechLens team
proposed and evaluated several content-based filtering (CBF) and collaborative
filtering (CF) approaches for research-paper recommendations. In one experiment, CF and
CBF performed similarly well
          <xref ref-type="bibr" rid="ref28">(McNee et al., 2002)</xref>
          . In other experiments, CBF
outperformed CF
          <xref ref-type="bibr" rid="ref16 ref28 ref37">(Dong, Tokarchuk, &amp; Ma, 2009; McNee et al., 2002; Torres, McNee, Abel,
Konstan, &amp; Riedl, 2004)</xref>
          , and in some more experiments CF outperformed CBF
          <xref ref-type="bibr" rid="ref17 ref29 ref37">(Ekstrand et al., 2010; McNee, Kapoor, &amp; Konstan, 2006; Torres et al., 2004)</xref>
          . In other
words: it remains speculative how CBF and CF would perform in a scenario that differs
from one of those used in the existing evaluations.
        </p>
        <p>
          Some authors used bibliometrics to enhance recommender systems in digital
libraries. Popular metrics were PageRank
          <xref ref-type="bibr" rid="ref12">(Bethard &amp; Jurafsky, 2010)</xref>
          , HITS
          <xref ref-type="bibr" rid="ref21">(He, Pei, Kifer,
Mitra, &amp; Giles, 2010)</xref>
          , Katz
          <xref ref-type="bibr" rid="ref21">(He et al., 2010)</xref>
          , citation counts
          <xref ref-type="bibr" rid="ref12 ref21 ref32">(Bethard &amp; Jurafsky, 2010;
He et al., 2010; Rokach, Mitra, Kataria, Huang, &amp; Giles, 2013)</xref>
          , venues’ citation counts
          <xref ref-type="bibr" rid="ref12 ref32">(Bethard &amp; Jurafsky, 2010; Rokach et al., 2013)</xref>
          , citation counts of the authors’
affiliations
          <xref ref-type="bibr" rid="ref32">(Rokach et al., 2013)</xref>
          , authors’ citation count
          <xref ref-type="bibr" rid="ref12 ref32">(Bethard &amp; Jurafsky, 2010; Rokach
et al., 2013)</xref>
          , h-index
          <xref ref-type="bibr" rid="ref12">(Bethard &amp; Jurafsky, 2010)</xref>
          , recency of articles
          <xref ref-type="bibr" rid="ref12">(Bethard &amp;
Jurafsky, 2010)</xref>
          , title length
          <xref ref-type="bibr" rid="ref32">(Rokach et al., 2013)</xref>
          , number of co-authors
          <xref ref-type="bibr" rid="ref32">(Rokach et al.,
2013)</xref>
          , number of affiliations
          <xref ref-type="bibr" rid="ref32">(Rokach et al., 2013)</xref>
          , and venue type
          <xref ref-type="bibr" rid="ref32">(Rokach et al.,
2013)</xref>
          . Again, results are not always coherent. For instance,
          <xref ref-type="bibr" rid="ref12">Bethard and Jurafsky
(2010)</xref>
          reported that using citation counts in the recommendation process strongly
increased the effectiveness of their recommendation approach, while
          <xref ref-type="bibr" rid="ref21">He et al. (2010)</xref>
          reported that citation counts slightly increased the effectiveness of their approach.
        </p>
        <p>
          Our own research confirms that recommendation approaches perform very
differently in different scenarios. We recently applied five recommendation approaches on
six news websites
          <xref ref-type="bibr" rid="ref2 ref5">(Beel, Breitinger, Langer, Lommatzsch, &amp; Gipp, 2016)</xref>
          . The results
showed that recommendation approaches performing well on one news website
performed poorly on others (Figure 2). For instance, the most-popular approach performed
worst on tagesspiegel.de but best on cio.de.
        </p>
        <p>0.40
ino 0.30
isc 0.20
reP 0.10</p>
        <p>
          0.00
User-based CF
Mst. ppl. sequence
Item-based CF
Content-based
Most popular
There are several potential reasons for the unpredictability. In some cases, different
evaluation methods were used. In other cases variations in algorithms or user
populations might have had an impact
          <xref ref-type="bibr" rid="ref10 ref2 ref26 ref5 ref8 ref9">(Beel, Breitinger, et al., 2016; Beel, Langer,
Nürnberger, &amp; Genzmehr, 2013; Langer &amp; Beel, 2014)</xref>
          . However, it seems that, for
instance, the operator of a news website cannot estimate how effective the most-popular
recommendation approach would be until the operator has implemented and evaluated
the approach on that particular website. Therefore, our advice is to read a
recommendersystem text book to get a general idea of recommender systems
          <xref ref-type="bibr" rid="ref23 ref25 ref31">(Jannach, 2014;
Konstan &amp; Ekstrand, 2015; Ricci, Rokach, &amp; Shapira, 2015)</xref>
          . Then, choose a few
recommendation frameworks and try to find the best recommendation approach for one's
recommender system.
3.3
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>The randomization engine</title>
        <p>To build an effective recommender system, A/B testing is essential. This means, the
recommender system needs a pool of recommendation algorithms to choose from, and
a logging mechanism to record, which algorithm was used and what the user feedback
was (e.g. how many recommendations were clicked or downloaded). Unfortunately, in
most situations, a simple A/B test with only two alternative algorithms will not be
enough. In Mr. DLib, we implemented a “randomization engine” that first picks from
several recommendation classes randomly, and then varies the parameters of the
recommendation algorithms. The recommendation classes are content-based filtering
(90% chance), stereotyping (4.9% chance), most-popular recommendations (4.9%
chance), and random recommendations as baseline (0.2% chance). For each of the
recommendation classes, the randomization engine chooses some parameters randomly.
For instance, when content-based filtering is chosen, the system randomly selects
whether to use “normal” terms, or key-phrases5. When key-phrases are chosen, the
system randomly selects if key-phrases from the abstract, title, or title and abstract are
used. Then the system randomly selects if unigrams, bigrams, or trigrams are used.
Then the system randomly selects if one, two, three, … or twenty key-phrases are used
to calculate document similarity.</p>
        <p>
          Once the recommendations are calculated, the randomization engine chooses
randomly if the recommendation candidates should be re-ranked based on bibliometrics.
Re-ranking means that from the top x recommendations those ten documents are
eventually recommended that have, for instance, the highest bibliometric score. Again, there
are many parameters that are randomly chosen by the randomization engine. The engine
selects the bibliometric (plain readership count, readership count normalized by age of
the document, readership count normalized by the number of authors, etc.), the number
of recommendation candidates to re-rank (10 to 100), and how the bibliometric and text
relevance scores should be combined (bibliometric only, multiplication of scores, and
some more variations). While we currently only work with readership data from
Mendeley, we plan to obtain additional data from sources such as Google Scholar, Scopus,
or Microsoft Academic
          <xref ref-type="bibr" rid="ref36">(Sinha et al., 2015)</xref>
          .
        </p>
        <p>Developing such a randomization engine is a non-trivial task and we are not perfectly
satisfied with our solution. It occurs too often that a fallback algorithm must be used
because the randomly assembled algorithm cannot be applied because, for instance, a
document does not have 20 key-phrases in its title.
5 Keyphrases are the most meaningful phrases describing a document. Keyphrases are extracted with
stemming, part-of-speech tagging and other mechanisms that we describe in an upcoming paper.
3.4</p>
      </sec>
      <sec id="sec-3-3">
        <title>Data quality</title>
        <p>Crucial for content-based recommendations is the quality of metadata. When using
publicly available datasets, the quality may often be good because datasets were designed
and maybe even curated to be published. In the real-world, however, data quality is
often low. For instance, some of the most productive “authors” in Mr. DLib’s database
are “et al.”, “and others”, and “AnoN”. Obviously, these are no real authors. When this
data is used e.g. to recommend papers of co-authors, or to re-rank recommendations
based on h-index, the resulting recommendations will be of suboptimal quality.</p>
        <p>
          Cleaning data from third parties is a labor-intensive and usually boring task (it is
much easier to motivate a colleague to implement a novel recommendation algorithm
than convincing a colleague to spend some weeks cleaning data in a database). Data
cleaning becomes particularly challenging, when the data comes from a third party and
the data are updated occasionally by the partner. In that case, one would need a process
to decide how to deal with the updated data and judge if the new data from the partner
is better than the manually changed data in our system. From our experience in other
projects, we know that manually cleaning data usually causes many problems.
Therefore, we decided to do no manual data cleaning in Mr. DLib, and only apply a few
heuristics such as ignoring “et al.” when calculating bibliometrics.
On the Internet, users tend to be impatient: the longer they wait for content, the less
satisfied they become
          <xref ref-type="bibr" rid="ref19 ref24 ref25 ref34">(Guse, Schuck, Hohlfeld, Raake, &amp; Möller, 2015; Kim, Xiong,
&amp; Liang, 2017; Selvidge, Chaparro, &amp; Bender, 2002)</xref>
          . This holds true for recommender
systems, too. We observed that the longer users had to wait for recommendations, the
less likely they were to click a recommendation. Figure 3 shows that processing time6
for most recommendations (34%) was between 1 and 2 seconds. These
recommendations also had the highest click-through rate (CTR) of 0.15% on average. In contrast,
recommendations that needed 7 to 8 seconds for calculation had a CTR of 0.08%.
6 “Processing Time” is the time from receiving a request until delivering the response on side of Mr. DLib.
        </p>
        <p>It may be that a user who must wait too long, leaves the web page and does not even see the
recommendations.</p>
        <p>The re-ranking of recommendations based on bibliometrics requires rather a lot of
calculation (primarily due to the randomization engine and because we store many
statistics when calculating the bibliometrics). Consequently, when evaluating the
effectiveness of bibliometric re-ranking, one need to additionally consider if the additional
effectiveness if worth the additional time users need to wait.
4.2</p>
      </sec>
      <sec id="sec-3-4">
        <title>The need to deliver only good recommendations</title>
        <p>
          In the recommender-system community, it is often reported that a recommender system
should try to avoid making ‘bad’ recommendations as this hurts the users’ trust. In other
words, it is better to recommend 5 good and 5 mediocre items than recommending
9 excellent but 1 bad item. To avoid bad recommendations, some relevance score is
needed that indicates how good a recommendation is. Ideally, only recommendations
above a certain threshold would then be recommended. However, at least Lucene has
no such threshold that would allow a prediction of how relevant a recommendation is7.
The Lucene text relevance score only allows to rank recommendations for one given
query and compare the relevance of the results returned for that one query. In a recent
analysis, we found that Lucene relevance scores and CTR correlate, but still it is not
possible to avoid “bad” recommendations
          <xref ref-type="bibr" rid="ref27 ref3 ref4">(Langer &amp; Beel, 2017)</xref>
          .
        </p>
        <p>Even if Lucene had an “absolute” text relevance score, this score would only be able
to prevent bad recommendations to some extent. We see a high potential in
bibliometrics to support recommender systems in not recommending bad items.
4.3</p>
      </sec>
      <sec id="sec-3-5">
        <title>Number of recommendations</title>
        <p>
          Another question that may seem simple to answer is how many recommendations to
display? One? Two? ...Ten? We experimented in Mr. DLib with varying numbers of
recommendations between 1 and 15
          <xref ref-type="bibr" rid="ref11 ref27 ref3 ref4">(Beierle, Aizawa, &amp; Beel, 2017)</xref>
          . We observed that
the more recommendations were displayed, the lower click-through rate became
(Figure 4). From these results, one cannot conclude how many recommendations to
display, and more research is necessary.
5
5.1
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Recommender-System Research</title>
      <sec id="sec-4-1">
        <title>No tweaking of data</title>
        <p>
          In offline evaluations, it is common to tweak datasets. For instance, Caragea et al.
removed papers with fewer than ten and more than 100 citations from the evaluation
corpus, as well as papers citing fewer than 15 and more than 50 papers
          <xref ref-type="bibr" rid="ref15 ref32">(Caragea,
Silvescu, Mitra, &amp; Giles, 2013)</xref>
          . From originally 1.3 million papers in the corpus,
around 16,000 remained (1.2%). Pennock et al. removed documents with fewer than 15
implicit ratings from the corpus
          <xref ref-type="bibr" rid="ref30">(Pennock, Horvitz, Lawrence, &amp; Giles, 2000)</xref>
          . From
7 Using the standard More-Like-This or search function
https://wiki.apache.org/lucene-java/ScoresAsPercentages
originally 270,000 documents, 1,575 remained (0.58%). Such tweaking and pruning of
datasets may be convenient for the research and lead potentially to high precision.
However, applying a recommendation approach to only 0.58% of the documents in a corpus,
will lead to a very poor recall, i.e. the results that have little relevance for running a
real-world recommender system. In other words, in a real-world recommender system
such a tweaking would be difficult, unless one would accept that recommendations can
be delivered only for a fraction of documents in the corpus.
When we reviewed over 200 research articles about recommender systems, every
article that introduced a new recommendation approach reported to outperform the
stateof-the art
          <xref ref-type="bibr" rid="ref2 ref5">(Beel, Gipp, et al., 2016)</xref>
          . We were not that lucky.
        </p>
        <p>
          We re-ranked content-based recommendation with readership data from Mendeley
          <xref ref-type="bibr" rid="ref18 ref35">(Siebert et al., 2017)</xref>
          , but results were not as good as expected. We used a
key-phraseextraction approach to improve our content-based filtering approach, and experimented
with a variation of parameters: we varied the text-fields from which key-phrases were
extracted, we varied the number of key-phrases being used, and we varied the type of
key-phrases (unigrams, bigrams, trigrams, or a mix). None of these variations
performed better than an out-of-the-box Apache Lucene baseline.8 We experimented with
stereotype and most-popular recommendations, two approaches that are effective in
domains such as movie recommendations and hotel search
          <xref ref-type="bibr" rid="ref27 ref3 ref35 ref4">(Beel, Dinesh, et al., 2017)</xref>
          .
Again, the approaches performed not better than Apache Lucene.
        </p>
        <p>There are a few potential reasons why our experiments delivered disappointing
results (besides the possibility that the recommendation approaches are just not effective).
For instance, in case of the bibliometric re-ranking, we assume that click-through rate
might be not appropriate to measure the re-ranking effectiveness. However, even while
there might be plausible reasons for the results, the fact remains that many of our
experiments delivered rather disappointing results, but this is probably rather the rule than
the exception in a real-world scenario.
8 The analysis is still in process, and the final results will be published soon.
5.3</p>
      </sec>
      <sec id="sec-4-2">
        <title>Other researchers' interest in datasets</title>
        <p>
          One advantage of working on a real-world recommender system is the possibility to
release datasets, which then can be used (and cited) by other researchers. Some datasets
such as MovieLens are very popular in the recommender system community. The
MovieLens dataset was downloaded 140,000 times in 2014
          <xref ref-type="bibr" rid="ref20">(Harper &amp; Konstan, 2016)</xref>
          , and
Google Scholar lists 10,600 papers that mention the MovieLens dataset9.
        </p>
        <p>
          Not all datasets become that popular. In 2014, we released a dataset of Docear's
recommender system
          <xref ref-type="bibr" rid="ref26 ref9">(Beel et al., 2014)</xref>
          . The datasets contained metadata of 9.4 million
articles, including 1.8 million articles publicly available on the Web; the articles’
citation network; anonymized information on 8,059 Docear users; information about the
users’ 52,202 mind-maps and personal libraries; and details on the 308,146
recommendations that the recommender system delivered. In the 2.5 years since publication, 31
researchers requested to download the dataset. To the best of our knowledge, none of
these researchers has eventually analyzed the dataset and published their results.
6
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Summary &amp; Conclusion</title>
      <p>Building and operating real-world recommender systems is a time-consuming and
challenging task, as is research on such systems.</p>
      <p>To develop recommender systems, knowledge from various disciplines is required
such as machine learning, server administration, databases, web technologies, and data
formats. When building our own recommender systems, we could find no guidance in
the literature. Most published research results had questionable evaluations and even if
evaluations were sound, recommender systems seem to perform just too differently in
different scenarios. Consequently, we used Apache Lucene as recommendation
framework and began from scratch. Evaluating different recommendation approaches
requires a randomization engine that selects and assembles recommendation algorithms
automatically. In addition, a detailed logging mechanism is required that records which
algorithm created which recommendation and how the user reaction was to the different
algorithms. Another challenge lies in dealing with sub-optimal data quality from
partners. Due to time constraints, we decided to not manually improve the data but just
work with what we got.</p>
      <p>Running a recommender system requires fast generation and delivery of
recommendations. Otherwise, users become dissatisfied and click-through rates decrease.
Although it is widely known that recommender systems should avoid making bad
recommendations, this is not easily accomplished in practice, at least if Lucene is used as
recommendation framework. Lucene has no absolute relevance score and hence no
mechanism to recommend only items above a certain relevance threshold.</p>
      <p>Research on real-world recommender systems is probably more frustrating than
research in a lab-environment, mostly because data cannot be tweaked that easily.
Consequently, we had to accept that many experiments with novel recommendation
approaches failed. Similarly, while some recommendation datasets such as MovieLens
are widely used in the community, we could not yet manage to establish our datasets as
interesting source for other researchers.</p>
      <p>Despite all these challenges, research on real-world recommender systems is a
rewarding and worthwhile effort. We feel that working on real systems provides much
more relevant research results. In addition, offering a real-world open-source project
attracts many volunteers, students, and project partners. This, in turn, enabled us to
conduct research in many areas, and be quite productive in terms of publication output.
7</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>This work was supported by a fellowship within the FITweltweit programme of the
German Academic Exchange Service (DAAD). In addition, this publication has
emanated from research conducted with the financial support of Science Foundation Ireland
(SFI) under Grant Number 13/RC/2106. We are also grateful for the support received
by Sophie Siebert, Stefan Feyer, Felix Beierle, Sara Mahmoud, Gabor Neusch, and
Mr. DLib's partners.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>SciPlore MindMapping now provides literature recommendations (Beta 15)</article-title>
          . http://sciplore.org/
          <year>2011</year>
          <article-title>/sciplore-mindmapping-now-provides-literature-</article-title>
          <string-name>
            <surname>recommendations-</surname>
          </string-name>
          beta-
          <volume>15</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breitinger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lommatzsch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Towards Reproducibility in Recommender-Systems Research. User Modeling and User-Adapted Interaction (UMUAI</article-title>
          ),
          <volume>26</volume>
          (
          <issue>1</issue>
          ),
          <fpage>69</fpage>
          -
          <lpage>101</lpage>
          . doi:
          <volume>10</volume>
          .1007/s11257-016-9174-x
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dinesh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carevic</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Raghvendra</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Stereotype and Most-Popular Recommendations in the Digital Library Sowiport</article-title>
          .
          <source>Proceedings of the 15th International Symposium of Information Science (ISI).</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Aizawa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Mr. DLib: Recommendations-as-a-Service (RaaS) for Academia</article-title>
          .
          <source>Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL).</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Breitinger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Research Paper Recommender Systems: A Literature Survey</article-title>
          .
          <source>International Journal on Digital Libraries, (4)</source>
          ,
          <fpage>305</fpage>
          -
          <lpage>338</lpage>
          . doi:
          <volume>10</volume>
          .1007/s00799-015-0156-0
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Genzmehr</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Docear: An Academic Literature Suite for Searching, Organizing and Creating Academic Literature</article-title>
          .
          <source>Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries (JCDL)</source>
          ,
          <source>JCDL '11</source>
          (pp.
          <fpage>465</fpage>
          -
          <lpage>466</lpage>
          ).
          <source>doi:10.1145/1998076</source>
          .1998188
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Mueller</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2009</year>
          ). SciPlore MindMapping'
          <article-title>- A Tool for Creating Mind Maps Combined with PDF</article-title>
          and
          <string-name>
            <given-names>Reference</given-names>
            <surname>Management. D-Lib</surname>
          </string-name>
          <string-name>
            <surname>Magazine</surname>
          </string-name>
          ,
          <volume>15</volume>
          (
          <issue>11</issue>
          ). doi:
          <volume>10</volume>
          .1045/november2009-inbrief
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Genzmehr</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Nürnberger</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Introducing Docear's Research Paper Recommender System</article-title>
          .
          <source>Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL'13)</source>
          (pp.
          <fpage>459</fpage>
          -
          <lpage>460</lpage>
          ). ACM. doi:
          <volume>10</volume>
          .1145/2467696.2467786
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Nürnberger</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>The Architecture and Datasets of Docear's Research Paper Recommender System</article-title>
          .
          <string-name>
            <surname>D-Lib</surname>
            <given-names>Magazine</given-names>
          </string-name>
          ,
          <volume>20</volume>
          (
          <issue>11</issue>
          /12). doi:
          <volume>10</volume>
          .1045/november14-beel
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nürnberger</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Genzmehr</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>The Impact of Demographics (Age and Gender) and Other User Characteristics on Evaluating Recommender Systems</article-title>
          .
          <source>In Proceedings of the 17th International Conference on Theory and Practice of Digital Libraries (TPDL)</source>
          (pp.
          <fpage>400</fpage>
          -
          <lpage>404</lpage>
          ). Springer.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Beierle</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aizawa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Exploring Choice Overload in Related-Article Recommendations in Digital Libraries</article-title>
          .
          <source>5th International Workshop on Bibliometric-enhanced Information Retrieval (BIR) at the 39th European Conference on Information Retrieval (ECIR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Bethard</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Jurafsky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Who should I cite: learning literature search models from citation behavior</article-title>
          .
          <source>Proceedings of the 19th ACM international conference on Information and knowledge management</source>
          (pp.
          <fpage>609</fpage>
          -
          <lpage>618</lpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Cabanac</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Accuracy of inter-researcher similarity measures based on topical and social clues</article-title>
          .
          <source>Scientometrics</source>
          ,
          <volume>87</volume>
          (
          <issue>3</issue>
          ),
          <fpage>597</fpage>
          -
          <lpage>620</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Cabanac</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Preuss</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Capitalizing on order effects in the bids of peer-reviewed conferences to secure reviews by expert referees</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>64</volume>
          (
          <issue>2</issue>
          ),
          <fpage>405</fpage>
          -
          <lpage>415</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Caragea</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silvescu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C. L.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Can't See the Forest for the Trees? A Citation Recommendation System</article-title>
          .
          <source>iConference 2013 Proceedings</source>
          (pp.
          <fpage>849</fpage>
          -
          <lpage>851</lpage>
          ).
          <source>doi:10.9776/13434</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Dong</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tokarchuk</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>Digging Friendship: Paper Recommendation in Social Network</article-title>
          .
          <source>Proceedings of Networking &amp; Electronic Commerce Research Conference (NAEC</source>
          <year>2009</year>
          )
          <article-title>(pp</article-title>
          .
          <fpage>21</fpage>
          -
          <lpage>28</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Ekstrand</surname>
            ,
            <given-names>M. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kannan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stemper</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Riedl</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Automatically building research reading lists</article-title>
          .
          <source>Proceedings of the fourth ACM conference on Recommender systems</source>
          (pp.
          <fpage>159</fpage>
          -
          <lpage>166</lpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Feyer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siebert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aizawa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Integration of the Scientific Recommender System Mr</article-title>
          .
          <article-title>DLib into the Reference Manager JabRef</article-title>
          .
          <source>Proceedings of the 39th European Conference on Information Retrieval (ECIR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Guse</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuck</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hohlfeld</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raake</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Möller</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Subjective quality of webpage loading: The impact of delayed and missing elements on quality ratings and task completion time</article-title>
          .
          <source>Quality of Multimedia Experience (QoMEX)</source>
          , 2015 Seventh International Workshop on (pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Harper</surname>
            ,
            <given-names>F. M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>The movielens datasets: History and context</article-title>
          .
          <source>ACM Transactions on Interactive Intelligent Systems (TiiS)</source>
          ,
          <volume>5</volume>
          (
          <issue>4</issue>
          ),
          <fpage>19</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kifer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Context-aware citation recommendation</article-title>
          .
          <source>Proceedings of the 19th international conference on World wide web</source>
          (pp.
          <fpage>421</fpage>
          -
          <lpage>430</lpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Hienert</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sawitzki</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2015</year>
          ). Digital Library Research in Action - Supporting Information Retrieval in Sowiport.
          <string-name>
            <surname>D-Lib</surname>
            <given-names>Magazine</given-names>
          </string-name>
          ,
          <volume>21</volume>
          (
          <issue>3</issue>
          /4). doi:
          <volume>10</volume>
          .1045/march2015-hienert
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Jannach</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <source>Recommender systems: an introduction. Lecture Slides (PhD School</source>
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiong</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Effect of Loading Symbol of Online Video on Perception of Waiting Time</article-title>
          .
          <source>International Journal of Human-Computer Interaction</source>
          ,
          <article-title>(just-accepted)</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ekstrand</surname>
            ,
            <given-names>M. D.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Introduction to Recommender Systems</article-title>
          .
          <source>Coursera Lecture Slides.</source>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>The Comparability of Recommender System Evaluations and Characteristics of Docear's Users</article-title>
          .
          <source>Proceedings of the Workshop on Recommender Systems Evaluation: Dimensions and Design (REDD) at the 2014 ACM Conference Series on Recommender Systems (RecSys)</source>
          (pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Apache Lucene as Content-Based-Filtering Recommender System: 3 Lessons Learned</article-title>
          .
          <source>5th International Workshop on Bibliometric-enhanced Information Retrieval (BIR) at the 39th European Conference on Information Retrieval (ECIR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>McNee</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cosley</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gopalkrishnan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lam</surname>
            ,
            <given-names>S. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rashid</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          , et al. (
          <year>2002</year>
          ).
          <article-title>On the Recommending of Citations for Research Papers</article-title>
          .
          <source>Proceedings of the ACM Conference on Computer Supported Cooperative Work</source>
          . doi:http://doi.acm.
          <source>org/10</source>
          .1145/587078.587096
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>McNee</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kapoor</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Don't look stupid: avoiding pitfalls when recommending research papers</article-title>
          .
          <source>Proceedings of the 20th Conference on Computer supported cooperative work.</source>
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Pennock</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horvitz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lawrence</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C. L.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Collaborative filtering by personality diagnosis: A hybrid memory-and model-based approach</article-title>
          .
          <source>Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence</source>
          (pp.
          <fpage>473</fpage>
          -
          <lpage>480</lpage>
          ). Morgan Kaufmann Publishers Inc.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Ricci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rokach</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Shapira</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <source>Recommender Systems Handbook (2nd edt.)</source>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <surname>Rokach</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kataria</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>A Supervised Learning Method for ContextAware Citation Recommendation in a Large Corpus</article-title>
          .
          <source>Proceedings of the Large-Scale and Distributed Systems for Information Retrieval Workshop (LSDS-IR)</source>
          (pp.
          <fpage>17</fpage>
          -
          <lpage>22</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <surname>Schwarzer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meuschke</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breitinger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markl</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Evaluating Linkbased Recommendations for Wikipedia</article-title>
          .
          <source>Proceedings of the 16th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL)</source>
          ,
          <source>JCDL '16</source>
          (pp.
          <fpage>191</fpage>
          -
          <lpage>200</lpage>
          ).
          <source>doi:10.1145/2910896</source>
          .2910908
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <surname>Selvidge</surname>
            ,
            <given-names>P. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chaparro</surname>
            ,
            <given-names>B. S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Bender</surname>
            ,
            <given-names>G. T.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>The world wide wait: effects of delays on user performance</article-title>
          .
          <source>International Journal of Industrial Ergonomics</source>
          ,
          <volume>29</volume>
          (
          <issue>1</issue>
          ),
          <fpage>15</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <string-name>
            <surname>Siebert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dinesh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Feyer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Extending a Research Paper Recommendation System with Bibliometric Measures</article-title>
          .
          <source>5th International Workshop on Bibliometric-enhanced Information Retrieval (BIR) at the 39th European Conference on Information Retrieval (ECIR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <string-name>
            <surname>Sinha</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ma</surname>
          </string-name>
          , H.,
          <string-name>
            <surname>Eide</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hsu</surname>
            ,
            <given-names>B. P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>An overview of Microsoft Academic Service (MAS) and applications</article-title>
          .
          <source>Proceedings of the 24th WWW conference</source>
          (pp.
          <fpage>243</fpage>
          -
          <lpage>246</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <string-name>
            <surname>Torres</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McNee</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Riedl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Enhancing digital libraries with TechLens+</article-title>
          .
          <source>Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries</source>
          (pp.
          <fpage>228</fpage>
          -
          <lpage>236</lpage>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>