<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Apache Lucene as Content-Based-Filtering Recommender System: 3 Lessons Learned</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Stefan</forename><surname>Langer</surname></persName>
							<email>langer@ovgu.de</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">Otto-von-Guericke University</orgName>
								<address>
									<settlement>Magdeburg</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Joeran</forename><surname>Beel</surname></persName>
							<email>joeran.beel@adaptcentre.ie</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution" key="instit1">Trinity College Dublin</orgName>
								<orgName type="institution" key="instit2">ADAPT Centre</orgName>
								<address>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Digital Content and Media Sciences Research Division</orgName>
								<orgName type="institution">National Institute of Informatics</orgName>
								<address>
									<settlement>To-kyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Apache Lucene as Content-Based-Filtering Recommender System: 3 Lessons Learned</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">E2F43F6C9A3D066808A958379E403FB3</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:26+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>recommender systems</term>
					<term>apache lucene</term>
					<term>content-based filtering</term>
					<term>lessons learned BIR 2017 Workshop on Bibliometric-enhanced Information Retrieval</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>For the past few years, we used Apache Lucene as recommendation framework in our scholarly-literature recommender system of the reference-management software Docear. In this paper, we share three lessons learned from our work with Lucene. First, recommendations with relevance scores below 0.025 tend to have significantly lower click-through rates than recommendations with relevance scores above 0.025. Second, by picking ten recommendations randomly from Lucene's top50 search results, click-through rate decreased by 15%, compared to recommending the top10 results. Third, the number of returned search results tend to predict how high click-through rates will be: when Lucene returns less than 1,000 search results, click-through rates tend to be around half as high as if 1,000+ results are returned.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Apache Lucene/Solr is probably the most common search framework, and it is frequently used by content-based-filtering recommender systems <ref type="bibr" target="#b0">(Bancu et al., 2012;</ref><ref type="bibr" target="#b8">Caragea et al., 2014;</ref><ref type="bibr" target="#b13">Garcia Esparza, O'Mahony, &amp; Smyth, 2010;</ref><ref type="bibr" target="#b15">Jonnalagedda, Gauch, Labille, &amp; Alfarhood, 2016;</ref><ref type="bibr" target="#b16">Livne, Gokuladas, Teevan, Dumais, &amp; Adar, 2014;</ref><ref type="bibr" target="#b17">Mitzig et al., 2016;</ref><ref type="bibr" target="#b19">Phelan, McCarthy, &amp; Smyth, 2009;</ref><ref type="bibr" target="#b20">Pohl, 2007;</ref><ref type="bibr" target="#b21">Pursel et al., 2016;</ref><ref type="bibr" target="#b23">Shelton, Duffin, Wang, &amp; Ball, 2010</ref>). Lucene's build-in recommendation method, which uses a classic TF-IDF-weighted term-vector retrieval approach, is also generally used frequently as a baseline method that typically achieves good results. <ref type="bibr" target="#b11">(Demner-Fushman et al., 2011;</ref><ref type="bibr" target="#b14">Gipp, Meuschke, &amp; Lipinski, 2015;</ref><ref type="bibr" target="#b22">Schwarzer et al., 2016)</ref> We used Lucene to implement a research-paper recommender system in Docear <ref type="bibr">(Beel, Gipp, Langer, &amp; Genzmehr, 2011;</ref><ref type="bibr" target="#b4">Beel, Gipp, &amp; Mueller, 2009;</ref><ref type="bibr" target="#b6">Beel, Langer, Genzmehr, &amp; Nürnberger, 2013;</ref><ref type="bibr" target="#b7">Beel, Langer, Gipp, &amp; Nürnberger, 2014)</ref>. Docear is a free and open-source reference manager, comparable to tools like Endote, Zotero, Mendeley, or Citavi. Docear has approximately 50,000 registered users and uses mindmaps to manage PDFs and references. Since 2012, Docear has been offering a recommender system for 1.8 million publicly available research papers on the web. Recommendations are displayed as a list of ten research papers, showing the title of the recommended papers (Fig. <ref type="figure" target="#fig_0">1</ref>). Clicking a recommendation opens the paper's full-text (PDF) in the user's web browser. Between 2012 and 2015, the recommender system delivered around one million recommendations to more than 7,000 researchers. For more details on the recommender system please refer to <ref type="bibr" target="#b7">Beel et al. (2014)</ref>. In this paper, we share some experiences we made with Lucene, focusing on three aspects. First, we analyze the meaning of Lucene's relevance scores. Second, we analyze how effective recommendations are based on Lucene's suggested rank. Finally, we analyze the relationship between the amount of recommendation candidates that Lucene returns and the recommendation effectiveness. Although we did our research in the context of research-paper recommendations, results might also be interesting for other recommender-systems domains that use Lucene, for instance, in the domains of news recommender systems, website recommender systems, or tweet recommender systems <ref type="bibr" target="#b9">(Chen, Ororbia, Alexander, &amp; Giles, 2015;</ref><ref type="bibr" target="#b12">Duma, Liakata, Clare, Ravenscroft, &amp; Klein, 2016;</ref><ref type="bibr" target="#b13">Garcia Esparza et al., 2010;</ref><ref type="bibr" target="#b15">Jonnalagedda et al., 2016;</ref><ref type="bibr" target="#b17">Mitzig et al., 2016;</ref><ref type="bibr" target="#b19">Phelan et al., 2009;</ref><ref type="bibr" target="#b23">Shelton et al., 2010)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Methodology</head><p>All presented results are based on data that we collected between May 2013, and October 2014. During this time, Docear's recommender system delivered 418,308 recommendations to 4,674 unique users. We use click-through rate as measure for the effectiveness of delivered recommendations. Click-through rate (CTR) describes the ratio of clicked and delivered recommendations. For more details on click-through rate and its suitability as evaluation metric please refer to <ref type="bibr" target="#b5">Beel &amp; Langer (2015)</ref>. All reported differences are statistically significant (p &lt; 0.05) based on a two-tailed t-test.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Results &amp; Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Lucene's Relevance Scores</head><p>Lucene provides relevance scores for each recommendation. This information could be used, theoretically, to recommend only documents with a relevance score above a certain threshold. However, on the Web it is often reported that these scores cannot be used to compare relevancies of recommendations between different queries, or to conclude from the relevance score how relevant the search result or recommendation is overall. <ref type="foot" target="#foot_0">1</ref> Our data shows a slightly different picture.</p><p>In our data, the highest relevance score for a recommendation was 19.01, median was 0.16 and mean was 0.22. Fig. <ref type="figure">2</ref> shows that CTR was lowest (3.36%) for recommendations with a relevance score below 0.01, and highest (6.16%) for relevance scores of 1 and above. For recommendations with relevance scores between 0.1 and 0.8, CTR remained mostly stable around 5%. Overall, there is a notable trend: CTR increases, the higher Lucene relevance scores become.</p><p>Our observation contradicts the common claims that Lucene's relevance score cannot be used to estimate a search result's absolute relevance. If, for instance, an operator of a recommender system decided that a click-through rate of at least 4% was desirable, then recommendations with a relevance score below 0.25 should probably be discarded.</p><p>Similarly, our result might lead to the conclusion to only recommend documents above a certain relevance threshold, e.g. 1. However, recommending only documents with a relevance score of 1 and above is probably not sensible as only a small fraction of recommendations had a relevance score of 1 and above (0.60%). Similarly, it might seem sensible to not recommend documents with relevance scores below 0.025 as these documents had very low CTRs. However, only a small fraction of recommendations (4.27%) had relevance scores below 0.025, so this decision would barely affect the overall click-through rate.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 2. Lucene relevance score and corresponding CTR</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Lucene's Rank</head><p>To increase the diversity of recommendations, Docear's recommender system randomly chose 10 recommendations out of the top50 results returned by Lucene. However, this leads to lower click-through rates. Recommendations originally being ranked 1 by Lucene received CTRs of 6.83% on average and recommendations on rank 2 received CTRs of 6.08% on average (Fig. <ref type="figure">3</ref>). For ranks 3 to 10, CTR remains stable around 5.3% and then CTR constantly decreases the lower the original rank.  Overall, recommendations being in Lucene's top10 results, achieved CTRs of 5.55% on average, while the top50 achieved CTRs of 4.73% on average. This means, selecting randomly 10 recommendations from the top50 candidates decreases recommendation effectiveness by around 15%, compared to showing recommendations from the top10 only. The recommender system shuffled recommendations before they were displayed. This means, position bias cannot have influenced the results <ref type="bibr" target="#b10">(Craswell, Zoeter, Taylor, &amp; Ramsey, 2008;</ref><ref type="bibr" target="#b14">Hofmann, Schuth, Bellogin, &amp; Rijke, 2014;</ref><ref type="bibr" target="#b18">Pan et al., 2007;</ref><ref type="bibr" target="#b24">Wang, Bendersky, Metzler, &amp; Najork, 2016)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Number of Recommendation Candidates</head><p>By default, Lucene returns 1,000 recommendations, i.e. search results. In our data, Lucene returned the maximum possible amount of 1,000 results for 91.25% of all termbased recommendations (Fig. <ref type="figure" target="#fig_2">4</ref>). In contrast, only for 0.05% of citation-based searches 1,000 results were returned. Most citation-based searches returned between one and nine results (34.84%) or between 10 and 24 results (29.94%). Click-through rates seem to be rather high when only few results were returned. For term-based searches, results are the opposite: the more recommendation candidates are available, the higher the CTR tends to be. Consequently, for term-based recommendations, the number of results might be a good approximation of recommendation effectiveness. If less than 1,000 results are returned it might make sense to no recommend the documents or try an alternative recommendation approach. From our analysis, we learned three lessons. First, Lucene's relevance score allows to predictto some extendhow relevant a recommendation will be for a user. For instance, in our scenario, it seems sensible to not recommend documents with a relevance score below 0.025. However, since only few recommendations had such a low rele- vance score, discarding them will probably not notably affect the overall recommendation effectiveness. Second, recommending ten recommendations out of the top50 results might be sensible. Although this process decreases the overall recommendation effectiveness by 15%, the recommendation diversity or number of total recommendations is increased. Third, the number of recommendation candidates returned by Lucene is suitable to approximate the recommendation effectiveness. If Lucene returns less than 1,000 results for term-based recommendations, the click-through rate probably will be around half as high as if 1,000 candidates are returned. In the case of less than 1,000 results, it might make sense to not display the recommendations or generate recommendations again with another recommendation approach.</p><p>For the future, we suggest to repeat our analyses in different scenarios, for instance, with news recommenders or other literature recommender systems, to see if Lucene behaves in the same way as in the scenario of Docear. Currently, we are developing a recommender system as-a-service that would allow us to conduct such analyses with different partners <ref type="bibr" target="#b1">(Beel &amp; Gipp, 2017;</ref><ref type="bibr">Beel, Gipp, Langer, Genzmehr, et al., 2011)</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Screenshot of Docear and the recommender system</figDesc><graphic coords="2,133.40,489.40,328.49,184.90" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig.</head><label></label><figDesc>Fig. 3. Lucene's rank and corresponding CTR</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. CTR based on the number of recommendation candidates in Lucene</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>recommendations Number of Recommendation Candidates in Lucene</head><label></label><figDesc></figDesc><table><row><cell cols="2">100%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>10%</cell></row><row><cell>Number of displayed</cell><cell>10% 20% 30% 40% 50% 60% 70% 80% 90%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>2% 4% 6% 8%</cell><cell>CTR</cell></row><row><cell></cell><cell>0%</cell><cell>[1;9]</cell><cell>[10-24]</cell><cell>[25-50]</cell><cell>[51-99]</cell><cell cols="2">[100-249] [250-999]</cell><cell>1000</cell><cell>0%</cell></row><row><cell cols="2"># Dspld. Recs (Terms)</cell><cell>0.18%</cell><cell>0.21%</cell><cell>0.18%</cell><cell>0.19%</cell><cell>0.38%</cell><cell>0.92%</cell><cell>97.94%</cell></row><row><cell cols="2"># Dspld. Recs (Cit.)</cell><cell>34.84%</cell><cell>29.94%</cell><cell>17.37%</cell><cell>10.28%</cell><cell>5.93%</cell><cell>1.59%</cell><cell>0.05%</cell></row><row><cell>CTR (Terms)</cell><cell></cell><cell>2.73%</cell><cell>1.28%</cell><cell>1.50%</cell><cell>1.86%</cell><cell>1.29%</cell><cell>2.22%</cell><cell>4.70%</cell></row><row><cell>CTR (Citations)</cell><cell></cell><cell>7.96%</cell><cell>4.43%</cell><cell>5.27%</cell><cell>5.05%</cell><cell>3.54%</cell><cell>2.84%</cell><cell>0.00%</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://wiki.apache.org/lucene-java/ScoresAsPercentages BIR</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2017" xml:id="foot_1">Workshop on Bibliometric-enhanced Information Retrieval</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work was supported by a fellowship within the FITweltweit programme of the German Academic Exchange Service (DAAD). In addition, this publication has emanated from research conducted with the financial support of Science Foundation Ireland (SFI) under Grant Number 13/RC/2106.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">ARSYS-Article Recommender System</title>
		<author>
			<persName><forename type="first">C</forename><surname>Bancu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dagadita</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dascalu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dobre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Trausan-Matu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Florea</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing</title>
				<meeting>the 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing</meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="349" to="355" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Mr. DLib: Recommendations-as-a-Service (RaaS) for Academia (Pre-print)</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL)</title>
				<meeting>the ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL)</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Docear: An Academic Literature Suite for Searching, Organizing and Creating Academic Literature</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Langer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Genzmehr</surname></persName>
		</author>
		<idno type="DOI">10.1145/1998076.1998188</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries (JCDL), JCDL &apos;11</title>
				<meeting>the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries (JCDL), JCDL &apos;11</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="465" to="466" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Introducing Mr. DLib, a Machine-readable Digital Library</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Langer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Genzmehr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Wilde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Nürnberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pitman</surname></persName>
		</author>
		<idno type="DOI">10.1145/1998076.1998187</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL&apos;11</title>
				<meeting>the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL&apos;11</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="463" to="464" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">SciPlore MindMapping&apos; -A Tool for Creating Mind Maps Combined with PDF and Reference Management</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Mueller</surname></persName>
		</author>
		<idno type="DOI">10.1045/november2009-inbrief</idno>
	</analytic>
	<monogr>
		<title level="j">D-Lib Magazine</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">11</biblScope>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A Comparison of Offline Evaluations, Online Evaluations, and User Studies in the Context of Research-Paper Recommender Systems</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Langer</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-319-24592-8_12</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 19th International Conference on Theory and Practice of Digital Libraries (TPDL)</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">S</forename><surname>Kapidakis</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Mazurek</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">&amp;</forename><forename type="middle">M</forename><surname>Werla</surname></persName>
		</editor>
		<meeting>the 19th International Conference on Theory and Practice of Digital Libraries (TPDL)</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">9316</biblScope>
			<biblScope unit="page" from="153" to="168" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Introducing Docear&apos;s Research Paper Recommender System</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Langer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Genzmehr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Nürnberger</surname></persName>
		</author>
		<idno type="DOI">10.1145/2467696.2467786</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL&apos;13</title>
				<meeting>the 13th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL&apos;13</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="459" to="460" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The Architecture and Datasets of Docear&apos;s Research Paper Recommender System</title>
		<author>
			<persName><forename type="first">J</forename><surname>Beel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Langer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Nürnberger</surname></persName>
		</author>
		<idno type="DOI">10.1045/november14-beel</idno>
	</analytic>
	<monogr>
		<title level="j">D-Lib Magazine</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">11/12</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">CiteseerX: A scholarly big dataset</title>
		<author>
			<persName><forename type="first">C</forename><surname>Caragea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ciobanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fernández-Ram𝚤rez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H.-H</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">European Conference on Information Retrieval</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="311" to="322" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">ExpertSeer: a Keyphrase Based Expert Recommender for Digital Libraries</title>
		<author>
			<persName><forename type="first">H.-H</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Ororbia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Alexander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">L</forename><surname>Giles</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1511.02058</idno>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">An experimental comparison of click position-bias models</title>
		<author>
			<persName><forename type="first">N</forename><surname>Craswell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Zoeter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Taylor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Ramsey</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2008 International Conference on Web Search and Data Mining</title>
				<meeting>the 2008 International Conference on Web Search and Data Mining</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="87" to="94" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">A Knowledge-Based Approach to Medical Records Retrieval</title>
		<author>
			<persName><forename type="first">D</forename><surname>Demner-Fushman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Abhyankar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jimeno-Yepes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Loane</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Rance</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Lang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ide</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 20th Text REtrieval Conference (TREC)</title>
				<meeting>the 20th Text REtrieval Conference (TREC)</meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Rhetorical Classification of Anchor Text for Citation Recommendation</title>
		<author>
			<persName><forename type="first">D</forename><surname>Duma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Liakata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Clare</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ravenscroft</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Klein</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">D-Lib Magazine</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">9</biblScope>
			<date type="published" when="2016">2016. /10</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">On the real-time web as a source of recommendation knowledge</title>
		<author>
			<persName><forename type="first">Garcia</forename><surname>Esparza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>O'mahony</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">P</forename><surname>Smyth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the fourth ACM conference on Recommender systems</title>
				<meeting>the fourth ACM conference on Recommender systems</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="305" to="308" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">CITREC: An Evaluation Framework for Citation-Based Similarity Measures based on TREC Genomics and PubMed Central</title>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Meuschke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lipinski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hofmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Schuth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bellogin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Rijke</surname></persName>
		</author>
		<author>
			<persName><surname>De</surname></persName>
		</author>
		<ptr target="https://www.ideals.illinois.edu/bitstream/handle/2142/73680/98_ready" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the iConference 2015</title>
				<meeting>the iConference 2015<address><addrLine>Newport Beach, California</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2014">2015. 2014</date>
			<biblScope unit="page" from="624" to="630" />
		</imprint>
	</monogr>
	<note>Advances in Information Retrieval</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Incorporating popularity in a personalized news recommender system</title>
		<author>
			<persName><forename type="first">N</forename><surname>Jonnalagedda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gauch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Labille</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Alfarhood</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PeerJ Computer Science</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page">e63</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">CiteSight: supporting contextual citation recommendation using differential search</title>
		<author>
			<persName><forename type="first">A</forename><surname>Livne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Gokuladas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Teevan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">T</forename><surname>Dumais</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Adar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 37th international ACM SIGIR conference on Research &amp; development in information retrieval</title>
				<meeting>the 37th international ACM SIGIR conference on Research &amp; development in information retrieval</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="807" to="816" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">SciELO suggester: An intelligent support tool for cataloging library resources</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">L</forename><surname>Mitzig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Mitzig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Mart𝚤nez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Piriz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">M</forename><surname>Ferracutti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">P</forename><surname>González</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Maguitman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Library &amp; Information Science Research</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="39" to="51" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">In google we trust: Users&apos; decisions on rank, position, and relevance</title>
		<author>
			<persName><forename type="first">B</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Hembrooke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Joachims</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Lorigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Gay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Granka</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Computer-Mediated Communication</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="801" to="823" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Using twitter to recommend real-time topical news</title>
		<author>
			<persName><forename type="first">O</forename><surname>Phelan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Mccarthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Smyth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the third ACM conference on Recommender systems</title>
				<meeting>the third ACM conference on Recommender systems</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="385" to="388" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Using Access Data for Paper Recommendations on ArXiv</title>
		<author>
			<persName><forename type="first">S</forename><surname>Pohl</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
		<respStmt>
			<orgName>Technical University of Darmstadt</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Master Thesis</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">BBookX: Design of an Automated Web-based Recommender System for the Creation of Open Learning Content</title>
		<author>
			<persName><forename type="first">B</forename><surname>Pursel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Liang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Brautigam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Saul</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 25th International Conference Companion on World Wide Web</title>
				<meeting>the 25th International Conference Companion on World Wide Web</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="929" to="933" />
		</imprint>
	</monogr>
	<note>International World Wide Web Conferences Steering Committee</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Evaluating Link-based Recommendations for Wikipedia</title>
		<author>
			<persName><forename type="first">M</forename><surname>Schwarzer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schubotz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Meuschke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Breitinger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Markl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gipp</surname></persName>
		</author>
		<idno type="DOI">10.1145/2910896.2910908</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), JCDL &apos;16</title>
				<meeting>the 16th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), JCDL &apos;16<address><addrLine>Newark, New Jersey, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="191" to="200" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Linking open course wares and open education resources: creating an effective search and recommendation system</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">E</forename><surname>Shelton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Duffin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ball</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Procedia Computer Science</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="2865" to="2870" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Learning to rank with selection bias in personal search</title>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bendersky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Metzler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Najork</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval</title>
				<meeting>the 39th International ACM SIGIR conference on Research and Development in Information Retrieval</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="115" to="124" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
