<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Defining a Meaningful Baseline for News Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Benjamin Kille</string-name>
          <email>benjamin.kille@tu-berlin.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Lommatzsch</string-name>
          <email>andreas.lommatzsch@dai-labor.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Technology Berlin</institution>
          ,
          <addr-line>Berlin</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>Evaluation protocols for news recommender systems typically involve comparing the performance of methods to a baseline. The diference in performance ought to tell us what benefit we can expect from using a more sophisticated method. Ultimately, there is a trade-of between performance and efort in implementing and maintaining a system. This work explores what baselines have been used, what criteria baselines must fulfil, and evaluates a variety of baselines in a news recommender evaluation setting with multiple publishers. We find that circular bufers and trend-based predictions score highly, need little efort to implement, and require no additional data. Besides, we observe variations among publishers, suggesting that not all baselines are equally competitive in diferent circumstances.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
      <sec id="sec-1-1">
        <title>Information systems</title>
      </sec>
      <sec id="sec-1-2">
        <title>Recommender systems.</title>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>
        Readers struggle to keep up with the plethora of stories
which publishers continue to release on their digital
platforms. News Recommender Systems (NRS) support readers
navigating the dynamic news landscape. They deliver a subset
of articles deemed interesting. Publishers’ success depends—
at least partially—on how much of readers’ attention they
obtain. Revenue strongly correlates with the number of
advertisements shown to readers in an “attention economy” [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Consequently, publishers want to know whether an NRS
recommends relevant articles to their readers. The dynamic
character of the news landscape impedes on the comparability
of evaluation results. For instance, breaking news may shift
readers’ attention in a way completely unrelated to the
recommendation algorithms. Publishers account for this efect
by comparing recommendation algorithms to baselines [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
      </p>
      <p>
        The choice of baseline introduces a variable into the
evaluation protocol. Ideally, evaluators would use the same baseline
to arrive at comparable results. Not every baseline applies
to each recommendation task. For instance, collaborative
ifltering requires user profiles [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], whereas content-based
ifltering needs meta-data about items [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Section 2 reviews
Copyright ' 2019 for this paper by its authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY 4.0).
previous research on NRS focusing on which baselines they
have used. Section 3 derives requirements which baselines
must fulfil. Section 4 describes experiments conducted on a
large-scale data set from three publishers. The experiments
compare the performance of a variety of baselines. Section 5
summarises our findings and hints at directions for future
research.
2
      </p>
    </sec>
    <sec id="sec-3">
      <title>RELATED</title>
    </sec>
    <sec id="sec-4">
      <title>WORK</title>
      <p>
        A consensus concerning the evaluation protocol of NRS has
yet to establish. The availability of data afects what
baselines evaluators can use in their experiments. Frequently,
researchers use recorded interactions between users and news
articles. Whenever researchers have access to the NRS
directly, they can even employ counterfactual reasoning [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ].
The earliest works on automated NRS relied on items’
popularity as a baseline. The rationale behind the popularity
baseline suggests that items relevant to many users are suited
candidates. Das et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and Garcin et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] employ the
popularity baseline. Lommatzsch [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] uses a circular bufer
implementation as baseline. This implementation combines
the popularity of items with a recency focus. Researchers
interested in content-based news recommendation devise
baselines using content features. Gao et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and Zheng et al.
[
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] use a term-frequency model as a baseline. Cantador et al.
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] use a keyword-based baseline for their semantic news
recommendation model. Okura et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] define a word-based
baseline for their embedding experiments. Li et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] use
an  -greedy strategy as baseline in their contextual bandit
evaluation. Li et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and Lu et al. [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] use collaborative
filtering and content-based filtering baselines. Some researchers
invest considerable resources to replicate existing results by
re-implementing proposed news recommendation algorithms.
Li and Li [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] compare their results to [
        <xref ref-type="bibr" rid="ref17 ref19 ref21 ref5 ref7">5, 7, 17, 19, 21</xref>
        ]. Zheng
et al. [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ] contrast their findings to [
        <xref ref-type="bibr" rid="ref17 ref28 ref32 ref4">4, 17, 28, 32</xref>
        ]. Wang et al.
[
        <xref ref-type="bibr" rid="ref33">33</xref>
        ] consider [
        <xref ref-type="bibr" rid="ref10 ref13 ref28 ref34 ref35 ref4 ref6">4, 6, 10, 13, 28, 34, 35</xref>
        ]. Khattar et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]
compare their approach with [
        <xref ref-type="bibr" rid="ref11 ref12 ref16 ref26 ref29">11, 12, 16, 26, 29</xref>
        ]. Studying the
baselines, we notice variances across works. Some baselines
appear frequently. Some papers describe baselines tailored
to their particular use case. There is no consensus on what
baseline should be used. Moreover, it remains unclear how
baselines correlate or interdepend on one another.
3
      </p>
    </sec>
    <sec id="sec-5">
      <title>BASELINE REQUIREMENTS</title>
      <p>Baselines allow us to measure the relative improvement of
an algorithm. Evaluators can not only report a value but
express how much better the value is compared to a more
straightforward method. Thereby, we learn whether it is
worth investing additional efort into developing more
sophisticated algorithms. Still, defining a meaningful baselines
poses a technical challenge. Denfiing a baseline which fails
miserably does not reveal much about relative improvements.
Defining a baseline which requires too much efort or has too
many design choices might render evaluation results hard to
compare. Requiring additional data can make it impossible
for some baselines to be used in experiments where these
data are lacking. Based on these propositions, we introduce
three requirements for news recommendation baselines:
(1) low implementation efort
(2) competitive performance
(3) no additional data required</p>
      <p>The next section introduces a variety of candidate
algorithms and analyses how well they perform in a large-scale
experiment.
4</p>
    </sec>
    <sec id="sec-6">
      <title>EXPERIMENT</title>
      <p>
        In this experiment, we use the NewsREEL evaluation
platform described in Lommatzsch et al. [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. NewsREEL ofers
researchers the opportunity to evaluate their news
recommendation algorithms on real users with several connected
publishers. We use data recorded in the time 1 March 2017 to
30 June 2018 (14 months) including approximately 94 million
sessions of three publishers. The system tracks readers using
session cookies. The session information allows us to link
reading events to a particular reader. Whenever more than
1 h passes in between events, we create a new session. Note
that we disregard all sessions with a single reading event as
we cannot compare predictions to future events in these cases.
All three publishers operate an NRS. Empirically, the NRS
produce clicks on the order of a few per thousand
recommendations. Hence, their efect on the collected reading events
appears negligible.
      </p>
      <p>Table 1 outlines the characteristics of the data set.
Publishers A and C have to deal with expectedly short sessions.
Table 1 also lists the number and proportion of sessions with
only a single interaction. This matters as we have to
disregard those sessions in the evaluation. Without a second
interaction, we cannot determine whether a recommendation
was successful.
4.1</p>
    </sec>
    <sec id="sec-7">
      <title>Candidate Baselines</title>
      <p>In the experiment, we consider eight baseline candidates.</p>
      <p>Random. The random method considers all known articles
and picks a suggestion at random. In addition, we consider a
slightly advanced version which draws only from the set of
items published in a certain time window. This accounts for
the readers’ desire to read more current news.</p>
      <p>Popularity. The popularity method suggests exactly the
article which has been read most often. Similarly to the
random method, we consider a version of the popularity
version, which considers reading frequencies in a specific time
window.</p>
      <p>Recency. The recency method recommends news articles
most recently published. The method disregards any form of
popularity or personalisation.</p>
      <p>Reading Sequences. The reading sequences method
monitors which article users read in sequence. It recommends
exactly the article which users read most frequently given
the readers current article.</p>
      <p>
        Collaborative filtering.Applying Collaborative Filtering
to news recommendation is particularly challenging. Not
only keep publishers adding new items, but also do systems
know very little about users. We follow the suggestion of Das
et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and implement a MinHash version of Collaborative
Filtering.
      </p>
      <p>Content-based Filtering. Content-based filtering requires
a way to define similarity among news articles. Generally,
we could employ a string matching approach on the title
or text. Still, this would require considerable computational
efort. Besides, the possibility of diferent languages adds
another level of dificulty. We have implemented a more
straightforward method. The Content-based filtering takes
the category of news articles as a proxy for similarity and
suggests articles from the same category at random.</p>
      <p>
        Circular buffer. We use the circular bufer proposed by
Lommatzsch [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] which has also been used as the baseline
of CLEF NewsREEL [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. This method has a fixed size
list which the systems updates as interactions occur. The
system adds the article of each interaction. When the system
arrives at the end of the list, it moves the index to the first
position and goes on. The methods select recommendations
by reverse lookup in the list. Thereby, it combines popularity
and recency. More popular articles occur more frequently
on the list. More recently read articles occur with a higher
probability as well.
      </p>
      <p>Trending. The trending method computes the trend for
each article in a given time window. More specifically, the
method carries out a regression on the reading frequency
binned on an hourly level. The method recommends those
articles with the steepest trend.</p>
      <p>Implementing baseline candidates causes little efort.
Contentbased filtering requires knowing the category of news articles.
All remaining candidates only require interaction data with
timestamps. Hence, they fulfil already two of our three
requirements.
4.2</p>
    </sec>
    <sec id="sec-8">
      <title>Evaluation Protocol</title>
      <p>We present each event to all of the baseline candidates and
request exactly one item as a recommendation. The evaluator
stores the recommendations with reference to the session.
When the session re-appears, the evaluator checks whether
this article had been recommended previously. If that is the
case, the evaluator adds one to the score () of that baseline.</p>
      <p>Based on the recorded score, we compute two evaluation
measures on recommendation success on event and session
level:


=
=
||− 1 ∑︁</p>
      <p>∈
||− 1 ∑︁ 1{()&gt;0}
∈
(1)
(2)</p>
      <p>The first score,  approximates the expected number of
successful recommendations per session. The second score,
, estimates the chance that at least one recommendation
will succeed.
4.3</p>
    </sec>
    <sec id="sec-9">
      <title>Evaluation Results</title>
      <p>Table 2 shows our observations for all combinations of
publishers and baselines. We notice that the baselines’ performances
vary considerably. We have scaled the results by a factor of
10− 5 to obtain more legible figures. Thus, a score of 1000
refers to one per cent. The random baseline performs poorly
among all publishers. The chance to randomly suggest
something interesting is below 1 in 1000. Constraining the time
window to shorter periods slightly improves the success rate.
The performance of popularity, recency, and sequences difers
between publishers. Publisher A and C show better results for
popularity and sequences. Sequences even perform best for
Publisher A overall. Publisher B, on the other hand, shows
the best results for recency, and, conversely, the worst
performance for sequences. This is surprising as Publisher B sees
the longest sessions on average (cf. Table 1). The presence
of very specific sequences could spoil the predictions,
particularly for recently published items. Content-based filtering
performs poorly, especially for Publisher C. Publisher C
focusses on the narrow topic of automotive news. This could
lead to circumstances in which the NRS faces a large number
of very specific categories. Collaborative filtering performs
well for publishers A and B but not for C. The circular bufer
and the trending baseline perform well for all publishers.
The circular bufers exhibit little variation depending on the
parameter choice. This could be due to the highly skewed
frequencies with which readers engage with articles. The
distributions exhibit a strong popularity bias which supplies
the same small subset of articles to the bufer. As a result,
the recommendations would largely coincide among lists of
varying lengths. Publishers A and B show improvements for
shorter time windows, whereas Publisher C trending baseline
performance peaks at 12 hours. Besides, we observe diferent
maximum results among the three publishers. Sequences top
Publisher A at 18.5 % () and 14.7 % (). In contrast, the
circular bufer with one hundred elements scores highest for
Publisher B with 11.7 % () and 3.0 % (). The circular
bufer with five hundred or a thousand elements performs best
for Publisher C with 2.1 % () and 1.6 % (). Hence, the
diferences with respect to  span 16.4 %. Such substantial
diferences are unlikely to emerge from the baseline. Instead,
we have to assume that other aspects play a vital role, such
as the composition of the readership, interface design, and
content.
5</p>
    </sec>
    <sec id="sec-10">
      <title>CONCLUSION AND FUTURE WORK</title>
      <p>News Recommender Systems support readers navigating the
news landscape by suggesting which article to read next.
Evaluation is necessary to optimise NRS. To estimate how
much value a new method adds, evaluation protocols compare
their results to baselines. A consensus on what baselines to
use has yet to establish. Researchers have used a variety of
baselines (cf. Section 2). We have formulated three criteria
which baselines have to meet to be considered a viable option.
Section 4.1 presents a list of candidate baselines, all of which
require manageable efort to implement and need not much
additional data. To check the candidate baselines’
competitiveness, we have devised an experiment on three publishers
with data covering 14 months. The results suggest that the
circular bufer and the trending baseline stably provide
competitive performance for all publishers. We have observed
variations among the performance of baselines for particular
publishers. For instance, the sequence baseline has scored
exceptionally well for publisher A yet failed for publisher
B completely. More research is needed to explore how to
transfer our findings to other publishers. We conjecture that
similar publishers will confirm the order of performance for
the baselines.</p>
      <p>
        Researchers are keen to compare their methods to the most
competitive approach. This requires considerable investment
in re-implementing previous research. We argue that using
the circular bufer or trend-based method already represents
a solid baseline. Both have scored higher than
collaborative filtering and content-based filtering in our experiments.
Researchers ought to highlight the trade-of between
predictive accuracy and computational costs. Amatriain and
Basilico [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] have highlighted this critical trade-of in the case
of streaming service Netflix. Their team decided not to
implement an ensemble of 107 algorithms as the engineering
costs surpassed the added value in prediction accuracy. The
accelerated dynamics of collections of news articles raise even
Baseline
random
random (6h)
random (12h)
random (24h)
random (48h)
popular
popular (6h)
popular (12h)
popular (24h)
popular (48h)
recency
circular bufer (100)
circular bufer (200)
circular bufer (500)
circular bufer (1000)
sequences
content-based
collaborative filtering
trends (2h)
trends (6h)
trends (12h)
trends (24h)
stricter constraints on recommender systems than in the case
of movies.
      </p>
      <p>We see several directions for future research. First, one
could introduce a method to estimate the implementation
costs in a more quantitative fashion. This would allow us to
address the trade-of more rigorously. Likewise, one could
measure the data footprint of diferent baselines to assess
their space and time complexity. Second, one could apply
the proposed baselines and possible additions or adaptions
on data from other publishers. This would help to quantify
the generality of our findings. Finally, evaluation protocols
including more than a single recommendation could reveal
how ranking metrics compare to our binary scheme. Still,
ranking metrics might not display most accurately whether
a system performs well unless it presents recommendations
in the form of a list.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Xavier</given-names>
            <surname>Amatriain</surname>
          </string-name>
          and
          <string-name>
            <given-names>Justin</given-names>
            <surname>Basilico</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Netflix recommendations: beyond the 5 stars (part 1)</article-title>
          .
          <source>Netflix Tech Blog</source>
          <volume>6</volume>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Erik</given-names>
            <surname>Brynjolfsson and JooHee Oh</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>The Attention Economy - Measuring the Value of Free Digital Services on the Internet</article-title>
          .
          <source>ICIS</source>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>[3] Iv´an Cantador, Pablo Castells</article-title>
          , and Alejandro Bellog´ın.
          <year>2011</year>
          .
          <article-title>An enhanced semantic layer for hybrid recommender systems: Application to news recommendation</article-title>
          .
          <source>International Journal on Semantic Web and Information Systems (IJSWIS) 7</source>
          ,
          <issue>1</issue>
          (
          <year>2011</year>
          ),
          <fpage>44</fpage>
          -
          <lpage>78</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Heng-Tze</surname>
            <given-names>Cheng</given-names>
          </string-name>
          , Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye,
          <string-name>
            <surname>Glen Anderson</surname>
          </string-name>
          , Greg Corrado, Wei Chai,
          <string-name>
            <surname>Mustafa Ispir</surname>
          </string-name>
          , et al.
          <year>2016</year>
          .
          <article-title>Wide &amp; deep learning for recommender systems</article-title>
          .
          <source>In Proceedings of the 1st workshop on deep learning for recommender systems. ACM</source>
          ,
          <fpage>7</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Wei</given-names>
            <surname>Chu</surname>
          </string-name>
          and
          <string-name>
            <surname>Seung-Taek Park</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Personalized recommendation on dynamic content using predictive bilinear models</article-title>
          .
          <source>In Proceedings of the 18th international conference on World wide web. ACM</source>
          ,
          <volume>691</volume>
          -
          <fpage>700</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Paul</given-names>
            <surname>Covington</surname>
          </string-name>
          , Jay Adams, and
          <string-name>
            <given-names>Emre</given-names>
            <surname>Sargin</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deep neural networks for youtube recommendations</article-title>
          .
          <source>In Proceedings of the 10th ACM conference on recommender systems. ACM</source>
          ,
          <volume>191</volume>
          -
          <fpage>198</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Abhinandan S Das</surname>
          </string-name>
          ,
          <string-name>
            <surname>Mayur Datar</surname>
            , Ashutosh Garg, and
            <given-names>Shyam</given-names>
          </string-name>
          <string-name>
            <surname>Rajaram</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Google news personalization: scalable online collaborative filtering</article-title>
          .
          <source>InProceedings of the 16th international conference on World Wide Web. ACM</source>
          ,
          <volume>271</volume>
          -
          <fpage>280</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Qi</given-names>
            <surname>Gao</surname>
          </string-name>
          , Fabian Abel,
          <string-name>
            <surname>Geert-Jan Houben</surname>
            , and
            <given-names>Ke</given-names>
          </string-name>
          <string-name>
            <surname>Tao</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Interweaving trend and user modeling for personalized news recommendation</article-title>
          .
          <source>In 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology</source>
          , Vol.
          <volume>1</volume>
          . IEEE,
          <fpage>100</fpage>
          -
          <lpage>103</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Florent</given-names>
            <surname>Garcin</surname>
          </string-name>
          , Boi Faltings, Olivier Donatsch, Ayar Alazzawi, Christophe Bruttin, and
          <string-name>
            <given-names>Amr</given-names>
            <surname>Huber</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Offline and online evaluation of news recommender systems at swissinfo. ch</article-title>
          .
          <source>In Proceedings of the 8th ACM Conference on Recommender systems. ACM</source>
          ,
          <volume>169</volume>
          -
          <fpage>176</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Huifeng</surname>
            <given-names>Guo</given-names>
          </string-name>
          , Ruiming Tang, Yunming Ye,
          <string-name>
            <given-names>Zhenguo</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Xiuqiang</given-names>
            <surname>He</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>DeepFM: A Factorization-Machine based Neural Network for CTR Prediction</article-title>
          .
          <source>In Proceedings of the TwentySixth International Joint Conference on Artificial Intelligence, IJCAI-17</source>
          .
          <fpage>1725</fpage>
          -
          <lpage>1731</lpage>
          . https://doi.org/10.24963/ijcai.
          <year>2017</year>
          /239
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Xiangnan</surname>
            <given-names>He</given-names>
          </string-name>
          , Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and
          <string-name>
            <surname>Tat-Seng Chua</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Neural collaborative filtering</article-title>
          .
          <source>InProceedings of the 26th international conference on world wide web. International World Wide Web Conferences Steering Committee</source>
          ,
          <fpage>173</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Xiangnan</surname>
            <given-names>He</given-names>
          </string-name>
          , Hanwang Zhang,
          <string-name>
            <surname>Min-Yen Kan</surname>
          </string-name>
          , and
          <string-name>
            <surname>Tat-Seng Chua</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Fast matrix factorization for online recommendation with implicit feedback</article-title>
          .
          <source>In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. ACM</source>
          ,
          <volume>549</volume>
          -
          <fpage>558</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Po-Sen</surname>
            <given-names>Huang</given-names>
          </string-name>
          , Xiaodong He,
          <string-name>
            <surname>Jianfeng Gao</surname>
            , Li Deng,
            <given-names>Alex</given-names>
          </string-name>
          <string-name>
            <surname>Acero</surname>
            , and
            <given-names>Larry</given-names>
          </string-name>
          <string-name>
            <surname>Heck</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Learning deep structured semantic models for web search using clickthrough data</article-title>
          .
          <source>In Proceedings of the 22nd ACM international conference on Information &amp; Knowledge Management. ACM</source>
          ,
          <volume>2333</volume>
          -
          <fpage>2338</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Dhruv</surname>
            <given-names>Khattar</given-names>
          </string-name>
          , Vaibhav Kumar, Vasudeva Varma, and
          <string-name>
            <given-names>Manish</given-names>
            <surname>Gupta</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Weave&amp;Rec: A Word Embedding based 3-D Convolutional Network for News Recommendation</article-title>
          .
          <source>In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. ACM</source>
          ,
          <year>1855</year>
          -
          <fpage>1858</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Yehuda</given-names>
            <surname>Koren</surname>
          </string-name>
          and
          <string-name>
            <given-names>Robert</given-names>
            <surname>Bell</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Advances in Collaborative Filtering</article-title>
          .
          <source>In Recommender Systems Handbook</source>
          . Springer US, Boston, MA,
          <fpage>77</fpage>
          -
          <lpage>118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Vaibhav</surname>
            <given-names>Kumar</given-names>
          </string-name>
          , Dhruv Khattar, Shashank Gupta, Manish Gupta, and
          <string-name>
            <given-names>Vasudeva</given-names>
            <surname>Varma</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Deep Neural Architecture for News Recommendation.</article-title>
          .
          <source>In CLEF (Working Notes).</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Lihong</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>Wei Chu</surname>
          </string-name>
          , John Langford, and
          <string-name>
            <given-names>Robert E</given-names>
            <surname>Schapire</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>A contextual-bandit approach to personalized news article recommendation</article-title>
          .
          <source>In Proceedings of the 19th international conference on World wide web. ACM</source>
          ,
          <volume>661</volume>
          -
          <fpage>670</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Lei</given-names>
            <surname>Li</surname>
          </string-name>
          and
          <string-name>
            <given-names>Tao</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>News Recommendation via Hypergraph Learning: Encapsulation of User Behavior and News Content</article-title>
          .
          <source>In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining (WSDM '13)</source>
          . ACM, New York, NY, USA,
          <fpage>305</fpage>
          -
          <lpage>314</lpage>
          . https://doi.org/10.1145/2433396.2433436
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Lei</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Dingding</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Tao</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Knox</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Balaji</given-names>
            <surname>Padmanabhan</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>SCENE: a scalable two-stage personalized news recommendation system</article-title>
          .
          <source>In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval. ACM</source>
          ,
          <volume>125</volume>
          -
          <fpage>134</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>Lei</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Li</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Fan</given-names>
            <surname>Yang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Tao</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Modeling and broadening temporal user interest in personalized news recommendation</article-title>
          .
          <source>Expert Systems with Applications 41</source>
          ,
          <issue>7</issue>
          (
          <year>2014</year>
          ),
          <fpage>3168</fpage>
          -
          <lpage>3177</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Jiahui</surname>
            <given-names>Liu</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Dolan</surname>
          </string-name>
          , and Elin Rønby Pedersen.
          <year>2010</year>
          .
          <article-title>Personalized news recommendation based on click behavior</article-title>
          .
          <source>In Proceedings of the 15th international conference on Intelligent user interfaces. ACM</source>
          ,
          <volume>31</volume>
          -
          <fpage>40</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Lommatzsch</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Real-time news recommendation using context-aware ensembles</article-title>
          .
          <source>In European Conference on Information Retrieval</source>
          , Vol.
          <source>LNCS 8416</source>
          . Springer, Springer,
          <fpage>51</fpage>
          -
          <lpage>62</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Andreas</surname>
            <given-names>Lommatzsch</given-names>
          </string-name>
          , Benjamin Kille, Frank Hopfgartner, Martha Larson, Torben Brodt, Jonas Seiler, and O¨zlem O¨zg¨obek.
          <year>2017</year>
          .
          <article-title>CLEF 2017 NewsREEL Overview: A Stream-based Recommender Task for Evaluation and Education</article-title>
          .
          <source>In International Conference of the Cross-Language Evaluation Forum for European Languages</source>
          . Springer,
          <fpage>239</fpage>
          -
          <lpage>254</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Pasquale</surname>
            <given-names>Lops</given-names>
          </string-name>
          , Marco De Gemmis, and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Content-based Recommender Systems: State of the Art and Trends</article-title>
          .
          <source>In Recommender Systems Handbook</source>
          . Springer, Boston, MA, Boston, MA,
          <fpage>73</fpage>
          -
          <lpage>105</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Zhongqi</surname>
            <given-names>Lu</given-names>
          </string-name>
          , Zhicheng Dou, Jianxun Lian, Xing Xie, and
          <string-name>
            <given-names>Qiang</given-names>
            <surname>Yang</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Content-based collaborative filtering for news topic recommendation</article-title>
          .
          <source>In Twenty-ninth AAAI conference on artificial intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Cataldo</surname>
            <given-names>Musto</given-names>
          </string-name>
          , Giovanni Semeraro, Marco de Gemmis, and
          <string-name>
            <given-names>Pasquale</given-names>
            <surname>Lops</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Learning word embeddings from wikipedia for content-based recommender systems</article-title>
          .
          <source>In European Conference on Information Retrieval</source>
          . Springer,
          <fpage>729</fpage>
          -
          <lpage>734</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Shumpei</surname>
            <given-names>Okura</given-names>
          </string-name>
          , Yukihiro Tagami, Shingo Ono, and
          <string-name>
            <given-names>Akira</given-names>
            <surname>Tajima</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Embedding-based News Recommendation for Millions of Users</article-title>
          .
          <source>In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '17)</source>
          . ACM, New York, NY, USA,
          <fpage>1933</fpage>
          -
          <lpage>1942</lpage>
          . https://doi.org/10. 1145/3097983.3098108
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>Steffen</given-names>
            <surname>Rendle</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Factorization machines</article-title>
          .
          <source>In 2010 IEEE International Conference on Data Mining. IEEE</source>
          ,
          <fpage>995</fpage>
          -
          <lpage>1000</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Steffen</surname>
            <given-names>Rendle</given-names>
          </string-name>
          , Christoph Freudenthaler, Zeno Gantner, and
          <string-name>
            <surname>Lars</surname>
          </string-name>
          Schmidt-Thieme.
          <year>2009</year>
          .
          <article-title>BPR: Bayesian personalized ranking from implicit feedback</article-title>
          .
          <source>In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence</source>
          . AUAI Press,
          <fpage>452</fpage>
          -
          <lpage>461</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Guy</given-names>
            <surname>Shani</surname>
          </string-name>
          and
          <string-name>
            <given-names>Asela</given-names>
            <surname>Gunawardana</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Evaluating Recommendation Systems</article-title>
          .
          <source>In Recommender Systems Handbook</source>
          . Springer US, Boston, MA.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>A</given-names>
            <surname>Swaminathan</surname>
          </string-name>
          and
          <string-name>
            <given-names>T</given-names>
            <surname>Joachims</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Counterfactual Risk Minimization: Learning from Logged Bandit Feedback.</article-title>
          .
          <source>In ICML.</source>
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Huazheng</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qingyun Wu</surname>
            , and
            <given-names>Hongning</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Learning hidden features for contextual bandits</article-title>
          .
          <source>In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM</source>
          ,
          <volume>1633</volume>
          -
          <fpage>1642</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>Hongwei</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Fuzheng Zhang, Xing Xie, and
          <string-name>
            <given-names>Minyi</given-names>
            <surname>Guo</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>DKN: Deep Knowledge-Aware Network for News Recommendation</article-title>
          .
          <source>In Proceedings of the 2018 World Wide Web Conference (WWW '18)</source>
          .
          <source>International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland</source>
          ,
          <fpage>1835</fpage>
          -
          <lpage>1844</lpage>
          . https://doi.org/10.1145/3178876.3186175
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <surname>Jin</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhongyuan</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Dawei Zhang, and
          <string-name>
            <given-names>Jun</given-names>
            <surname>Yan</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification</article-title>
          ..
          <source>InIJCAI</source>
          .
          <volume>2915</volume>
          -
          <fpage>2921</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <surname>Hong-Jian</surname>
            <given-names>Xue</given-names>
          </string-name>
          , Xinyu Dai, Jianbing Zhang, Shujian Huang, and
          <string-name>
            <given-names>Jiajun</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Deep Matrix Factorization Models for Recommender Systems.</article-title>
          .
          <source>In IJCAI. 3203-3209.</source>
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <surname>Guanjie</surname>
            <given-names>Zheng</given-names>
          </string-name>
          , Fuzheng Zhang, Zihan Zheng,
          <string-name>
            <surname>Yang</surname>
            <given-names>Xiang</given-names>
          </string-name>
          , Nicholas Jing Yuan, Xing Xie, and
          <string-name>
            <given-names>Zhenhui</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>DRN: A Deep Reinforcement Learning Framework for News Recommendation</article-title>
          .
          <source>In Proceedings of the 2018 World Wide Web Conference (WWW '18)</source>
          .
          <source>International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland</source>
          ,
          <fpage>167</fpage>
          -
          <lpage>176</lpage>
          . https://doi.org/10.1145/3178876.3185994
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <surname>Li</surname>
            <given-names>Zheng</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Lei</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Wenxing</given-names>
            <surname>Hong</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Tao</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>PENETRATE: Personalized news recommendation using ensemble hierarchical clustering</article-title>
          .
          <source>Expert Systems with Applications 40</source>
          ,
          <issue>6</issue>
          (
          <year>2013</year>
          ),
          <fpage>2127</fpage>
          -
          <lpage>2136</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>