<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta />
  </front>
  <body>
    <sec id="sec-1">
      <title>2. News Recommender Systems</title>
      <p>News recommender systems share many features with
information retrieval systems and human computer interaction as
well. Text mining techniques for large scale data sets are needed,
and machine learning methods are employed when learning cycles
can be built into the systems. In general there are three steps. First
of all, data pre-processing such as sampling, dimension reduction,
denoising with use of similarity functions are normally applied.
Then the text is analyzed through supervised or unsupervised
machine learning techniques depending on availability of training
data sets. At the end the result is interpreted through for example
the F1- measure, ROC or MAE [1].</p>
      <p>If we consider news recommender system as search engines, the
user profiles can be regarded as long search queries. The system
ranks the results on the basis of well the profile matches the
descriptions of the news articles. Formally, the appropriateness of
recommended news to the user can described by the following
utility function [1]:
This function assigns a score r for each combination of user c and
news story s. Matrix indicates the characteristics of the user and
shows the different specifications of available articles such as
topic, location, news agency, date and other useful attributes. All
different algorithms in recommender systems try to maximize the
result matrix. Each entry of could be any non negative internal
between 0 and 1 or 0 and 100 based on the system definition. At
the end, an article that maximizes the utility function will be
recommended [1]:
News recommender systems differ in the context of items
structures from other recommenders. The structure of news
articles is not following any specific format. There are many news
articles in a day that have very short life spans while the system
must scale to deal with huge volumes of data. Besides, the news
recommender system must always recommend interesting articles
to the user, though it should not make over-specialize for the
target user. [2]</p>
    </sec>
    <sec id="sec-2">
      <title>3. User Profiles</title>
      <p>The desired user profiles need to have a changing essence and
flexible content. These profiles show their preferences towards
news articles by modeling the interesting articles. Besides, storing
user interactions is a basis to know their favorite topics which last
longer and which are only for a short period of time.</p>
      <p>This model consists of meta-data such as time and location, which
is changing according to the user behavior.</p>
      <p>The content of the user profile for this kind of recommender
system which has not very structured format is different from
others. In order to have an exact and practical model of the user
profiles, the system needs to know the behavior of the user
including background, interest and goals. These features are
changing over time, so considering the temporal parameters such
as time and location is crucial [3].</p>
      <p>There are three major presentations of terms in the user
profile. The first approach is presenting terms as vectors in a
vector space model. In order to weigh correctly every single word
based on its frequency in every document and in the collection of
documents, TF-IDF is often applied. This measure puts more
emphasis on one word that appears frequently in one specific
document and not in other ones. So it will gain more weight and
appointed document, will be retrieved to a target user. But the
problem of polysemy (multiple meaning for one word) and
synonymy (multiple words for identical meaning) remain. The
desired approach reflects cultural and linguistic knowledge of
terms and also could use reasoning on their content. As a result,
the presentation is more intelligent and is not a simple bag of
words and could provide the knowledge about desired terms [1].
The second one is the analysis words in the format of entity. They
have meanings and relations, but they suffer from generalization
or specialization since there is no hierarchical relationships among
the entities [3]. The third one is the semantic analysis that is
ontology-based. It has hierarchical relationships between the
semantic concepts modeling user interests. The terms that indicate
the user interests including their interests that last longer or the
ones that appear only for a short time could be enriched by
semantic approaches. The advantage of providing ontologies for
the user interests is that all the terms or entities are in hierarchical
relationships which give more specific detail of user interests at
the side of the general ones [3]. The semantic enrichment could
benefit from encyclopedic knowledge beside the knowledge of
applied documents. So the terms are semantic vectors in word
space model [1]. Each of them are indexed by their weights but
later will be interpreted semantically by using Wikipedia. It is
called Explicit Semantic Analysis (ESA) [4].</p>
      <p>The feedback of the user is the other approach of user modeling.
In general s/he could communicate and provide their interest
towards the news explicitly or implicitly. Explicit feedback is to
provide their interest (disaffection) directly to the system. It could
be actions such as rating, like or filling the survey through the
interface of the application. Implicit feedback includes the
interactions such as click on articles (touch in mobile device),
scrolling articles using a mouse or a keyboard (swapping in the
mobile device), printing or saving articles, copying and posting a
part or all of articles, reading articles, forwarding or sharing the
articles and providing the qualitative comments on the article.
Recommender systems are highly dependent on user feedback. As
long as the user interacts with the application, the accuracy of the
system may gradually improve. Explicit feedback tends to
produce more exact user profiles than what is possible with
implicit feedback. Unfortunately, not all users are willing to spend
time to provide such feedback, so the implicit signals of the users
are normally the basis of the recommendation [5].</p>
      <p>Specifying the type of user’s interest could help the system to
cover all domains of their attention. The long-term interest is
more dependent on the user profession and the personal
background than what will be traced by the log history. But the
short-term interest is the one mostly related to the current trend of
public that s/he has communication with. Although depending on
the goals, the long-term interest will change gradually. Besides,
supervising the context of user’s attention could provide good
evidence to capture the short-term interest and update their
longterm interest time by time. In [6] by defining running context over
category and topic, the current user‘s interest is captured. The old
user profile that is the indicator of their long-term interest is
updated progressively if there is nothing in common with their
current focus. Besides, there should be a balanced focus on the old
and new user profile. While keeping the old user profile and over
looking the context results in dissatisfaction, giving too much
priority to the current context will not cover the news articles that
are related to their background and are the basis of their interest.
In addition, different time of day (morning, evening) and week
(weekdays and weekend) could affect the user profile [7].
Considering the topic of the news articles, target users may have
different desires at different times. As an example, s/he might
have more interests in politics and economics in weekdays and
focus more on lifestyle news in the weekend [8].</p>
      <p>While personalizing the news is desirable, the importance of
public trend is not negligible. In [9] based on the frequency of
user clicks, public trend could provide the interesting news
articles as well. If there are not enough clicks from the user side,
then according to their location, public trend of that location is a
good indicator to recommend the news. This dimension of the
user profile that specifies the location has a key role in
recommending news articles. Short-term interests of the user are
highly dependent on their location. Location could capture public
trend and find similar networks of users as well. Sometimes
ignoring the user profile and focus on the context is helpful (in
economical news, user profile is not very helpful but the context
tracing is more informative), while other times it is better to count
only on the user profile (for entertainment section user profile
enrichment is much better than context) [10].</p>
      <p>As the amount of data explodes, the importance of extracting
models and predicting unseen data with machine learning
techniques is increasing [11]. There are two major types of
learning techniques, supervised and unsupervised. In the former
one, an annotated training dataset is provided, whereas in the
latter one, the machine explores the data to identify interesting
patterns without training data. Below is the list of supervised
learning techniques used in recommender systems:
</p>
      <p>Decision Trees (C4.5 or KART) handle
categoricalnominal and heterogeneous data. It is also able to cope
with missing values. Through pre pruning, overfitting
will be addressed. It tends to work well with small sized
datasets, though the cost of decisions on continuous data
streams is high [11, 12].</p>
      <p>
        Rule-based (RIPPER) can handle multi value features
very well. It is decision tree-based and uses rules to
categorize new items. It utilizes post pruning to find the
best fit for the rule set [
        <xref ref-type="bibr" rid="ref1">13</xref>
        ].
      </p>
      <p>
        K Nearest Neighbor (KNN) can handle continuous data
through Euclidean, Manhattan or Minkowski distance
and cope with categorical data through Hamming
distance. It is a lazy learner that works well with few
instances [
        <xref ref-type="bibr" rid="ref2 ref3">14, 15</xref>
        ].
      </p>
      <p>
        Rocchio and Relevance Feedback: the user profile is
regarded as a query [
        <xref ref-type="bibr" rid="ref4">16</xref>
        ] and based on the implicit
feedback of user, the recommendation will be improved
in time.
      </p>
      <p>
        Support Vector Machine (SVM): through SVM
reduction of sensitivity to the noises and increasing
generalization is done. For non linear problem if




features are more than instances, linear kernel is good
enough to be applied [
        <xref ref-type="bibr" rid="ref4 ref5">16, 17</xref>
        ].
 Probabilistic methods and Naive Bayes: Bayesian Belief
Network with conditional independency is the most
applicable one. Multinomial (Bernoulli) and
multivariate are two types of Naive Bayes. While in the
Bernoulli model absence or presence of a model is
checked, in multivariate one the number of occurrences
of a term will be calculated [
        <xref ref-type="bibr" rid="ref6 ref7">18, 19</xref>
        ].
 Neural Network: Single layer perceptron and multi layer
for non linear separable problems are the samples of
applied neural network in the recommender systems
[
        <xref ref-type="bibr" rid="ref8">20</xref>
        ].
      </p>
      <p>Below is the list of unsupervised learning techniques:
</p>
      <p>
        Probabilistic methods: If the structure of Bayesian
network is not known then building the DAG Bayesian
with scoring function, constraint based learning or
Conditional Independency can be applied. The last one
has more efficiency [
        <xref ref-type="bibr" rid="ref9">21</xref>
        ]. The other techniques such as
Bayesian Hidden Score (pairwise learning) and
graphbased learning have been applied in [
        <xref ref-type="bibr" rid="ref10">22</xref>
        ].
 Neural Network: Self Organizing Map (Kohonen) and
Restricted Boltzmann Machine belong to the category
of unsupervised learning [
        <xref ref-type="bibr" rid="ref8">20</xref>
        ].
 Clustering: flat clustering by k-means algorithm deals
with the categorical data and the most frequent term will
be the centroid. In the hierarchical clustering, the other
type of clustering, divisive is more accurate than
agglomerative. There are two approaches to label
clusters. The first one is differential that through feature
selection a label with a higher score will be chosen. The
second one is inter clustering that the closest one to the
title or the higher weight to the centroid of the cluster
will be chosen as the label. The drawback of
clusterinternal labeling is disability to distinguish between
words which are frequent in the whole clusters and the
ones that are frequent only in one specific cluster.
Labeling in hierarchical clustering due to the dependent
definitions of parent, child and sibling is more
complicated [
        <xref ref-type="bibr" rid="ref4">16</xref>
        ].
      </p>
      <p>Table 1 shows the applied machine learning techniques to build
up a user profile.</p>
    </sec>
    <sec id="sec-3">
      <title>4. Applying User Profiles in Recommender</title>
    </sec>
    <sec id="sec-4">
      <title>Systems</title>
      <p>There are different approaches to filter out the information.</p>
      <p>Content-based and collaborative filtering are the most applicable
ones. In content-based filtering, the concept of news articles will
be analyzed. Then according to the content of the user profile (i.e.
characteristic of read articles), similar articles are predicted and
presented to the user. In the content-based filtering, the utility
function is:
If each of the content of the user profile and item profile is
represented by TF-IDF weight, then the scoring function could be
calculated through cosine similarity of vectors of the weight. To
achieve the accurate prediction, attributes of news articles that
have been counted on, are important. Since the nature of news
article is unstructured, extracting relevant and important features
has a key role in content-based filtering. If the articles are
categorized with minimum misclassification error, then storing
interesting news articles in the user profile is much easier and
consequently, recommendations are of higher quality. Bayesian
Networks can be utilized well for learning user profiles based on
the articles that have been read. It can model profiles of the users
through ignoring missing data and considering conditional
dependency in one specific category of news articles. It can
provide probabilities of each attribute of article by its nodes. The
modeled domain includes continuous data. Then similarity of the
user profile based on predicted attributes of article and available
news articles is computed and the ones with the highest score will
be recommended. If another technique such as Naive Bayes
(Bernoulli Model) is applied for modeling user behavior, the
output is binary as it is considering absence or presence of terms
regardless of their conditional independency [1]. It can suggest the
new item to the target user by comparing the new item’s
characteristics to the terms in the user’s profile. But if there is not
enough attributes, content-based filtering is normally not the most
efficient one. If the user is new to the system it cannot recommend
anything as there is no content of their profile available. Besides,
it causes lack of serendipity due to providing too many similar
news articles to the user. Considering the collaborative approach
for filtering information, there are two different models,
memorybased and model-based. Memory-based utilizes the log</p>
      <p>
        KNN
Semantic enrichment can be handled at entity level, but in the beginning of building the user profile or for
capturing short-term interest [
        <xref ref-type="bibr" rid="ref1 ref11">13, 23</xref>
        ].
      </p>
      <p>Semantic enrichment can be handled at entity level. More interesting categories of news may be predicated
through rules [1].</p>
      <p>
        Captures the short term interest of user and popularity of the item among a group of user.
User profiles are regarded as queries, the system improves over time from relevance feedback of the user [
        <xref ref-type="bibr" rid="ref4">16</xref>
        ].
      </p>
      <p>
        It outperforms KNN,C4.5 and Rocchio [
        <xref ref-type="bibr" rid="ref4">16</xref>
        ] with the Reuters dataset
Bernoulli works well with small sizes of data set and multinomial works well in large sizes of datasets. DAG
captures the dependency of items in more detailed capturing interest, vigorous towards missing data and could
disregard noisy data.
      </p>
      <p>
        BHS and graph-based capture online interest of the user [
        <xref ref-type="bibr" rid="ref10">22</xref>
        ]
      </p>
      <sec id="sec-4-1">
        <title>Neural Network</title>
        <p>
          It can represent details of the user’s interest through deep learning of three layer perceptron [
          <xref ref-type="bibr" rid="ref12">24</xref>
          ].
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Clustering</title>
        <p>The content of the items are clustered and then item-based collaborative is implemented on the output.</p>
        <sec id="sec-4-2-1">
          <title>Fuzzy membership over the k-means.</title>
          <p>Similarity of the item-rating matrix, the group-rating matrix (MovieLense)</p>
          <p>
            Hierarchical clustering for the news groups (LDA for small dataset and PLSI for large dataset) [
            <xref ref-type="bibr" rid="ref13">25</xref>
            ]
history of all users and put top-N similar users who have the
same taste about the news articles into one specific group.
          </p>
          <p>Then to provide the latest and interesting news articles to the
target user, it filters out users with the same interest and
recommends the new articles that have been read by them. It is
working with a matrix of user’s profile and all the news articles.</p>
          <p>
            It is possible to apply K Nearest Neighbor (through
neighborhood measurement) to find the closest users to the
current active user. The other approach is applying similarity
measurement like cosine similarity or Pearson correlation, which
provide the new item for the target user if it has similarity with
previous chosen items. It can help us find similar users or items
regarding to the context of memory [
            <xref ref-type="bibr" rid="ref11">23</xref>
            ].
          </p>
          <p>
            The other type of collaborative filtering is model-based. It is
more scalable and much faster than memory based collaborative
filtering. Through this type of filtering not all the dataset will be
traced and investigated, but only some information will be
modeled. As finding the similarity between users or news
articles (users with the same interest in the specific news or two
similar news articles that are interesting to one specific user) is
not feasible due to lack of labeled data in the training phase,
clustering of news or users could be a practical solution. With
the Google News dataset, clustering is done on the basis of
users’ clicks on different news article. Through clustering, latent
factors (latent semantic analysis) can be revealed. Consequently,
ignoring the hidden values will result in a very poor accuracy. It
could be helpful to distinguish hidden variables through the
clustering and provide more accurate prediction of news articles
[
            <xref ref-type="bibr" rid="ref11">23</xref>
            ]. One the technique to implement this approach is building
up the matrix of users and item as matrix factorization. The
matrix of users and news articles is suffering from sparsity,
since there are several positions that users do not provide any
feedback. To find the hidden variables that affect the
recommendation as well, UV decomposition (it is one instance
of Singular Value Decomposition) is possible to be applied. If
the utility matrix is ( indicates the user and
indicates the news articles), then UV decomposes it
multiplication of two different matrixes including
and :
RMSE is a common tool to measure the accuracy of prediction
blank entries in considering the product .
          </p>
          <p>Although it is working much faster than memory based, it is less
exact than it. In spite of all the applicable different approaches
of collaborative filtering, it cannot make the accurate prediction
for the new user or the new item (cold-start problem). The core
of all the algorithms is dependent on the group of users (or
items) in order to find the proper match for the target user.</p>
          <p>Consequently it has nothing to present to the user with unique
taste.</p>
          <p>
            As each of these filtering techniques has its own problems and
challenges in recommender systems, a hybrid system is often
preferred. It takes into account both filtering in predefined step
and could overcome drawback of each. Considering two
techniques of filtering (content-based and collaborative), the
order of combination of them might be important to build a
hybrid system. Although in some techniques of hybridization,
the order is not a matter. The techniques that order is not
important are [
            <xref ref-type="bibr" rid="ref14">26</xref>
            ]:
          </p>
          <p>
            Mixed: the result from both techniques will be presented in
one grouped or separate list. It has been utilized in [
            <xref ref-type="bibr" rid="ref15">27</xref>
            ] to
provide the TV shows to the users. The mixed hybrid system
provides recommendations based on the characteristics of each
show and preferences of other users.
          </p>
          <p>Weighted: The score for each technique is computed, and
the weighting of final score will be the basis for the
recommendation. In personalized Tango (P-tango) for online
newspapers, equal weights are assigned to both filtering
techniques. Gradually each weight is increasing regarding the
user rating. Based on the rating, the absolute error is computed
and is decreasing through the better recommendation.</p>
          <p>
            Switching: This technique uses some criterion to switch
between filtering techniques and based on the specific chosen
filter, recommends the item. In the DailyLearner switching
hybrid system, content-based filtering with k nearest neighbor is
first applied. If it does not produce sufficient recommendations,
collaborative filtering takes advantage of similar users’ interests
to recommend desired items. In another system, item-based
collaborative filtering is triggered if the accuracy of the
contentbased filtering part is low [
            <xref ref-type="bibr" rid="ref16">28</xref>
            ].
          </p>
          <p>
            Feature combination: The technique takes advantage of
one filtering type such as collaborative filtering as feature allied
with data. Then content-based filtering is applied. Through this
kind of hybrid system, the absolute dependency on users is
dropped by applying collaborative filtering as a feature
combination. In the movie recommender domain [
            <xref ref-type="bibr" rid="ref17">29</xref>
            ], the
RIPPER algorithm is implemented with item features and users
rating.
          </p>
          <p>There are three other models of hybrid systems that are ordered
by their intrinsic structure:</p>
          <p>Feature augmentation: One of the filtering techniques is
applied to compute rating scores or to classify items. The output
of this filtering is the input for the other filtering technique. In
Libra system, content-based filtering through Naïve Bayes is
done on data that comes from Amazon. The data from Amazon
that show related authors and titles were implemented using
collaborative filtering. Collaborative filtering is done first.</p>
          <p>
            Meta-level: It provides a model through one of the filtering
methods as an input for the other one. The model is the complete
one, not a learned model like feature augmented techniques. In
Fab [
            <xref ref-type="bibr" rid="ref18">30</xref>
            ] at first by means of relevance feedback and the
Rocchio algorithm, collections of items (the need of users in
mass of dataset in web) are composed (content-based).
Knearest neighbor is then used with collaborative filtering to
complete the recommendations. Meta-level is the only ordered
technique that applies content-based filtering first.
          </p>
          <p>
            Cascade: Approximately similar to the other ordered
techniques, it refines the result of candidates that have been
filtered by the previous technique. But if the items in the first
filtering have very low priorities, they will not be in the second
filtering stage. In fact, the second filtering step is only applied to
provide more accurate recommendations and if an item has not
enough rating score, it will not be in the second phase. Fab [
            <xref ref-type="bibr" rid="ref19">31</xref>
            ]
is the example of this technique. With collaborative filtering on
the selection stage, the items are chosen with an exact score and
presented to the user.
          </p>
          <p>According to the implemented hybrid systems in news
recommender system (such as Daily Learner), switching schema
is the most common strategy. It can start with content-based
filtering and utilize Naive Bayes to categorize the news articles
based on the content of the articles and apply item-based
collaborative filtering to calculate the similarity between the
news articles and the user profile. On the other hand, it is also
possible to apply collaborative filtering to find the closest users
to the active user (through KNN) and then with content-based
filtering identify much more similar items based on the
similarity computation of user profile and news articles.</p>
          <p>Table 2 shows the applied machine learning techniques to deal
with the issues of news recommender systems.</p>
          <p>ML Techniques</p>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>Decision Tree (C4.5) Rule-based (RIPPER)</title>
        <p>KNN</p>
      </sec>
      <sec id="sec-4-4">
        <title>Rocchio and</title>
      </sec>
      <sec id="sec-4-5">
        <title>Relevance</title>
      </sec>
      <sec id="sec-4-6">
        <title>Feedback</title>
      </sec>
      <sec id="sec-4-7">
        <title>Support Vector</title>
        <p>
          Serendipity can be supported with new category reasoning [
          <xref ref-type="bibr" rid="ref20">32</xref>
          ].
        </p>
        <p>Short-term interests and provide the latest news to the user based on their interests [1].</p>
        <sec id="sec-4-7-1">
          <title>Handling long-term interest of the user [1]. Sparse Problem and huge data after a long time usage of the application[33].</title>
        </sec>
      </sec>
      <sec id="sec-4-8">
        <title>Probabilistic</title>
        <p>methods and</p>
      </sec>
      <sec id="sec-4-9">
        <title>Naive Bayes</title>
      </sec>
      <sec id="sec-4-10">
        <title>Neural Network</title>
      </sec>
      <sec id="sec-4-11">
        <title>Clustering</title>
        <sec id="sec-4-11-1">
          <title>Handling long-term interest of the user</title>
        </sec>
        <sec id="sec-4-11-2">
          <title>Sparse problem</title>
        </sec>
        <sec id="sec-4-11-3">
          <title>Noisy data</title>
        </sec>
        <sec id="sec-4-11-4">
          <title>Cold Start</title>
        </sec>
        <sec id="sec-4-11-5">
          <title>Precious interest of the user [28].</title>
        </sec>
        <sec id="sec-4-11-6">
          <title>Short term and long term [34].</title>
          <p>Tied Boltzmann with residual parameter could outperform on non cold-start problem in comparison with simple
method of collaborative filtering, Pearson correlation for the items. It also is competitive with the cold-start
problem in content-based filtering. (Netflix)</p>
        </sec>
        <sec id="sec-4-11-7">
          <title>Changing interest of the user [24].</title>
        </sec>
        <sec id="sec-4-11-8">
          <title>Cold start Through fuzzy membership new and interesting news articles are possible to be represented to the user [25].</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The news recommender system is somewhat different from
other recommender systems. It is used to provide a variety of
personalized news articles that have very short life spans. In
addition the range of the user’s interests is wide and changing
over time and contexts. These characteristics necessitate very
dynamic analyses of user profiles.</p>
      <p>In this paper the distinguishable characteristics that affect
recommendation strategies are assessed. The user feedback on
recommended items is one of them. Different algorithms of
machine learning (that fall into the categories of supervised and
unsupervised) are discussed to build up user profiles. On the
other hand, as the user profile is dependent on the whole
framework of filtering methods, the techniques are also studied.
They utilize user profiles in diverse ways which affect the
accuracy of the corresponding recommendations.</p>
    </sec>
    <sec id="sec-6">
      <title>References</title>
      <p>[1] Ricci, F., et al., Recommender Systems Handbook. 2010:</p>
      <p>Springer-Verlag New York, Inc. 842.
[2] Özgöbek, Ö., J. A. Gulla,, R. C. Erdur, A Survey on
Challenges and Methods in News Recommendation, in In
Proceedings of the 10th International Conference on Web
Information System and Technologies April 2014:
Barcelona.
[3] Bouneffouf, D., Towards User Profile Modelling in</p>
      <p>Recommender System. 2013.
[4] Gabrilovich, E. and S. Markovitch, Computing semantic
relatedness using Wikipedia-based explicit semantic
analysis, in Proceedings of the 20th international joint
conference on Artifical intelligence. 2007, Morgan
Kaufmann Publishers Inc.: Hyderabad, India. p. 1606-1611.
[5] Jon Atle Gulla, J.E.I., Arne Dag Fidjestøl, John Eirik
Nilsen, Kent Robin Haugen, and Xioameng Su, Learning
User Profiles in Mobile News Recommendation. Journal of
Print and Media Technology Research, September 2013.</p>
      <p>Vol II, No. 3: p. pp. 183-194.
[6] Jon Atle Gulla, A.D.F., Xiaomeng Su and Humberto
Castejon, Implicit User Profiling in News Recommender</p>
      <sec id="sec-6-1">
        <title>Systems, in In Proceedings of the 10th International</title>
        <p>Conference on Web Information System and Technologies
April 2014: Barcelona.
[7] Abel, F., et al., Analyzing user modeling on twitter for
personalized news recommendations, in Proceedings of the
19th international conference on User modeling, adaption,
and personalization. 2011, Springer-Verlag: Girona, Spain.
p. 1-12.
[8] Adomavicius, G. and A. Tuzhilin, Context-aware
recommender systems, in Proceedings of the 2008 ACM
conference on Recommender systems. 2008, ACM:
Lausanne, Switzerland. p. 335-336.
[9] Liu, J., et al., Personalized news recommendation based on
click behavior, in Proceedings of the 15th international
conference on Intelligent user interfaces. 2010, ACM:
Hong Kong, China. p. 31-40.
[10] Bellogín, A., et al. Discovering Relevant Preferences in a
Personalised Recommender System using Machine
Learning Techniques. in Preference Learning Workshop
(PL 2008), at the 8th European Conference on Machine
Learning and Principles and Practice of Knowledge
Discovery in Databases (ECML PKDD 2008). 2008.
[11] Witten, I.H., E. Frank, and M.A. Hall, Data Mining:
Practical Machine Learning Tools and Techniques. 2011:
Morgan Kaufmann Publishers Inc. 664.
[12] VENKATADRI.M, L.C.R. A Comparative Study On
Decision Tree Classification Algorithms In Data Mining.
2010; Available from:
https://www.academia.edu/1374211/A_Comparative_Study
_On_Decision_Tree_Classification_Algorithms_In_Data_
Mining.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Pazzani</surname>
            ,
            <given-names>M.J.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Billsus</surname>
          </string-name>
          ,
          <article-title>Content-based recommendation systems, in The adaptive web, B</article-title>
          . Peter,
          <string-name>
            <given-names>K.</given-names>
            <surname>Alfred</surname>
          </string-name>
          , and N. Wolfgang, Editors.
          <year>2007</year>
          , Springer-Verlag. p.
          <fpage>325</fpage>
          -
          <lpage>341</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Deokar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <article-title>WEIGHTED K NEAREST NEIGHBOR</article-title>
          .
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Webb</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and D.</given-names>
            <surname>Billsus</surname>
          </string-name>
          ,
          <article-title>Machine Learning for User Modeling. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <year>2001</year>
          .
          <volume>11</volume>
          (
          <issue>1-2</issue>
          ): p.
          <fpage>19</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Manning</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.D.</surname>
          </string-name>
          , et al., Introduction to Information Retrieval.
          <year>2008</year>
          : Cambridge University Press. 496.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Prügel-Bennett</surname>
            ,
            <given-names>M.A.G.a.A.</given-names>
          </string-name>
          ,
          <article-title>Building Switching Hybrid Recommender System Using Machine Learning Classifiers and Collaborative Filtering</article-title>
          .
          <source>IAENG International Journal of Computer Science.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Margaritis</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <source>Learning Bayesian Network Model Structure</source>
          .
          <year>2003</year>
          , University of Pittsburgh.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Barber</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <source>Bayesian Reasoning and Machine Learning</source>
          .
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Peretto</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <article-title>An Introduction to the Modeling of Neural Networks</article-title>
          .
          <year>1992</year>
          : Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Kotsiantis</surname>
            ,
            <given-names>S.B.,</given-names>
          </string-name>
          <article-title>Supervised Machine Learning: A Review of Classification Techniques</article-title>
          ,
          <source>in Proceedings of the 2007 conference on Emerging Artificial Intelligence Applications</source>
          in Computer Engineering:
          <article-title>Real Word AI Systems with Applications in eHealth, HCI</article-title>
          , Information Retrieval and
          <string-name>
            <given-names>Pervasive</given-names>
            <surname>Technologies</surname>
          </string-name>
          .
          <year>2007</year>
          , IOS Press. p.
          <fpage>3</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Bian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.,
          <article-title>Exploiting User Preference for Online Learning in Web Content Optimization Systems</article-title>
          .
          <source>ACM Trans. Intell. Syst. Technol.</source>
          ,
          <year>2014</year>
          .
          <volume>5</volume>
          (
          <issue>2</issue>
          ): p.
          <fpage>1</fpage>
          -
          <lpage>23</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Rajaraman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.D.</given-names>
            <surname>Ullman</surname>
          </string-name>
          , Mining of Massive Datasets.
          <year>2011</year>
          : Cambridge University Press. 326.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Gunawardana</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . and
          <string-name>
            <given-names>C.</given-names>
            <surname>Meek</surname>
          </string-name>
          ,
          <article-title>A unified approach to building hybrid recommender systems</article-title>
          ,
          <source>in Proceedings of the third ACM conference on Recommender systems</source>
          .
          <year>2009</year>
          , ACM: New York, New York, USA. p.
          <fpage>117</fpage>
          -
          <lpage>124</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Mouton</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <article-title>Unsupervised Word Sense Induction from Multiple Semantic Spaces with Locality Sensitive Hashing</article-title>
          , in International Conference RANLP.
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Burke</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <source>Hybrid Recommender Systems: Survey and Experiments</source>
          . User Modeling and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <year>2002</year>
          .
          <volume>12</volume>
          (
          <issue>4</issue>
          ): p.
          <fpage>331</fpage>
          -
          <lpage>370</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Cotter</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Smyth</surname>
          </string-name>
          ,
          <article-title>PTV: Intelligent Personalised TV Guides</article-title>
          ,
          <source>in Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence</source>
          .
          <year>2000</year>
          , AAAI Press. p.
          <fpage>957</fpage>
          -
          <lpage>964</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Ghazanfar</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Prugel-Bennett</surname>
          </string-name>
          ,
          <article-title>An Improved Switching Hybrid Recommender System Using Naive Bayes Classifier and Collaborative Filtering</article-title>
          .
          <source>International Multiconference of Engineers and Computer Scientists (Imecs</source>
          <year>2010</year>
          ),
          <string-name>
            <surname>Vols</surname>
            <given-names>I-Iii</given-names>
          </string-name>
          ,
          <year>2010</year>
          : p.
          <fpage>493</fpage>
          -
          <lpage>502</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Basu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , H. Hirsh, and
          <string-name>
            <given-names>W.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <article-title>Recommendation as classification: using social and content-based information in recommendation</article-title>
          ,
          <source>in Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence</source>
          .
          <source>1998</source>
          , American Association for Artificial Intelligence: Madison, Wisconsin, USA. p.
          <fpage>714</fpage>
          -
          <lpage>720</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [30]
          <string-name>
            <surname>Balabanovi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , #
          <volume>263</volume>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shoham</surname>
          </string-name>
          ,
          <article-title>Fab: content-based, collaborative recommendation</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <year>1997</year>
          .
          <volume>40</volume>
          (
          <issue>3</issue>
          ): p.
          <fpage>66</fpage>
          -
          <lpage>72</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Balabanovi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and #
          <volume>263</volume>
          ,
          <article-title>An adaptive Web page recommendation service</article-title>
          ,
          <source>in Proceedings of the first international conference on Autonomous agents. 1997, ACM: Marina del Rey</source>
          , California, USA. p.
          <fpage>378</fpage>
          -
          <lpage>385</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>Markward</given-names>
            <surname>Britsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.G.</given-names>
            ,
            <surname>Michael</surname>
          </string-name>
          <string-name>
            <surname>Schmelling</surname>
          </string-name>
          ,
          <article-title>Application of the rule-growing algorithm RIPPER to particle physics analysis</article-title>
          .
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>Anatole</given-names>
            <surname>Gershman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.W.</given-names>
            ,
            <surname>Eugene</surname>
          </string-name>
          <string-name>
            <surname>Fink</surname>
          </string-name>
          , Jaime Carbonell,
          <article-title>News Personalization using Support Vector Machines</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [34]
          <string-name>
            <surname>Kyo-Joong Oh</surname>
            ,
            <given-names>W.-J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chae-Gyun</surname>
            <given-names>Lim</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ho-Jin</surname>
            <given-names>Choi</given-names>
          </string-name>
          ,
          <article-title>Personalized News Recommendation using Classified Keywords to Capture User Preference</article-title>
          ,
          <source>in Advanced Communication Technology (ICACT)</source>
          .
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>