<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Evaluation and user acceptance issues of a Bayesian classifier based TV Recommendation System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Benedikt Engelbert</string-name>
          <email>b.engelbert@hs-osnabrueck.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Karsten Morisse</string-name>
          <email>k.morisse@hs-osnabrueck.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Applied Sciences</institution>
          ,
          <addr-line>Osnabrück, Artilleriestr. 46, 49076 Osnabrück, +49 541 969 3262</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Applied Sciences</institution>
          ,
          <addr-line>Osnabrück, Barbarastr. 16, 49076 Osnabrück, +49 541 969 3615</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2012</year>
      </pub-date>
      <volume>9</volume>
      <issue>2012</issue>
      <abstract>
        <p>Nowadays there is a variety of TV channels and programs. This seems to be an advantage for the TV user, but in most cases the user is overwhelmed and not able to choose the most appropriate content though. Assistive systems are needed to support the user in selecting the most appropriate content regarding the user's interests. The research group Next Generation PVR faced the task to develop a user supporting Personal Video Recorder (PVR) in the form of a Bayesian classifier based recommendation system. The work on the prototype of the system is almost done. This paper focuses on the evaluation of the given system. We are presenting two types of evaluation scenarios as well as an approach for measuring user acceptance of a TV recommendation system. Within the evaluation, the acceptance will be questioned. In addition, the results of both scenarios and of the user acceptance survey are presented and discussed.</p>
      </abstract>
      <kwd-group>
        <kwd>Recommendation System</kwd>
        <kwd>Television</kwd>
        <kwd>Evaluation</kwd>
        <kwd>Bayesian classifier</kwd>
        <kwd>User Acceptance</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Kai-Christoph Hamborg
University of Osnabrück</p>
      <p>Seminarstr. 20
49069 Osnabrück
+49 541 969 4703</p>
    </sec>
    <sec id="sec-2">
      <title>1. INTRODUCTION</title>
      <p>
        The way of consuming media clearly changed in recent years.
Especially in times of high bandwidth and loads of appropriate
media delivering services, the Internet plays an important role in
cases of consuming audio/video content. However, live television
is still the most popular media. In Germany 97% of all households
posses a TV set with an average use of 220 minutes a day [1].
Therefore it is evident that the television market is still interesting
for broadcasters. The satellite operator ASTRA holds up to 1700
TV channels just for the region of Germany. Regardless of the
encrypted and shopping program, 53 is a realistic number of
receivable TV channels [2]. For the TV user it is quite difficult to
handle the enormous offer of content. In most cases extensive TV
guides list just a limited number of TV channels and often only
popular ones. The user will invest time to get an overview of all
the available content. Due to too much effort most of the users
focus on favored or popular TV channels and the most interesting
content regarding the user’s interests remains unnoticed. Because
of this, a user supporting Personal Video Recorder based on a
Bayesian classifier has been presented in [
        <xref ref-type="bibr" rid="ref1">3</xref>
        ] to generate
personalized TV recommendations and to counteract the problems
in the given TV landscape from a user perspective. For the sake of
missing evaluation results, it was not possible to give a statement
about the quality of recommendations. In this paper we present
two evaluation scenarios to measure the quality of the developed
Bayesian classifier based TV recommendation system. The
quality within a recommendation system is obviously one of the
most important facts to be determined. Nevertheless in a user
supporting system the question about user acceptance is just as
important. For this reason this paper describes also an approach of
measuring user acceptance for recommendation systems and the
associated results for the given system. It will be shown, that the
quality of recommendations and the user acceptance are strongly
related. To present our work, the paper is organized as follows:
section 2 gives an overview of related work on recommendation
system with the focus on multimedia and TV systems. Within
section 3, a short overview about the evaluated system is
presented. We just explain the parts of the system, which are
necessary for the work of the evaluation. Section 4 is divided into
a part where both of the evaluation scenarios are explained and
where our approach of measuring user acceptance is presented.
The results of the evaluation are described and discussed in
section 5. The paper ends with a conclusion and future work.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2. RELATED WORK</title>
      <p>
        Recommendation systems (RS) exist in several application areas
and for different types of content. One of the most common
examples is Amazon.com. Within the online shop, Amazon
recommends items like Books or CDs on the basis of already
purchased items. For this, Amazon is using an item-to-item based
approach, where relations between items get calculated [
        <xref ref-type="bibr" rid="ref2">4</xref>
        ]. Also
in the area of multimedia applications RS are common. YouTube
for instance is a popular online video broadcaster with million
queries every day. The YouTube RS generates recommendations
for related video content by analyzing user activities on the portal
website [
        <xref ref-type="bibr" rid="ref3">5</xref>
        ]. With the intention to counteract the enormous offer of
video content on YouTube, the work in [
        <xref ref-type="bibr" rid="ref4">6</xref>
        ] presents a mobile RS
based on an extended Bayesian classifier. Also in other fields of
RS the Bayesian classifier has been proofed as an efficient and
proper working method. In 1996 Billsus and Pazzani presented
their work on classifying web sites using the Bayes classifier [
        <xref ref-type="bibr" rid="ref5">7</xref>
        ].
The main goal of the work was to classify web sites automatically
into the classes like or dislike. This is done by extracting
information from the given HTML tags within the source code.
The accuracy measurement of the classification process gained up
to 81% of properly classified web sites. The work points out, that
an increasing database does not yield in an increasing accuracy of
the classification process. A similar effect for a Bayesian classifier
approach has been reported in [
        <xref ref-type="bibr" rid="ref6">8</xref>
        ]. The work describes a system
for classifying Arabic text documents. For an evaluation a data set
with increasing number of words has been created. The results
have shown that the accuracy of correct classified documents is
the highest in a number of 800 words with 74%. With an
increasing number of words the accuracy continuously decreases.
This information needs to be carefully respected in our evaluation
framework described in section 4. Furthermore, in the context of
television, recommendation systems are also available. Already in
1998 [
        <xref ref-type="bibr" rid="ref7">9</xref>
        ] discusses fundamental ideas and methods for RS in the
TV application area. Das and Herman characterize the use of two
different user profiles, which are influenced by the implicit and
explicit behavior of a TV user. Those user profiles are also
considered in our work. A more concrete scenario is described in
the work of Gutta et. al. [
        <xref ref-type="bibr" rid="ref8">10</xref>
        ]. The paper describes an intelligent
TV guide, which generates personalized TV recommendations.
Those recommendations are generated on an adaptive user profile,
where search requests for TV content of a user are captured. For
the recommendation process a Bayesian classifier is also used. In
[
        <xref ref-type="bibr" rid="ref9">11</xref>
        ] a personalized Electronic Program Guide (EPG) is described.
Through user interactions with a Set-Top-Box, a user profile can
be derived to generate personalized recommendation. Users are
assigned to a user group on which basis the recommendations
depend. An evaluation showed that the accuracy for this approach
runs against 70% at maximum. A popular example in the TV area
is the Set-Top-Box System TiVo [
        <xref ref-type="bibr" rid="ref10">12</xref>
        ]. The TiVo approach is
similar to the Amazon item-to-item recommendation process
described in [
        <xref ref-type="bibr" rid="ref2">4</xref>
        ], where the TiVo user base rates TV content. With
the use of those ratings, the system tries to find related and similar
TV items. Problematic is the state in the beginning period of the
system. An item-to-item approach needs a certain number of
ratings before the system is able to generate accurate
recommendations. So the TiVo system is using a context aware
Bayesian classifier to counteract those circumstances. The work in
[
        <xref ref-type="bibr" rid="ref10">12</xref>
        ] speaks of accurate, but internal evaluation results. That’s why
it isn’t possible to name any results at this point.
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. SYSTEM OVERVIEW</title>
      <p>
        The following section describes the system architecture of the
developed system. In section 2 several approaches have been
presented, where some ideas were also considered for the
following system architecture. In Figure 1 the components for the
recommendation process within the system are shown. The main
functionality of the Bayesian classifier is to classify given content
objects into the classes like or dislike. So the classifier is based on
a two-class decision model, where the conditional probability that
a certain content fits into one of the classes like or dislike gets
calculated. Within the calculation the classifier compares object
attributes and counts how often they occur in one of the pre
defined classes. For a detailed description of the classifier
calculation we refer to our early work [
        <xref ref-type="bibr" rid="ref1">3</xref>
        ]. The object attributes
are derived from a given EPG data set the system uses. More
specifically the system uses generic DomainObjects, which are
described for each application by metadata. In the given
application, the metadata is derived from the EPG data with the
following attributes: TV Channel, Title, Subtitle, Category (e.g.
Movie, show), Genre (e.g. Comedy), Actors, Description, Year.
With the generic data model it is possible to use the classifier also
in other application areas.
      </p>
      <p>Initial
Adaptive</p>
      <p>User
Profiles
Bayesian
classifier</p>
      <p>Class dislike</p>
      <p>
        Class like
For the classification process a database is needed on which the
calculation depends. The system provides for this purpose User
Profiles. There are two types of user profiles: initial and adaptive
user profile. The initial profile is important in the beginning of the
system usage. In section 2 the cold start problem has already been
discussed. The user can fill the initial profile with TV content he
or she likes to get recommendation also in the beginning period of
the system. For this purpose, the system provides a web-based TV
guide. The initial profile is just created once, whereas the adaptive
profiles gets build up continuously during the system use. So it is
planned that the adaptive profile grows over the time and the
influence as a database for the recommendation process increases.
This means that, conversely, the influence of the initial profile
decreases. There is the assumption that the higher the adaptive
profile the better the generated recommendations. It remains an
open question how the adaptive profile is created. We are dividing
between implicit and explicit feedback, which is based on the
work of Das and Herman in [
        <xref ref-type="bibr" rid="ref7">9</xref>
        ]. With an explicit feedback the
user consciously expresses whether he or she likes a certain TV
content or not (e.g. rating a TV content). The implicit feedback
bases on the viewing behavior of the user. For example, if a user
watched a whole TV content the system assumes, that the user
likes the content. Otherwise the system would register the content
as disliked. In this context the work of e.g Hu et. al. in [
        <xref ref-type="bibr" rid="ref17">19</xref>
        ] should
be mentioned. The paper deals with the profound analysis of
implicit feedback, however, it describes the use of a much more
complex model. The improvement of our model could be future
work. It is important to say, that the stored DomainObjects within
the user profiles are assigned to a class like or dislike. This is
necessary for the calculation mentioned at the beginning of this
section.
      </p>
    </sec>
    <sec id="sec-5">
      <title>4. EVALUATION</title>
      <p>In section 3 the main components with associated functionality of
the system architecture has been described. We just focused on
those parts of the system, which are important for the evaluation
process in the following section. First we explain two evaluation
scenarios. The first one is an online scenario to reach a high
number of potential volunteers. The second one is a more realistic
scenario, where the volunteers write down their TV habits in a TV
diary. In section 4.2 we describe an approach to measure user
acceptance for the given system. The approach is based on the
Technology Acceptance Model (TAM), which characterizes the
relationship between information systems and user acceptance.</p>
    </sec>
    <sec id="sec-6">
      <title>4.1 Scenarios</title>
      <sec id="sec-6-1">
        <title>4.1.1 Online Scenario</title>
        <p>An important question at the beginning of the scenario design is at
what point of the recommendation process the quality can be
measured. As pointed out before, the system uses a Bayesian
classifier, which classifies TV content in one of the classes like or
dislike. In the long term, the main database for generating
recommendations is the adaptive user profile, which is gathered
by the feedback of the user during the use of the system. The idea
for the online scenario is to simulate the creation of an adaptive
user profile on which basis the user gets personalized
recommendations. We consider at least five steps where the user
fills the adaptive profile and the system generates
recommendations. After every step the user needs to rate each of
the given recommendations. Every step represents one week of a
user’s watching behavior, which means that the system generates
recommendations from one week of EPG data. The rating of the
recommendations is more a verification of the classification the
system did. The system presents 20 TV items where ten of these
items belong to the class like and ten to the class dislike. The
items are displayed to the user without class affiliation. The user
needs to perform a classification on his/her own. In the back of the
process the system compares both classifications and stores the
classification of the user within the adaptive profile for the next
step. At the end of the scenario there is the sixth final
classification step, where the system calculates the quality of the
classification. To measure the quality, we are using the F1-score,
which can be interpreted as the weighted average of precision and
recall values. Precision and recall is a common metric for
assessing the accuracy of classification systems. To meet the
requirements of the evaluation, the relationship between temporal
development of the adaptive user profile and the quality of the
system classification needs to be respected. For this reason, every
evaluation user needs to do overall six steps, so that all users need
the same length for the scenario and have the same requirements.
Internally, every volunteer gets randomly another number of
steps. Thus, the completion of the adaptive profile varies between
one and five weeks, so that just the data for that random count of
steps gets stored. At the end of the evaluation we can compare the
calculated quality between the varying lengths of the scenario. At
the beginning of the scenario an initial profile by the user with ten
objects will be created.
4.1.2 TV diary
The measurement of quality is similar to the scenario described in
section 4.1.1. Significant is the development of the adaptive user
profile. Where the Online Scenario was more a simulation of data
creation, whereas the TV diary deals with real watched TV
content. The participants watch television as usual. They need to
write down all the watched TV content in a TV diary with the
name of a series, date and time so that the system can find the
watched content within the given EPG data. For the participants it
is also mandatory to write down TV content they began to watch,
but switched the channel in case of dissatisfaction. Zapping
behavior isn’t respected due to the fact that the system wouldn’t
do either. The participants write down their viewing behavior for
five weeks. After that, the evaluation supervisor collects all the
diaries and transfers the diary data to the system. On the basis of
the data, the system generates recommendations. Each participant
needs to rate the system recommendations by dividing the
displayed and unordered TV content into the classes like and
dislike. The classification of the system and the participant are
compared and on this basis the quality gets calculated. This is
equally done within the sixth step in the Online Scenario (cp.
section 4.1.1.). An initial profile won’t be created.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>4.2 User Acceptance</title>
      <p>
        Measuring user acceptance for a software system is related to the
field of psychology. It is necessary to figure out the conditions for
a software system, which result in an actual use of the system.
Already in 1989 Davis presented the Technology Acceptance
Model (TAM). The TAM is an information-theoretic model,
which defines the Variables Perceived Usefulness (PU) and
Perceived Ease of Use (PEOU). These two variables are defined
as the main factors influencing the use of a system. Davis defines
the variables as follows:
Perceived Usefulness „the degree to which a person believes that
using a particular system would enhance his or her job
performance“ cp. [
        <xref ref-type="bibr" rid="ref11">13</xref>
        ] p. 320
Perceived Ease of Use „ the degree to which a person believes
that using a particular system would be free of effort“ cp. [
        <xref ref-type="bibr" rid="ref11">13</xref>
        ] p.
320.
      </p>
      <p>
        There is a lot of research on User Acceptance Issues in
recommender systems cp. [
        <xref ref-type="bibr" rid="ref12">14</xref>
        ][
        <xref ref-type="bibr" rid="ref13">15</xref>
        ]. Especially the work of Jones
and Pu in [
        <xref ref-type="bibr" rid="ref12">14</xref>
        ] about User Acceptance Issues in Music
recommender systems is interesting for the work presented in this
paper due to similar conditions. Jones and Pu adapted the TAM
for the use in music recommendation systems by defining the
variables more Application-specific. They pointed out, that 1) the
entertainment due to given recommendations, 2) the correct
adaption of the user feedback and 3) the entirety of the given data
base is important for Perceived Usefulness. The factors for
Perceived Ease of Use Jones and Pu name Usability and Effort
until the system works properly. Caused by the fact, that we are
using a simulation within the evaluation process, the point of
Usability has been almost discarded. Only the Usability of the
creation of the initial user profile can be questioned. We presented
an adapted questionnaire from the given work in [
        <xref ref-type="bibr" rid="ref12">14</xref>
        ] with 20
items at the end of the evaluation. The user needed to apply a five
step Likert scale from 1 (totally disagree) to 5 (totally agree). The
items were formulated as positive or negative statements so that
the evaluation user could answer with the given Likert scale.
F1 I liked the TV content which has been recommended to me
The TV content which has been recommended to me tailored my
taste
      </p>
      <sec id="sec-7-1">
        <title>The TV content which has been recommended were new to me</title>
      </sec>
      <sec id="sec-7-2">
        <title>F4 I liked the TV content I already knew</title>
      </sec>
      <sec id="sec-7-3">
        <title>F5 In general I was satisfied with the recommendations</title>
        <p>F2
F3
F6
F7
F8</p>
        <p>The recommendations were as good as the recommendations from
my friends</p>
      </sec>
      <sec id="sec-7-4">
        <title>Many recommendations were too similar to each other</title>
      </sec>
      <sec id="sec-7-5">
        <title>The system has a huge selection of possible content</title>
      </sec>
      <sec id="sec-7-6">
        <title>F9 I like that the system identifies my taste</title>
        <p>F10 Ifeceadnbaincfkluence the quality of the recommendations with my</p>
      </sec>
      <sec id="sec-7-7">
        <title>F11 The system recognizes my taste</title>
        <p>F12 I know how the system generates recommendations after I’ve used it
F13 The time the system needs to generate recommendations is
appropriate
F14 tFimorethe creation of the initial profile I needed to spend too much
F15 The creation of the initial Profile was easy and comfortable
F16 Irewcoomulmdecnredaatteiotnhse fiansittial profile again to get proper</p>
      </sec>
      <sec id="sec-7-8">
        <title>F17 The system asks me for my television watching too much</title>
        <p>F18 I(fe.tgh.ebreooisksa)n,oIthweorutledchunseoliotgy which recommends other things to me</p>
      </sec>
      <sec id="sec-7-9">
        <title>F19 I think the system is useful and I would use it again F20 I think the system is useful choosing interesting TV content</title>
        <p>Table 1 Questionnaire User Acceptance</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>5. RESULTS</title>
      <p>In the following section we present the evaluation results. The
section is again divided into two subsections. Within section 5.1
we present the results for the evaluation scenarios; in section 5.2
the evaluated user acceptance questionnaire is presented. The
results will be discussed in section 6.</p>
    </sec>
    <sec id="sec-9">
      <title>5.1 Evaluation Results</title>
      <sec id="sec-9-1">
        <title>5.1.1 Online Scenario</title>
        <p>For the Online Scenario we had 51 participants, where nine of
them didn’t finish the evaluation. So the number can be reduced to
42 with 34 male and eight female participants. A large part of the
group can be classified as educated (higher education level or
university degree) with an age between 18 and 39. Table 2
presents the results for the Online Scenario. The data were tested
for independence using the chi square (χ2) test with a significance
level of 5% (α=0.05). We’ve proved the calculated chi square
value against the quantile of the chi square distribution p=3.84
respecting the significance level of α=0.05 and degree of freedom
df=1.</p>
        <p>Iterat.</p>
        <p>Participants</p>
        <p>Objects
The table shows the iteration number and the associated number
of participants to those the iteration number was randomly
assigned. The column number Objects presents the size of the
adaptive user profile e.g. for the scenario with one iteration the
adaptive user profile stored 30 TV objects like films, series, etc..
In the fourth column the quality in form of the calculated F1 score
is shown. As you can see at first the quality increases steadily.
The system achieved a maximum quality with a database of 70
objects. With a more increasing data volume the quality decreases.
5.1.2 TV diary
For the TV diary eight people attended, mostly between 18 and 39
years and with a high educational level. The number of
participants decreases from ten to eight caused by incomplete TV
diaries. Table 3 presents the results for the TV diary equally to the
results in section 5.1.1. Due to a limited number of
participants/data records we used Fisher’s exact test proving
independence of the data set. We have no quality data for the 5th
iteration because of insignificant test results. In contrast to the
Online Scenario the quality increases continuously. The highest
value in iteration four is similar to the highest quality in the
Online Scenario.</p>
        <p>Iterat.</p>
        <p>Participants</p>
        <p>F1 Score</p>
        <p>p-Value
30
50
70
90
110</p>
        <p>F1 Score</p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>5.2 User Acceptance Results</title>
      <p>
        The following section presents the results of the proposed user
acceptance approach. In Figure 2 the median of the 20 items are
shown. Negative formulated questions are marked with a dot on
the top of the bar.
Figure 2 shows high results within the questionnaire, which is an
indication for general user acceptance. Negated questions were
overall answered with a low valence, nevertheless the results tend
into the right direction. For a closer look we refer to Table 1
including the question overview. The questions are divided into
three sections asking for Quality, Effort and Acceptance as
already mentioned in section 4.2. To analyze the results, we
interpret every subarea separately. Items one till twelve concern
the quality of the given recommendations. The questioned quality
within the questionnaire is a rather subjective interpretation of
each user, than an objective measurement. Especially the items
1,2,4 and 5 question the perceived quality by the user. All of the 4
items were rated with a minimum value of 4, which tends to be a
positive feedback. Item 6 questioned if the quality of the
recommendations correspond to the quality of TV
recommendations from friends. The middle value of 3 is a little
low, but to reach human interpretation level for the system wasn’t
a compound goal at all. Item 12 questioned the transparency of the
system, which is with a value of 3 also a little low. However items
8 to 11 were with a value of 4 approvingly answered. On the basis
of the given results with agreeing (4) and fully agreeing (5)
feedback we can say that a certain quality of the system
recommendations is given, which influences the user acceptance
positively. For the subarea of effort we questioned the items 13 to
17. Those items questioned the fact of needed effort for using the
system properly. We got a positive feedback for the items 14 to 17
with a value of 4 (agree) and 2 (not agree) for negative formed
questions. These items were questioned for the construction of the
initial feedback. The creation of the initial feedback is the only
direct interaction between user and system needed to get
recommendations presented. Just the time for creating new
recommendations by the system was rated with a neutral value
(3). So we can say that the effort for using the system is limited,
which is also a positive influence for user acceptance. Within the
subarea of acceptance for recommender systems in general and for
the tested system all questions were approvingly answered (4).
These results also tend to a positive user acceptance. These results
have proven that a general user acceptance for the system is given.
We have some particular shortcomings in comparison to
recommendations by friends. In this case the system could be
advanced by a connection to one of the big social networks like
Facebook or Twitter. A concrete approach for this was already
presented in one of our previous works cp. [
        <xref ref-type="bibr" rid="ref14">16</xref>
        ], but not respected
within the evaluation.
      </p>
    </sec>
    <sec id="sec-11">
      <title>6. DISCUSSION</title>
      <p>
        In the following section the results from the evaluation presented
in section 5 are discussed. We proposed two different approaches
for evaluating a Bayesian classifier based recommendation
system. Both approaches differ in the way data for the generating
process was collected. Within the Online Simulation data was
collected in a continuous manner. The adaptive profiles within the
TV diary were created more qualitatively with a real user
behavior. However, both scenarios were designed to examine the
quality of given recommendations and to examine if there is
connection between increasing database (adaptive user profile)
and generated TV recommendations. First of all we can say that
the system classifies objects with a quality between 81% and 83%
with a database between 56 and 70 already classified objects.
Because of this, it is clear that a sufficiently large database is
needed to reach a proper quality. Both result tables show that
there is an increasing quality with an increasing number of
classified data objects. Table 2 shows a decreasing quality at a
high number of data objects though. This effect has been observed
in other application areas, where also a Bayesian classifier has
been used for classification. In [
        <xref ref-type="bibr" rid="ref15">17</xref>
        ] the classification of Arabic
text documents is described. Also in this research a large database
causes a decreasing quality. The work of Billsus and Pazzani on
classifying spam mails described a decreasing quality at one point
of the evaluation [
        <xref ref-type="bibr" rid="ref16">18</xref>
        ]. It can be assumed that the effect just occurs
within the online scenario. The Set-Top-Box prototype provides a
mechanism, so that older data objects are deleted. That means the
database needs to be updated and shouldn’t exceeds a certain
threshold. A number between 56 and 70 data objects has been
determined within our evaluation. The evaluation has also shown
that the quality is at a proper level, but can be increased. One
approach proposed in section 5.2 is the integration of social
networking to satisfy the user with recommendations done by
friends. As mentioned the approach is already implemented, but
not evaluated at all. A resulting increase in quality is thus just a
hypothesis. It would also be possible to increase the quality by
interpreting the attributes e.g. a series name synonymously. For
this it is possible to implement a thesaurus or ontology.
      </p>
    </sec>
    <sec id="sec-12">
      <title>7. CONCLUSION</title>
      <p>
        In this paper we presented an evaluation for a Bayesian classifier
based recommendation system for generating TV content on the
basis of user behavior. In addition to two evaluation scenarios,
which measure the quality of recommendations, we presented
results of user acceptance evaluation based on the work of [
        <xref ref-type="bibr" rid="ref12">14</xref>
        ].
With a correct classification rate up to 83% the system reached a
good quality and fulfills the demanded requirements. The results
of the user acceptance gave a good feedback for the acceptance
and the actual use of the system. Even though we achieved a good
quality, there is still space for improvements (cp. section 6). Our
future work concentrates on the evaluation of the social
recommendation approach.
      </p>
    </sec>
    <sec id="sec-13">
      <title>8. REFERENCES</title>
      <p>[1] B. Engel, C.-M. Ridder, “Massenkommunikation 2010”,</p>
      <p>Press Conference of ARD, ZDF, Sep. 2010.
[2] SES Astra. Available at http://www.astra.de/2117/de</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Engelbert</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blanken</surname>
            ,
            <given-names>M. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kruthoff-Brüwer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Morisse</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2011</year>
          .
          <article-title>A user supporting Personal Video Recorder by implementing a generic Bayesian classifier based recommendation system</article-title>
          .
          <source>In Proceedings of 7th IEEE International Workshop on PervasivE Learning</source>
          , Life and Leisure (Seattle, WA, USA, March
          <volume>21</volume>
          - 25,
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Linden</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , and York,
          <string-name>
            <surname>J.</surname>
          </string-name>
          <year>2003</year>
          . Amazon.com Recommendations. IEEE Press.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Davidson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liebald</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nandy</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Vleet</surname>
            ,
            <given-names>T.V.</given-names>
          </string-name>
          <year>2010</year>
          .
          <article-title>The YouTube Video Recommendation System</article-title>
          .
          <source>In ACM Conference Proceedings on Recommender Systems (Barcelona</source>
          , Spain,
          <year>September 2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>De</given-names>
            <surname>Pessemier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Deryckere</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            , and
            <surname>Martens</surname>
          </string-name>
          ,
          <string-name>
            <surname>L.</surname>
          </string-name>
          <year>2010</year>
          .
          <article-title>Extending the Bayesian Classifier to a Context-Aware Recommender System for Mobile Devices</article-title>
          . In Fifth International Conference on Internet and
          <article-title>Web Applications</article-title>
          and
          <string-name>
            <surname>Services (Barcelona</surname>
          </string-name>
          , Spain,
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Billsus</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Pazzani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>1996</year>
          .
          <article-title>Revising User Profiles: The Search for Interesting Web Stites</article-title>
          .
          <source>In Proceedings of 3rd International Workshop on Multistrategy Learning.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Thabtah</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eljinini</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zamzeer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Hadi</surname>
            ,
            <given-names>W. M.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Naive Bayesian Based on Chi Squere to Categorize Arabic Data</article-title>
          .
          <source>Communications of the IBIMA (10).</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and Herman, t. H.
          <year>1998</year>
          .
          <article-title>Recommender Systems for TV</article-title>
          .
          <source>Eindhoven: American Association for Artificial Intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Gutta</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurapati</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martino</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Milanski</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schaffer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al.
          <year>2000</year>
          .
          <article-title>TV Content Recommender System</article-title>
          .
          <source>Briarcliff Manor: American Association of Artificial Intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Ardissono</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gena</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torasso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellifemine</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chiarotto</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Difino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et. Al.
          <year>2003</year>
          .
          <article-title>Personalized Recommendation of TV Programs</article-title>
          .
          <source>In 8th Advanced in Artificial Intelligence Conference.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Kamal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp; van
          <string-name>
            <surname>Stam</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2004</year>
          .
          <article-title>TiVo: Making Show Recommendations Using a Distributed Collaborative Filtering Architecture</article-title>
          .
          <article-title>ACM Knowledge Discovery and Data Mining</article-title>
          . Seattle, Washington: ACM.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>F. D.</given-names>
          </string-name>
          <year>1989</year>
          .
          <article-title>Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology</article-title>
          .
          <source>MIS Quaterly</source>
          Vol.
          <volume>13</volume>
          , pp.
          <fpage>319</fpage>
          -
          <lpage>339</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pu</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>User Acceptance Issues in Music Recommender Systems</article-title>
          .
          <source>EPFL Technical Report. Lausanne.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pu</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Acceptance Issues of Personalitybased Recommender Systems</article-title>
          .
          <source>3rd ACM Conference on Recommender Systems</source>
          (pp.
          <fpage>221</fpage>
          -
          <lpage>224</lpage>
          ). New York: ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Engelbert</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blanken</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kruthoff-Brüwer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Morisse</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2011</year>
          .
          <article-title>A User Supporting Personal Video Recorder Based on a Generic Bayesian Classifier and Social Network Recommendations</article-title>
          .
          <string-name>
            <given-names>J.J.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.T.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          Lee (Eds.):
          <source>Future Tech</source>
          <year>2011</year>
          , Proceedings,
          <source>Part II. Communications in Computer and Information Science</source>
          , Vol
          <volume>185</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Thabtah</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eljinini</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zamzeer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hadi</surname>
            ,
            <given-names>W. M.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Naive Bayesian Based on Chi Squere to Categorize Arabic Data</article-title>
          .
          <source>Communications of the IBIMA (10).</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Billsus</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pazzani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>1996</year>
          .
          <article-title>Revising User Profiles: The Search for Interesting Web Sites</article-title>
          .
          <source>Proceedings of 3rd International Workshop on Multistrategy Learning.</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koren</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Volinsky</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Collaborative Filtering for Implicit Feedback Datasets</article-title>
          .
          <source>Proceedings of the 2008 8th IEEE International Conference on Data Mining</source>
          . Washington,
          <string-name>
            <surname>D.C.</surname>
          </string-name>
          , USA.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>