<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Exploiting Synergy Between Ontologies and Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>General Terms Design</institution>
          ,
          <addr-line>Experimentation</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Stuart E. Middleton, Harith Alani, Nigel R. Shadbolt, David C. De Roure Intelligence, Agents and Multimedia Group Department of Electronics and Computer Science University of Southampton Southampton</institution>
          ,
          <addr-line>SO17 1BJ</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recommender systems learn about user preferences over time, automatically finding things of similar interest. This reduces the burden of creating explicit queries. Recommender systems do, however, suffer from cold-start problems where no initial information is available early on upon which to base recommendations. Semantic knowledge structures, such as ontologies, can provide valuable domain knowledge and user information. However, acquiring such knowledge and keeping it up to date is not a trivial task and user interests are particularly difficult to acquire and maintain. This paper investigates the synergy between a web-based research paper recommender system and an ontology containing information automatically extracted from departmental databases available on the web. The ontology is used to address the recommender systems cold-start problem. The recommender system addresses the ontology's interest-acquisition problem. An empirical evaluation of this approach is conducted and the performance of the integrated systems measured.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Cold-start problem</kwd>
        <kwd>interest-acquisition recommender system</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>The mass of content available on the World-Wide Web raises
important questions over its effective use. Search engines filter
web pages that match explicit queries, but most people find
articulating exactly what they want difficult. The result is large
lists of search results that contain a handful of useful pages,
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that
copies bear this notice and the full citation on the first page. To copy
otherwise, to republish, to post on servers or to redistribute to lists,
requires prior specific permission by the authors.</p>
      <p>Semantic Web Workshop 2002 Hawaii, USA
Copyright by the authors.
defeating the purpose of filtering in the first place.</p>
      <p>
        Recommender systems [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] learn about user preferences over time
and automatically find things of similar interest, thus reducing the
burden of creating explicit queries. They dynamically track users
as their interests change. However, such systems require an initial
learning phase where behaviour information is built up to form an
user profile. During this initial learning phase performance is
often poor due to the lack of user information; this is known as
the cold-start problem [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>There has been increasing interest in developing and using tools
for creating annotated content and making it available over the
semantic web. Ontologies are one such tool, used to maintain and
provide access to specific knowledge repositories. Such sources
could complement the behavioral information held within
recommender systems, by providing some initial knowledge about
users and their domains of interest. It should thus be possible to
bootstrap the initial learning phase of a recommender system with
such knowledge, easing the cold-start problem.</p>
      <p>In return for any bootstrap information the recommender system
could provide details of dynamic user interests to the ontology.
This would reduce the effort involved in acquiring and
maintaining knowledge of people’s research interests. To this end
we investigate the integration of Quickstep, a web-based
recommender system, an ontology for the academic domain and
OntoCoPI, a community of practice identifier that can pick out
similar users.</p>
    </sec>
    <sec id="sec-2">
      <title>2. RECOMMENDER SYSTEMS</title>
      <p>
        People may find articulating what they want hard, but they are
good at recognizing it when they see it. This insight has led to the
utilization of relevance feedback [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], where people rate web
pages as interesting or not interesting and the system tries to find
pages that match the interesting, positive examples and do not
match the not interesting, negative examples. With sufficient
positive and negative examples, modern machine learning
techniques can classify new pages with impressive accuracy. Such
systems are called content-based recommender systems.
Another way to recommend pages is based on the ratings of other
people who have seen the page before. Collaborative
recommender systems do this by asking people to rate explicitly
pages and then recommending new pages that similar users have
rated highly. The problem with collaborative filtering is that there
is no direct reward for providing examples since they only help
other people. This leads to initial difficulties in obtaining a
sufficient number of ratings for the system to be useful.
Hybrid systems, attempting to combine the advantages of
contentbased and collaborative recommender systems, have proved
popular to-date. The feedback required for content-based
recommendation is shared, allowing collaborative
recommendation as well. We use the Quickstep [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] hybrid
recommender system in this paper to recommend on-line research
papers.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2.1 The Cold-start Problem</title>
      <p>
        One difficult problem commonly faced by recommender systems
is the cold-start problem [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], where recommendations are
required for new items or users for whom little or no information
has yet been acquired. Poor performance resulting from a
coldstart can deter user uptake of a recommender system. This effect is
thus self-destructive, since the recommender never achieves good
performance since users never use it for long enough. We will
examine two types of cold-start problem.
      </p>
      <p>The new-system cold-start problem is where there are no initial
ratings by users, and hence no profiles of users. In this situation
most recommender systems have no basis on which to
recommend, and hence perform very poorly.</p>
      <p>The new-user cold-start problem is where the system has been
running for a while and a set of user profiles and ratings exist, but
no information is available about a new user. Most recommender
systems perform poorly in this situation too.</p>
      <p>Collaborative recommender systems fail to help in cold-start
situations, as they cannot discover similar user behaviour because
there is not enough previously logged behaviour data upon which
to base any correlations. Content-based and hybrid recommender
systems perform a little better since they need just a few examples
of user interest in order to find similar items.</p>
      <p>No recommender system can cope alone with a totally cold-start
however, since even content-based recommenders require a small
number of examples on which to base recommendations. We
propose to link together a recommender system and an ontology
to address this problem. The ontology can provide a variety of
information on users and their publications. Publications provide
important information about what interests a user has had in the
past, so provide a basis upon which to create initial profiles that
can address the new-system cold start problem. Personnel records
allow similar users to be identified. This will address the new-user
cold-start problem by providing a set of similar users on which to
base a new-user profile.</p>
    </sec>
    <sec id="sec-4">
      <title>3. ONTOLOGIES</title>
      <p>
        An ontology is a conceptualisation of a domain into a
humanunderstandable, but machine-readable format consisting of
entities, attributes, relationships, and axioms [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Ontologies can
provide a rich conceptualisation of the working domain of an
organisation, representing the main concepts and relationships of
the work activities. These relationships could represent isolated
information such as an employee’s home phone number, or they
could represent an activity such as authoring a document, or
attending a conference.
      </p>
      <p>In this paper we use the term ontology to refer to the classification
structure and instances within the knowledge base.</p>
      <p>
        The ontology used in our work is designed to represent the
academic domain, and was developed by Southampton’s AKT
team (Advanced Knowledge Technologies [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]). It models
people, projects, papers, events and research interests. The
ontology itself is implemented in Protégé 2000 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], a graphical
tool for developing knowledge-based systems. It is populated with
information extracted automatically from a departmental
personnel database and publication database. The ontology
consists of around 80 classes, 40 slots, over 13000 instances and
is focused on people, projects, and publications.
      </p>
    </sec>
    <sec id="sec-5">
      <title>3.1 The Interest-acquisition Problem</title>
      <p>
        People’s areas of expertise and interests are an important type of
knowledge for many applications, for example expert finders [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
Semantic web technology can be a good source of such
information, but usually requires substantial maintenance to keep
the web pages up-to-date. The majority of web pages receive little
maintenance, holding information that does not date quickly.
Since interests and areas of expertise are dynamic in nature they
are not often held within web pages. It is thus particularly difficult
for an ontology to acquire such information; this is the
interestacquisition problem.
      </p>
      <p>
        Many existing systems force users to perform self-assessment to
gather such information, but this has numerous disadvantages [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
Lotus have developed a system that monitors user interaction with
a document to capture interests and expertise [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Their system
does not, however, consider the online documents that users
browse.
      </p>
      <p>This paper investigates linking an ontology with a recommender
system to help overcoming the interest acquisition problem. The
recommender system will regularly provide the ontology with
interest profiles for users, obtained by monitoring user web
browsing and analysing feedback on recommended research
papers.</p>
    </sec>
    <sec id="sec-6">
      <title>4. Related Work</title>
      <p>
        Collaborative recommender systems utilize user ratings to
recommend items liked by similar people. PHOAKS [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] is an
example of a collaborative filtering, recommending web links
mentioned in newsgroups articles. Only newsgroups with at least
20 posted web links are considered by PHOAKS, avoiding the
cold-start problems associated with newer newsgroups containing
less messages. Group Lens [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] is an alternative example,
recommending newsgroup articles. Group Lens reports two
coldstart problems in their experimental analysis. Users abandoned the
system before they had provided enough ratings to receive
recommendations and early adopters of the system received poor
recommendations until enough ratings were gathered. These
systems are typical of collaborative recommenders, where a
coldstart makes early recommendation poor until sufficient people
have provided ratings.
      </p>
      <p>
        Content-based recommender systems recommend items with
similar content to things the user has liked before. An example of
a content-based recommender is Fab [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which recommends web
pages. Fab needs a few early ratings from each user in order to
create a training set. ELFI [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] is another content-based
recommender, recommending funding information from a
database. ELFI observes users using a database and infers both
positive and negative examples of interest from this behaviour.
Both these systems are typical of content-based recommender
systems, requiring users to use the system for an initial period of
time before the cold-start problem is overcome.
      </p>
      <p>
        Personal web-based agents such as Letizia [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], Syskill &amp; Webert
[
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and Personal Webwatcher [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] track the users browsing and
formulate user profiles. Profiles are constructed from positive and
negative examples of interest, obtained from explicit feedback or
heuristics analysing browsing behaviour. They then suggest which
links are worth following from the current web page by
recommending page links most similar to the users profile. Just
like a content-based recommender system, a few examples of
interest must be observed or elicited from the user before a useful
profile can be constructed.
      </p>
      <p>
        Ontologies can be used to improve content-based search, as seen
in OntoSeek [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Users of OntoSeek navigate the ontology in
order to formulate queries. Ontologies can also be used to
automatically construct knowledge bases from web pages, such as
in Web-KB [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Web-KB takes manually labelled examples of
domain concepts and applies machine-learning techniques to
classify new web pages. Both systems do not, however, capture
dynamic information such as user interests.
      </p>
      <p>
        Also of relevance are systems such as CiteSeer [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which use
content-based similarity matching to help search for interesting
research papers within a digital library.
      </p>
    </sec>
    <sec id="sec-7">
      <title>5. THE QUICKSTEP RECOMMENDER</title>
    </sec>
    <sec id="sec-8">
      <title>SYSTEM</title>
      <p>
        Quickstep [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] is a hybrid recommender system, addressing the
real-world problem of recommending on-line research papers to
researchers. User browsing behaviour is unobtrusively monitored
via a proxy server, logging each URL browsed during normal
work activity. A nearest-neighbour algorithm classifies browsed
URL’s based on a training set of labelled example papers, storing
each new paper in a central database. The database of known
papers grows over time, building a shared pool of knowledge.
Explicit feedback and browsed URL’s form the basis of the
interest profile for each user. Figure 1 shows an overview of the
Quickstep system.
      </p>
      <sec id="sec-8-1">
        <title>World Wide Web</title>
      </sec>
      <sec id="sec-8-2">
        <title>Users</title>
      </sec>
      <sec id="sec-8-3">
        <title>Profiles</title>
      </sec>
      <sec id="sec-8-4">
        <title>Classifier</title>
      </sec>
      <sec id="sec-8-5">
        <title>Recommender</title>
      </sec>
      <sec id="sec-8-6">
        <title>Classified papers</title>
        <p>Each day a set of recommendations is computed, based on
correlations between user interest profiles and classified paper
topics. Any feedback offered by users on these recommendations
is recorded when the user looks at them. Users can provide new
examples of topics and correct paper classifications where wrong.
In this way the training set, and hence classification accuracy,
improves over time.</p>
        <p>Quickstep bases its user interest profiles on an ontology of
research paper topics. This allows inferences from the ontology to
assist profile generation; in our case topic inheritance is used to
infer interest in super-classes of specific topics. Sharing interest
profiles with the AKT ontology is not difficult since they are
explicitly represented using ontological terms.</p>
        <p>
          Previous trials [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] of Quickstep used hand-crafted initial profiles,
based on interview data, to cope with the cold-start problem.
Linking Quickstep with the AKT ontology automates this process,
allowing a more realistic cold-start solution that will scale to
larger numbers of users.
        </p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>5.1 Paper classification algorithm</title>
      <p>
        Every research paper within Quickstep’s central database is
represented using a term frequency vector. Terms are single words
within the document, so term frequency vectors are computed by
counting the number of times words appear within the paper. Each
dimension within a vector represents a term. Dimensionality
reduction on vectors is achieved by removing common words
found in a stop-list and stemming words using the Porter [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]
stemming algorithm. Quickstep uses vectors with 10-15,000
dimensions.
      </p>
      <p>
        Once added to the database, papers are classified using an IBk [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
classifier boosted by the AdaBoostM1 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] algorithm. The IBk
classifier is a k-Nearest Neighbour type classifier that uses
example documents, called a training set, added to a vector space.
Figure 2 shows the basic k-Nearest Neighbour algorithm. The
closeness of an unclassified vector to its neighbours within the
vector space determines its classification.
      </p>
      <p>____________
w(da,db) = √</p>
      <p>Σ (tja – tjb)2
j = 1..T
w(da,db) kNN distance between document a and b
da,db document vectors
T number of terms in document set
tja weight of term j document a
Classifiers like k-Nearest Neighbour allow more training
examples to be added to their vector space without the need to
rebuild the entire classifier. They also degrade well, so even when
incorrect the class returned is normally in the right
“neighbourhood” and so at least partially relevant. This makes
kNearest Neighbour a robust choice of algorithm for this task.
Boosting works by repeatedly running a weak learning algorithm
on various distributions of the training set, and then combining
the classifiers produced by the weak learner into a single
composite classifier. The “weak” learning algorithm here is the
IBk classifier. Figure 3 shows the AdaBoostM1 algorithm.</p>
      <sec id="sec-9-1">
        <title>Initialise all values of D to 1/N</title>
        <p>Do for t=1..T
call weak-learn(Dt)
calculate error et
calculate βt = et/(1-et)
calculate Dt+1</p>
        <p>1
classifier = argmax Σ log __
c ∈ C</p>
        <p>
          βt
t = all iterations
with result class c
Dt
N
T
weak-learn(Dt)
et
βt
classifier
C
class weight distribution on iteration t
number of classes
number of iterations
weak learner with distribution Dt
weak_learn error on iteration t
error adjustment value on iteration t
final boosted classifier
all classes
AdaBoostM1 has been shown to improve [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] the performance of
weak learner algorithms, particularly for stronger learning
algorithms like k-Nearest Neighbour. It is thus a sensible choice
to boost our IBk classifier.
        </p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>5.2 User profiling algorithm</title>
      <p>The profiling algorithm performs correlation between paper topic
classifications and user browsing logs. Whenever a research paper
is browsed that has been classified as belonging to a topic, it
accumulates an interest score for that topic. Explicit feedback on
recommendations also accumulates interest value for topics. The
current interest of a topic is computed using the inverse time
weighting algorithm shown in Figure 4.</p>
      <p>Topic interest =</p>
      <sec id="sec-10-1">
        <title>Interest values</title>
      </sec>
      <sec id="sec-10-2">
        <title>1..no of instances</title>
        <p>n
∑ Interest value(n) / days old(n)</p>
        <p>Paper browsed = 1
Recommendation followed = 2
Topic rated interesting = 10</p>
        <p>Topic rated not interesting = -10
An is-a hierarchy of research paper topics is held so that
superclass relationships can be used to infer broader topic interest.
When a specific topic is browsed, fractional interest is inferred for
each super-class of that topic, using a 1/2level weighting where
‘level’ refers to how many classes up the is-a tree the super-class
is from the original topic. Figure 5 shows a section from the
research paper topic ontology.
Hypermedia</p>
        <p>Agents
Belief Networks
Fuzzy
Game Theory
Genetic Algorithms
Genetic Programming
Knowledge Representation
Information Filtering
Information Retrieval
Machine Learning
Natural Language
Neural Networks
Philosophy [AI]
Robotics [AI]
Speech [AI]
Vision [AI]
Adaptive Hypermedia
Hypertext Design
Industrial Hypermedia
Literature [hypermedia]
Open Hypermedia
Spatial Hypertext
Taxonomic Hypertext
Visualization [hypertext]
Web [hypermedia]</p>
        <p>E-Commerce
Interface Agents
Mobile Agents
Multi-Agent-Systems
Recommender Systems
Ontologies
Text Classification
Content-Based Navigation
Architecture [open hypermedia]</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>5.3 Recommendation algorithm</title>
      <p>Recommendations are formulated from a correlation between the
users current topics of interest and papers classified as belonging
to those topics. A paper is only recommended if it does not appear
in the users browsed URL log, ensuring that recommendations
have not been seen before. For each user, the top three interesting
topics are selected with 10 recommendations made in total. Papers
are ranked in order of the recommendation confidence before
being presented to the user. Figure 6 shows the recommendation
algorithm.</p>
      <p>Recommendation confidence = classification confidence *
topic interest value</p>
    </sec>
    <sec id="sec-12">
      <title>6. ONTOCOPI</title>
      <p>
        The Ontology-based Communities of Practice Identifier
(OntoCoPI) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is an experimental system that uses the AKT
ontology to help identifying communities of practice (CoP). The
community of practice of a person is taken here to be the closest
group of people, based on specific features they have in common
with that given person. A community of practice is thus an
informal group of people who share some common interest in a
particular practice [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. Workplace communities of practice
improve organisational performance by maintaining implicit
knowledge, helping the spread of new ideas and solutions, acting
as a focus for innovation and driving organisational strategy.
Identifying communities of practice is an essential first step to
understand the knowledge resources of an organization [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ].
Organisations can bring the right people together to help the
identified communities of practice to flourish and expand, for
example by providing them with appropriate infrastructure and
give them support and recognition. However, community of
practice identification is currently a resource-heavy process
largely based on interviews, mainly because of the informal nature
of such community structures that are normally hidden within and
across organisations.
      </p>
      <p>OntoCoPI is a tool that uses ontology-based network analysis to
support the task of community of practice identification. A
breadth-first spreading activation algorithm is applied by
OntoCoPI to crawl the ontology network of instances and
relationships to extract patterns of certain relations between
entities relating to a community of practice. The crawl can be
limited to a given set of ontology relationships. These
relationships can be traced to find specific information, such as
who attended the same events, who co-authored papers and who
are members of the same project or organisation. Communities of
practice are based on informal sets of relationships while
ontologies are normally made up of formal relationships. The
hypothesis underlying OntoCoPI is that some informal
relationships can be inferred from the presence of formal ones.
For instance, if A and B have no formal relationships, but they
have both authored papers with C, then that could indicate a
shared interest.</p>
      <p>
        One of the advantages of using an ontology to identify
communities of practice, rather than other traditional information
networks [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is that relationships can be selected according to
their semantics, and can have different weights to reflect relative
importance. For example the relations of document authorship
and project membership can be selected if it is required to identify
communities of practice based on publications and project work.
OntoCoPI allows manual selection of relationships or automatic
selection based on the frequency of relationship use within the
knowledge base. Selecting the right relationships and weights is
an experimental process that is dependent on the ontology
structure, the type and amount of information in the ontology, and
the type of community of practice required.
      </p>
      <p>When working with a new community of practice some
experiments will be needed to see which relationships are relevant
to the desired community of practice, and how to set relative
weights. In the experiments described in this paper, certain
relationships were selected manually and weighted based on our
preferences. Further trials are needed to determine the most
effective selection.</p>
    </sec>
    <sec id="sec-13">
      <title>7. INTEGRATION OF THE TWO</title>
    </sec>
    <sec id="sec-14">
      <title>TECHNOLOGIES</title>
      <p>We have investigated the integration of the ontology, OntoCoPI
and Quickstep recommender system to provide a solution to both
the cold-start problem and interest acquisition problem. Figure 7
shows our experimental systems after integration.</p>
      <p>User interest</p>
      <p>profiles
Quickstep</p>
      <p>AKT
Ontology</p>
      <p>User
publications</p>
      <p>Communities
of practice</p>
      <p>User and domain</p>
      <p>knowledge
OntoCoPI
Upon start-up, the ontology provides the recommender system
with an initial set of publications for each of its registered users.
Each user’s known publications are then correlated with the
recommender systems classified paper database, and a set of
historical interests compiled for that user. These historical
interests form the basis of an initial profile, overcoming the
newsystem cold-start problem. Figure 8 details the initial profile
algorithm. As per the Quickstep profiling algorithm, fractional
interest in a topic super-classes is inferred when a specific topic is
added.
∑ 1 / publication age(n)
t = &lt;research paper topic&gt;
When the recommender system is up and running and a new user
is added, the ontology provides the historical publication list of
the new user and the OntoCoPI system provides a ranked list of
similar users. The initial profile of the new user is formed from a
correlation between historical publications and any similar user
profiles. This algorithm is detailed in figure 9, and addresses the
new-user cold-start problem.
__γ___
1.. Nsimilar
∑ profile interest(u,t)
profile interest(u,t) = interest of user u in topic t * CoP confidence
new-user initial profile = (t, topic interest(t))*
t = research paper topic
u = user
γ = weighting constant &gt;= 0
Nsimilar = number of similar users
Npubs t = number of publications belonging to class t
CoP confidence = Communities of practice confidence
The task of populating and maintaining the ontology of user
research interests is left to the recommender system. The
recommender system compiles user profiles on a daily basis, and
these profiles are asserted into the ontology when ready. Figure 10
details the structure of these profiles. In this way up-to-date
interests are maintained, providing a solution to the interest
acquisition problem. The interest data is used alongside the more
static information within the ontology to improve the accuracy of
the OntoCoPI system.
user profile = (topic, interest)*
topic = research topic
interest = interest value</p>
    </sec>
    <sec id="sec-15">
      <title>7.1 Example of system operation</title>
      <p>When the Quickstep recommender system is first initialised, it
retrieves a list of people and their publication URLs from the
ontology. Quickstep analyses these publications and classifies
them according to the research topic hierarchy in the ontology.
Paper topics are associated with their date of publication, and the
‘new-system initial profile’ algorithm used to compute a set of
initial profiles for each user.</p>
      <p>Topic</p>
      <sec id="sec-15-1">
        <title>Knowledge Technology\Knowledge Management</title>
      </sec>
      <sec id="sec-15-2">
        <title>Knowledge Technology\Ontology</title>
        <p>AI\Agents\Recommender Systems
Knowledge Technology\Knowledge Acquisition
At a later stage, after Quickstep has been running for a while, a
new user registers with email address sem99r@ecs.soton.ac.uk.
OntoCoPI identifies this email account as that of Stuart
Middleton, a PhD candidate within the department, and returns
the ranked and normalised communities of practise list displayed
in table 3. This communities of practise list is identified using
relations on conference attendance, supervision, authorship,
research interest, and project membership, using the weights 0.4,
0.7, 0.3, 0.8, and 0.5 respectively. De Roure was found to be the
closest person as he is Middleton’s supervisor, and has one joint
publication co-authored with Middleton and Shadbolt. The people
with 0.82 values are other supervisees of De Roure. Alani
attended the same conference that Middleton went to in 2001.
The communities of practise list is then sent to Quickstep, which
searches for matching user profiles. These profiles will be more
accurate and up to date than those initially created profiles based
on publications. Quickstep manages to find the profiles in table 4
in its logs.
These profiles are merged to create a profile for the new user,
Middleton, using the ‘new-user initial profile’ algorithm with a γ
value of 2.5. For example, Middleton has a publication on
‘Recommender Systems’ that is 1 year old and DeRoure, Revill
and Shadbolt have interest in ‘Recommender Systems’ – this
topics value is therefore 1/1 + 2.5/5 *
(1.0*0.73+0.82*0.4+0.46*1.0) = 1.76. Table 5 shows the resulting
profile.
Every day Quickstep’s profiles are updated and automatically fed
back to the ontology, where they are used to populate the research
interest relationships of the relevant people.</p>
      </sec>
    </sec>
    <sec id="sec-16">
      <title>8. EMPIRICAL EVALUATION</title>
      <p>
        In order to evaluate the effect both the new-system and new-user
initial profiling algorithms have on our integrated system, we
conducted an experiment based around the browsing behaviour
logs obtained from the Quickstep [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] user trials. The algorithms
previously described are used, as per the example in the previous
section, and the average performance for all users calculated.
      </p>
    </sec>
    <sec id="sec-17">
      <title>8.1 Experimental approach</title>
      <p>Users were selected from the Quickstep trials whom had entries
within the departmental publication database. We selected nine
users in total, with each user typically having one or two
publications.</p>
      <p>The URL browsing logs of these users, extracted from 3 months
of browsing behaviour recorded during the Quickstep trials, were
then broken up into weekly log entries. Seven weeks of browsing
behaviour were taken from the start of the Quickstep trials, and an
empty log created to simulate the very start of the trial.
Eight iterations of the integrated system were thus run, the first
simulating the start of the trial and others simulating the following
weeks 1 to 7. Interest profiles were recorded after each iteration.
Two complete runs were made, one with the ‘new-system initial
profiling’ algorithm and one without. The control run without the
‘new-system initial profiling’ algorithm started with blank profiles
for each of its users.</p>
      <p>An additional iteration was run to evaluate the effectiveness of the
‘new-user initial profile’ algorithm. We took the communities of
practice for each user, based on data from week 7, and used the
‘new-user initial profile’ algorithm to compute initial profiles for
each user as if they were being entered onto the system at the end
of the trial. These profiles were recorded. Because we are using an
early prototype version of OntoCoPI, communities of practice
confidence values were not available; we thus used confidence
values of 1 throughout this experiment.</p>
      <p>In order to evaluate our algorithms effect on the cold-start
problem, we compared all recorded profiles to the benchmark
week 7 profile. This allows us to measure how quickly profiles
converge to the stable state existing after a reasonable amount of
behaviour data has been accumulated. The quicker the profiles
move to this state the quicker they will have overcome the
coldstart.</p>
      <p>Week 7 was chosen as the cut-off point of our analysis since after
about 7 weeks of use the behaviour data gathered by Quickstep
will dominate the user profiles. The effects of bootstrapping
beyond this point would not be significant. If we were to run the
system beyond week 7 we would simply see the profiles
continually adjusting to the behaviour logged each week.</p>
    </sec>
    <sec id="sec-18">
      <title>8.2 Experimental results</title>
      <p>Two measurements were preformed when comparing profiles to
the benchmark week 7 profile. The first, profile precision,
measures how many topics were mentioned in both the current
profile and benchmark profile. Profile precision is an indication of
how quickly the profile is converging to the final state, and thus
how quickly the effects of the cold-start are overcome. The
second, profile error rate, measures how many topics appeared in
the current profile that did not appear within the benchmark
profile. Profile error rate is an indication of the errors introduced
by the two bootstrapping algorithms. Figure 11 describes these
metrics.</p>
      <p>
        It should be noted that we are not measuring the absolute
precision and error rate of the profiles – only the relative precision
and error rate compared to the week 7 steady state profiles.
Measuring absolute profile accuracy is a very subjective matter,
and we do not attempt it here; we are only interested in how
quickly profiles reach their steady states. A more complete
evaluation of Quickstep’s overall profiling and recommendation
performance can be found in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
profile precision =
profile error rate =
      </p>
      <p>1
_____
Nusers</p>
      <p>1
_____
Nusers
user
user
1.. Nusers
_____N_c_o_rr_ec_t___
Ncorrect + Nmissing</p>
      <p>Nincorrect
______________________
Ncorrect + Nincorrect + Nmissing
Ncorrect
Nmissing
Nincorrect
Nusers</p>
      <sec id="sec-18-1">
        <title>Number of user topics that appear in current</title>
        <p>profile and benchmark profile
Number of user topics that appear in benchmark
profile but not in current profile
Number of user topics that appear in current
profile but not in benchmark profile
Total number of users
The results of our experimental runs are detailed in figures 12 and
13. The new-user results consist of a single iteration, so appear on
the graphs as a single point.</p>
        <p>At the start, week 0, no browsing behaviour log data is available
to the system so the profiles without bootstrapping are empty. The
new-system algorithm, however, can bootstrap the initial user
profiles and achieves a reasonable precision of 0.35 and a low
error rate of 0.06. We found that the new-system profiles
accurately captured interests users had a year or so ago, but
tended to miss current interests. This is because publications are
generally not available for up-to-date interests.</p>
        <p>As we would expect, once the weekly behaviour logs become
available to the system the profiles adjust accordingly, moving
away from the initial bootstrapping. On week 7 the profiles
converge to the benchmark profile.</p>
        <p>The new-user algorithm result show a more dramatic increase in
precision to 0.84, but comes at the price of a significant error rate
of 0.55. The profiles produced by the new-user algorithm tended
to be very inclusive, taking the set of similar user interests and
producing a union of these interests. While this captures many of
the new users real interests, it also included a large number of
interests not relevant to the new user but which were interesting to
the people similar to the new user.</p>
        <p>Profile precision relative to benchmark profile
Since error rate is measured relative to the final benchmark profile
of week 7, all the topics seen in the behaviour logs will be present
within the benchmark profile. Incorrect topics must thus come
from another source – in this case bootstrapping on week 0. This
causes error rates to be constant over the 7 weeks, since the
incorrect topics introduced on week 0 remain for all subsequent
weeks.</p>
      </sec>
    </sec>
    <sec id="sec-19">
      <title>9. DISCUSSION</title>
      <p>Cold-starts in recommender systems and interest acquisition in
ontologies are serious problems. If initial recommendations are
inaccurate, user confidence in the recommender system may drop
with the result that not enough usage data is gathered to overcome
the cold-start. In regards to ontologies, up-to-date interests are not
generally available from periodically updated information sources
such as web pages, personal records or publication databases.
Our integration of the Quickstep recommender system, AKT
ontology and OntoCoPI system has demonstrated one approach to
reduce both the cold-start and interest-acquisition problems. Our
practical work suggests that using an ontology to bootstrap user
profiles can significantly reduce the impact of the recommender
system cold-start problem. It is particularly useful for the
newsystem cold-start problem, where the alternative is to start with no
information at all. Regularly feeding the recommender systems
interest profiles back to the ontology also clearly assists in the
acquisition of up-to-date interests. A number of issues have,
however, arisen from our integration.</p>
      <p>The new-system algorithm produced profiles with a low error rate
and a reasonable precision of 0.35. This reflects that previous
publications are a good indication of users current interests, and
so can produce a good starting point for a bootstrap profile.
Where the new-system algorithm fails is for more recent interests,
which make up the remaining 65% of the topics in the final
benchmark profile. To discover these more recent interests, it is
possible that the new-system algorithm could be extended to take
some of the other information available within the ontology into
account, such as the projects a user is working on. To what degree
these relationships will help is difficult to predict however, since
the ontology itself has great difficulty in acquiring knowledge of
recent interests.</p>
      <p>For the purposes of our experiment, we selected those users who
had some entries within the universities on-line publication
database. There were some users who had not entered their
publications into this database or who have yet to publish their
work. For these users there is little information within the
ontology, making any new-system initial profiles of little use. In a
larger scale system, more sources of information would be needed
from the ontology to build the new-system profiles. This would
allow some redundancy, and hence improve robustness in the
realistic situation where information is sparsely available.
The community of practice for a user was found not to be always
relevant based on our selection of relationships and weights. For
example, Dave de Roure supervises Stuart Middleton, but Dave
supervises a lot of other students interested in mobile agents.
These topics are not relevant to Stuart, which raises the question
of how relevant the supervision relationship is to our
requirements, and how best to weight such a relationship. Further
experiments are needed to identify the most relevant settings for
community of practice identification. The accuracy of our
communities of practice are also linked to the accuracy of the
research interest information as identified by the recommender
system.</p>
      <p>The new-user algorithm achieved good precision of 0.84 at the
expense of a significant 0.55 error rate. This was because both the
communities of practice generated for users were not always
precise, and because of the new-user algorithm included all
interests from the similar users. An improvement would be to only
use those interests held by the majority of the people within a
community of practice. This would exclude some of the less
common interests that would otherwise be included into the
newuser profile.</p>
      <p>The new-user initial profile algorithm defines the constant γ,
which determines the proportional significance of previous
publications and similar users. Factors such as the availability of
relationship data within the ontology and quality of the
publication database will affect the choice of value for γ. We used
a value of 2.5, but empirical evaluation would be needed to
determine the best value.</p>
      <p>There is an issue as to how best to calculate the “semantic
distance” between topics within the is-a hierarchy. We make the
simplifying assumption that all is-a links have equal relevance,
but the exact relevance will depend on each topic in question. If
individual weightings were allowed for each topic, a method for
determination of these weights would have to be considered.
Alternatively the is-a hierarchy could be carefully constructed to
ensure equal semantic distance.</p>
      <p>A positive feedback loop exists between the recommender system
and ontology, making data incest a potential problem. For new
users there are no initial interest entries within the ontology, so
new user profiles are not incestuous. If the recommender system
were to use the communities of practice for more than just initial
profiles, however, a self-confirming loop would exist and interest
calculations would be incestuous.</p>
      <p>Finally, a question still remains as to just how good an initial
profile must be to fully overcome the effects of the cold-start
problem. If initial recommendations are poor users will not use
the recommender system and hence it will never get a chance to
improve. We have shown that improvements can be made to
initial profiles, but further empirical evaluation would be needed
to evaluate exactly how much improvement is needed before the
system is “good enough” for users to give it a chance.
10. FUTURE WORK
The next step for the integrated system is to continue to improve
the set of relationships and weights used to calculate communities
of practice, and find a more selective ‘new-user initial profile’
algorithm. With more precise communities of practice the
newuser bootstrapping error rate should fall substantially. We could
then conduct a set of further user trials. This would allow the
assessment of user up-take and use of the integrated system, and
reveal how improving initial profiles affect overall system usage
patterns.</p>
      <p>The Quickstep recommender system is currently being extended
to explore further the idea of using ontologies to represent user
profiles. A large-scale trial is under way over a full academic year
to evaluate the new system, which is called the Foxtrot
recommender system.
11. ACKNOWLEDGEMENTS
This work is funded by EPSRC studentship award number
99308831 and the Interdisciplinary Research Collaboration In
Advanced Knowledge Technologies (AKT) project
GR/N15764/01.
12. REFERENCES</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Aha</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Kibler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Instance-based learning algorithms</article-title>
          ,
          <source>Machine Learning</source>
          ,
          <volume>6</volume>
          :
          <fpage>37</fpage>
          -
          <lpage>66</lpage>
          ,
          <year>1991</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Alani</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O</given-names>
            <surname>'Hara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            , and
            <surname>Shadbolt</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          <article-title>ONTOCOPI: Methods and Tools for Identifying Communities of Practice</article-title>
          ,
          <source>Intelligent Information Processing Conference</source>
          , IFIP World Computer Congress (WCC), Montreal, Canada,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and Barabasi, AL.
          <article-title>Statistical Mechanics of Complex Networks</article-title>
          .
          <source>Review of Modern Physics</source>
          ,
          <volume>74</volume>
          ,
          <fpage>47</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Balabanovi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Shoham</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <article-title>Fab: content-based, collaborative recommendation</article-title>
          ,
          <source>Communications of the ACM</source>
          Volume
          <volume>40</volume>
          , No.
          <volume>3</volume>
          (
          <issue>Mar</issue>
          .
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Becerra-Fernandez</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <article-title>Facilitating the Online Seach of Experts at NASA using Expert Seeker People-Finder</article-title>
          .
          <source>Proceedings of the 3rd International Conference on Practical Aspects of Knowledge Management (PAKM)</source>
          , Basel, Switzerland,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Bollacker</surname>
            ,
            <given-names>K.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lawrence</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          <article-title>CiteSeer: An Autonomous Web Agent for Automatic Retrieval and Identification of Interesting Publications</article-title>
          , Proceedings of the Second International Conference on Autonomous Agents,
          <string-name>
            <surname>Minneapolis</surname>
            <given-names>MN</given-names>
          </string-name>
          , USA,
          <year>1998</year>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[7] Brown and Duguid</source>
          <year>2000</year>
          .
          <article-title>The social life of information Harvard Buisness School Press</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Craven</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>DiPasquo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Freitag</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mitchell</surname>
          </string-name>
          ,
          <string-name>
            <surname>T. Nigam</surname>
            <given-names>K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Slattery</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <article-title>Learning to Extract Symbolic Knowledge from the World Wide Web</article-title>
          ,
          <source>Proceedings of the 15th National Conference on Artificial Intelligence (AAAI98)</source>
          ,
          <fpage>1998</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Dunlop</surname>
            ,
            <given-names>M. D.</given-names>
          </string-name>
          <article-title>Development and evaluation of clustering techniques for finding people</article-title>
          .
          <source>Proceedings of the 3rd International Conference on Practical Aspects of Knowledge Management (PAKM)</source>
          , Basel, Switzerland,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Eriksson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fergeson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shahr</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>1999</year>
          ).
          <article-title>Automatic generation of ontology editors</article-title>
          .
          <source>12th Workshop on Knowledge Acquisition, Modelling, and Management (KAW'99)</source>
          , Ban, Alberta, Canada.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Freund</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Schapire</surname>
            ,
            <given-names>R.E.</given-names>
          </string-name>
          <article-title>Experiments with a New Boosting Algorithm</article-title>
          ,
          <source>Proceedings of the Thirteenth International Conference on Machine Learning</source>
          ,
          <year>1996</year>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Guarino</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Giaretta</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>1995</year>
          ).
          <article-title>Ontologies and Knowledge bases: towards a terminological clarification. Towards Very Large Knowledge Bases: Knowledge Building and Knwoledge Sharing</article-title>
          . N. Mars, IOS Press:
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Guarino</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masolo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Vetere</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <article-title>OntoSeek: ContentBased Access to the Web</article-title>
          ,
          <source>IEEE Intelligent Systems</source>
          , Vol.
          <volume>14</volume>
          , No. 3, May/June 1999
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>B.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maltz</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herlocker</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gordon</surname>
            ,
            <given-names>L.R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Riedl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <article-title>GroupLens: applying collaborative filtering to Usenet news</article-title>
          ,
          <source>Communications of the ACM</source>
          Volume
          <volume>40</volume>
          , No.
          <volume>3</volume>
          (
          <issue>Mar</issue>
          .
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Lieberman</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Letizia</surname>
          </string-name>
          :
          <article-title>An Agent That Assists Web Browsing</article-title>
          ,
          <source>Proceedings of the 1995 International Joint Conference on Artificial Intelligence</source>
          , Montreal, Canada,
          <year>August 1995</year>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Lotus</surname>
          </string-name>
          .
          <article-title>Locating Organisational Expertise with the Lotus Discovery Server</article-title>
          , White Paper,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Maltz</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Ehrlich</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <article-title>Pointing the way: Active collaborative filtering</article-title>
          ,
          <source>CHI'95 Human Factors in Computing Systems</source>
          ,
          <year>1995</year>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Middleton</surname>
            ,
            <given-names>S.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Roure</surname>
            ,
            <given-names>D. C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Shadbolt</surname>
            ,
            <given-names>N.R.</given-names>
          </string-name>
          <string-name>
            <surname>Capturing</surname>
          </string-name>
          <article-title>Knowledge of User Preferences: ontologies on recommender systems</article-title>
          ,
          <source>In Proceedings of the First International Conference on Knowledge Capture (K-CAP</source>
          <year>2001</year>
          ),
          <year>Oct 2001</year>
          ,
          <string-name>
            <surname>Victoria</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          <string-name>
            <surname>Canada</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Mladenic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <article-title>Personal WebWatcher: Implementation and Design</article-title>
          ,
          <source>Technical Report IJS-DP-7472</source>
          , Department of Intelligent Systems, J.Stefan Institute, Slovenia,
          <year>1996</year>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>O</given-names>
            <surname>'Hara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Shadbolt</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Buckingham</given-names>
            <surname>Shum</surname>
          </string-name>
          , S.
          <source>The AKT Manifesto</source>
          ,
          <year>2001</year>
          . www.aktors.org/publications/Manifesto.doc
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Pazzani</surname>
            ,
            <given-names>M. Muramatsu J.</given-names>
          </string-name>
          <string-name>
            <surname>Billsus</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Syskill</surname>
          </string-name>
          &amp;
          <article-title>Webert: Identifying interesting web sites</article-title>
          ,
          <source>Proceedings of the National Conference on Artificial Intelligence</source>
          , Portland, Oregon,
          <year>1996</year>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Porter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>An algorithm for suffix stripping</article-title>
          ,
          <source>Program</source>
          <volume>14</volume>
          (
          <issue>3</issue>
          ),
          <year>July 1980</year>
          , pp.
          <fpage>130</fpage>
          -
          <lpage>137</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Resnick</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Varian</surname>
            ,
            <given-names>H. R.</given-names>
          </string-name>
          <article-title>Recommender systems</article-title>
          ,
          <source>Communications of the ACM</source>
          ,
          <volume>40</volume>
          (
          <issue>3</issue>
          ),
          <year>1997</year>
          ,
          <fpage>56</fpage>
          -
          <lpage>58</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Rocchio</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          <article-title>Relevance feedback in information retrieval, in the SMART Retrieval System -</article-title>
          -
          <source>Experiments in Automatic Document Processing</source>
          , Englewood Cliffs, NJ,
          <year>1971</year>
          . Prentice Hall, Inc. pp.
          <fpage>313</fpage>
          -
          <lpage>323</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Schwab</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pohl</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Koychev</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <article-title>Learning to Recommend from Positive Evidence</article-title>
          ,
          <source>Proceedings of Intelligent User Interfaces</source>
          <year>2000</year>
          , ACM Press, pp
          <fpage>241</fpage>
          -
          <lpage>247</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Terveen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amento</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Creter</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <article-title>PHOAKS: a system for sharing recommendations</article-title>
          ,
          <source>Communications of the ACM</source>
          Volume
          <volume>40</volume>
          , No.
          <volume>3</volume>
          (
          <issue>Mar</issue>
          .
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Wenger</surname>
            ,
            <given-names>E. C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Snyder</surname>
          </string-name>
          , W. M.
          <article-title>Communities of Practice: The Organizational Frontier</article-title>
          . Harvard Business Review. January-February:
          <fpage>139</fpage>
          -
          <lpage>145</lpage>
          .
          <year>2000</year>
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Wenger</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <article-title>Communities of practice: the key to knowledge strategy. Knowledge Directions: The Journal of the Institute for Knowledge Management</article-title>
          ,
          <volume>1</volume>
          ,
          <fpage>48</fpage>
          -
          <lpage>93</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>