<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Improving Tag-based Resource Recommendation with Association Rules on Folksonomies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Samia Beldjoudi</string-name>
          <email>beldjoudig@labged.net</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hassina Seridi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Catherine Faron Zucker</string-name>
          <email>catherine.faron-zucker@unice.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>I3S, Universite Nice - Sophia Antipolis</institution>
          ,
          <addr-line>CNRS 930 route des Colles, BP 145, 06930 Sophia Antipolis cedex</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Laboratory of Electronic Document Management LabGED Badji Mokhtar University</institution>
          ,
          <addr-line>Annaba</addr-line>
          ,
          <country country="DZ">Algeria</country>
        </aff>
      </contrib-group>
      <fpage>26</fpage>
      <lpage>37</lpage>
      <abstract>
        <p>In this paper, we propose a method to analyze user pro les according to their tags in order to personalize the recommendation of resources. Our objective is to enrich the pro les of folksonomy users with pertinent resources. We argue that the automatic sharing of resources strengthens social links among actors and we exploit this idea to enrich user pro les by increasing the weights associated to web resources according to social relations. We base upon association rules which are a powerful method for discovering interesting relationships among a large set of data on the web. We extract association rules from folksonomies and use them to recommend supplementary resources associated to the tags involved in these rules. In this recommendation process, we reduce tag ambiguity by taking into account social similarities calculated on folksonomies.</p>
      </abstract>
      <kwd-group>
        <kwd>Folksonomies</kwd>
        <kwd>Social Tagging</kwd>
        <kwd>Association Rules</kwd>
        <kwd>Tag-based Resource Recommendation</kwd>
        <kwd>Tag Ambiguity</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Web 2.0 technologies have created the conditions for new usages on the web
which has become a social web. Users create, annotate, share and make public
what they nd interesting on the web and therefore are greatly involved in
the evolution of the web. Folksonomies are one of the keystones of these new
social practices: they are systems of classi cation derived from the practice and
method of collaboratively creating and managing tags to annotate and categorize
content. This practice is known as collaborative tagging or social tagging. Among
the most popular social websites based on folksonomies let us cite Delicious
which o ers an e ective way to conduct the collaborative management of social
bookmarking, Flickr which is a photo management and sharing web application,
Youtube and Dailymotion designed for sharing videos, Myspace and Odeo for
sharing music les.</p>
      <p>The basic principle of social tagging relies on three main elements: the user,
the resource and the tag. The combination of these three elements enables the
development of semantic tools exploiting both folksonomies and annotations of
web resources by users with tags. In this paper, we propose a method to analyze
user pro les according to their tags in order to predict interesting personalized
resources and recommend them. In other words, our objective is to enrich the
pro les of folksonomy users with pertinent resources. We argue that the
automatic sharing of resources strengthens social links among actors and we exploit
this idea to reduce tag ambiguity in the recommendation process by increasing
the weights associated to web resources according to social similarities. We base
upon association rules which are a powerfull method for discovering interesting
relationships among a large set of data on the web.</p>
      <p>We insist on the fact that our nal aim is not to suggest tags to users: each
time a resource is presented to a user, the tags already used to annotate this
resource are indicated but the user is free to tag the resource by choosing a
tag among them or using a new one. Our aim is to recommend resources which
are annotated with tags suggested by association rules, in order to enrich user
pro les with these resources (if they validate them). In other words, our aim is
to enrich user pro les based on similarities between users and association rules
and by doing so to increase the community e ect when suggesting resources to
a given user. Our approach comes from a new view on the community e ect in
folksonomies since it aims at automatically strengthening existing correlations
between di erent members of online communities, without involving the user in
this process. The fact of suggesting to each user some resources considered useful
or interesting for him without him specifying explicit tags, this can signi cantly
improve folksonomy-based recommendation systems, because the man-machine
interaction and therefore the user e ort are considerably reduced.</p>
      <p>This paper is organized as follows: Section 2 is an overview of the main
contributions related to our work. Section 3 is dedicated to the presentation of
our approach. In section 4 we present and discuss the results of some experiences
we conducted to measure the performance of our approach. Conclusion and
future works are described in Section 6.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Related Works</title>
      <sec id="sec-2-1">
        <title>Tag Recommendation</title>
        <p>
          The general aim of tag recommendation systems is to help users choose the
appropriate tags when annotating resources in order to increase the weights
associated to each tag and so cross a step up to building a common vocabulary in
these systems. Among the many works adressing this problem, let us cite that of
Schmitz et al. [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] who showed how association rules can be adopted to analyze
and structure folksonomies and how these folksonomies can be used for learning
ontologies and supporting emergent semantics. Their approach consists in
reducing the ternary dimension of a folksonomy by projecting it on a triadic context,
and then in extracting association rules from this two-dimensional projection.
An association rule A ) B is interpreted in two ways: users assigning tags in
A to some resources often also assign tags in B to them or users labeling the
resources in A with some tags often also assign these tags to the resources in B.
        </p>
        <p>
          Another noticeable contribution is that of Jaschke et al. [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] who present a
formal model and a new search algorithm called FolkRank, especially designed
for folksonomies. It is also applied to nd communities within a folksonomy and
is used to structure search results. The authors have exploited the idea of the
PageRank algorithm, which consists in considering a web page as important
when there are several other pages connected to it. In FolkRank, a resource
tagged by an important number of users with an important number of tags
becomes important. The same type of relationships becomes true for tags and
users. The idea is to create graphs, and to associate to each node of these graphs
a weight indicating its importance.
        </p>
        <p>
          Gemmell et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] propose a tag-based recommendation method based on
the adaptation of the K-nearest neighbor algorithm so that it accepts as input
both a user U and a resource R and gives out a set of tags T . The interest of this
approach is to orient users to use the same tags, and thus increase the chance of
building a common vocabulary used by all members.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Resource Recommendation</title>
        <p>
          The general aim of resource recommendation systems is to enrich the quantity
and relevance of the recommended resources. Among the works adressing this
problem, let us cite De Meo et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] who propose an approach based on the
principle of query expansion. The aim is to recommend resources to users
searching by tags by enhancing their pro les represented by their tag-based queries.
The principle of the approach is to enrich user pro les by additional tags
discovered through the exploration of the two graphs TRG and TUG representing the
relations respectively between tags and resources and between tags and users.
        </p>
        <p>Let us note that, when compared to the works on tag recommendation, the
principle is the same: extract the most appropriate tags. Most of the techniques
performed in this process demonstrate their contributions for building a language
more or less common between users of folksonomies. However the methods that
are used to achieve this goal are di erent from one approach to another.
Regarding the work of De Meo et al., we can say that the results obtained with their
approach show that the idea of proposing a system of resource recommendation
is pertinent: the rates of precision and recall are optimistic. However the fact of
forcing the user to specify a list of tags in order to get resources can generate a
cognitive overload and it obliges the system to focus on the participation of the
user to perform its recommendation procedure. Moreover the technique that has
been designed in this work does not take into account the semantics between
tags, in particular it cannot distinguish between ambiguous tags and therefore
it may recommend resources that will be rejected by the user because they are
not close to his preferences.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Resolving Tag Ambiguity</title>
        <p>
          According to Mathes [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], \the terms in a folksonomy have inherent ambiguity as
di erent users apply terms to documents in di erent ways. There are no explicit
systematic guidelines and no scope notes". For this reason we are concerned by
the problem of tag ambiguity in our approach of tag-based resource
recommendation.
        </p>
        <p>
          Among the most important contributions on resolving tag ambiguity or
extracting the semantic links between tags in a folksonomy, we start with [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]
where Mika has proposed to extend the traditional bipartite model of ontologies
to a tripartite one where the instances are keywords used by the actors of the
system in order to annotate web resources. In his paper, Mika focuses on social
network analysis in order to extract lightweight ontologies, and therefore
semantics between the terms used by the actors. In [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], Gruber stated that there is
no contrast between ontologies and folksonomies, and so recommended to build
an ontology of folksonomy. According to Gruber, the problem of the lack of
semantic links between terms in folksonomies can be easily resolved by
representing folksonomies with ontologies. Specia and Motta [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] in their turn have
preferred the use of ontologies to extract the semantics of tags. The proposed
method consists in building clusters of tags, and then trying to identify possible
relationships between tags in the same cluster. The authors have chosen to reuse
available ontologies on the semantic web in order to express correlations which
can hold between tags. An attempt to automate this method has been done by
Angeletou et al. [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>
          Bu a et al. [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] present a semantic wiki reconciling two trends of the future
web: a semantically augmented web and a web of social applications where every
user is an active provider as well as a consumer of information. The goal here is
to exploit the force of ontologies and semantic web standard languages in order
to improve social tagging. According to the authors, with this approach,
tagging remains easy and becomes both motivating and unambiguous. The niceTag
project of Limpens et al. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] is focused on the same principle: the use of ontologies
in order to extract the semantics between tags in a system. In addition, the
interactions among users and the system are used to validate or invalidate automatic
treatments carried out on tags. The authors have proposed methods to build
lightweight ontologies which can be used to suggest terms semantically close
during a tag-based search of documents. Pan et al. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] address the problem of
tag ambiguity by expanding folksonomy search with ontologies. They proposed
to expand folksonomies in order to avoid bothering users with the rigidity of
ontologies. During a keyword-based search of resources, the set of ambiguous
used terms is concatenated with other tags so as to increase the precision of the
search results.
        </p>
        <p>
          To sum up, most of the works aspire to bring together ontologies and
folksonomies as a solution to resolve tag ambiguity and overcome the lack of semantic
links between tags. Sure enough the approaches described in this section show
that the social nature of resource sharing is not in contradiction with the
possibilities o ered by ontology-based systems. But the rigidity that characterizes
ontologies and the need for an expert who must control and organize the links
between terms as in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] seem a little cumbersome and too much expensive. Even
the structures automatically extracted as in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] still su er from the ambiguity
of concepts. Regarding the work of [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], we can say that the use of semantic web
ontologies for extracting relationships between terms is not su cient, because
as the semantic web includes some speci c domain ontology, that will push back
the problem. Also the expertise of users which was introduced in [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] is
characterized by the complexity of its exploitation. As a result we propose an approach
of tag-based resource recommendation where we aim to resolve tag ambiguity
without explicitly using ontologies.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Resource Recommendations based on Association</title>
    </sec>
    <sec id="sec-4">
      <title>Rules</title>
      <p>3.1</p>
      <sec id="sec-4-1">
        <title>Association Rules: Basic De nitions</title>
        <p>
          In data mining, learning association rules is a widely used method for discovering
interesting relations among variables in large databases. [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] describes analyzing
and presenting strong rules discovered in databases using di erent measures of
interestingness. Based on this concept of strong rules, [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] introduces association
rules for discovering regularities between products in large scale transaction data
recorded by point-of-sale (POS) systems in supermarkets. For example, the rule
fonions; potatoesg ) burgers found in the sales data of a supermarket indicates
that if a customer buys onions and potatoes together, he is likely to also buy
burgers3. According to the original de nition by [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], the problem of association
rule mining is de ned as follows:
De nition 1. Let I = fi1; : : : ; ing be a set of n binary attributes called items.
A rule is an implication X ) Y where X; Y I and X \ Y = ;. The sets
of items (itemsets) X and Y are called antecedent and consequent of the rule
respectively.
        </p>
        <p>To select interesting rules from the set of all possible rules in a database
D = fd1; : : : ; dmg, with each transaction in D containing a subset of items in I,
two measures are commonly used: support and con dence.</p>
        <p>De nition 2. The support supp(X) of an itemset X is the proportion of
transactions in D which contain X.</p>
        <p>De nition 3. The con dence conf (X ! Y ) of a rule X ! Y measures the
proportion of transactions in D that contain Y among those that contain X.
conf (X ! Y ) = suspupp(pX(X[Y) ) .</p>
        <sec id="sec-4-1-1">
          <title>Let us illustrate these notions on the following dataset.</title>
          <p>3 http://en.wikipedia.org/wiki/Association rules [Retrieved 13 May 2011]</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>Transaction ID</title>
          <p>1
2
3
4
5</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>ItemSet</title>
          <p>Bread, Cream, Water</p>
          <p>Cream
Bread, Cream, Milk</p>
          <p>Water
Cream, Water
From this dataset, we can extract the rule Bread ) Cream with a con dence
conf (Bread ) Cream) = supp(fCream;Breadg) 24==55 = 1=2.</p>
          <p>supp(Bread</p>
          <p>To be selected as signi cant and interesting, association rules are usually
required to satisfy a user-speci ed minimum support and a user-speci ed
minimum con dence. The process of generating association rules is usually split up
into two separate steps: First, the minimum support constraint is applied to nd
all frequent itemsets in a database. Second, the minimum con dence constraint
is applied among the rules involving these frequent itemsets. The quality of the
extraction algorithm thus strongly depends on the values chosen by the user for
the minimum support and minimum con dence, which adequacy is relative to
the application.
3.2</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>Association Rules and Folksonomies</title>
        <p>
          A folksonomy is a tuple F =&lt; U; T; R; A &gt; where U , T and R represent
respectively a set of users, a set of tags and a set of resources, and A represents the
relationships between the three preceding elements, i.e. A U T R [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. In
our approach we consider a folksonomy as being a tripartite model where web
resources are associated with a user to a list of tags. Therefore we have extracted
three social networks from our folksonomy, which represent three di erent
viewpoints on social interactions: one network relating tags and users, a second one
relating tags and resources and a third one relating users and resources. We
represent these social networks by three matrices T U , T R, U R:
{ T U = [Xij ] where Xij = 01 iofth9err2wiRse; &lt; uj ; ti; r &gt;2 A
{ T R = [Yij ] where Yij = 01 iofth9eurw2iUse; &lt; u; ti; rj &gt;2 A
{ U R = [Zij ] where Zij = 01 iofth9etr2wiTse;&lt; ui; t; rj &gt;2 A
{ RU , RT and U T are the transposed matrices of U R, T R and T U .
        </p>
        <p>
          This enables us to analyze the correlations captured from the di erent social
interactions. We used Pajek, a tool which has already been used by Mika to
analyze large networks [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>To apply an association rule method to folksonomies, we represent each user
in a folksonomy by a transaction ID and the tags he uses by the set of items which
are in this transaction. The following table provides an illustrative example of a
dataset of user tags.</p>
        <sec id="sec-4-2-1">
          <title>Transaction ID</title>
          <p>U1
U2
U3
U4
U5</p>
        </sec>
        <sec id="sec-4-2-2">
          <title>Itemset</title>
          <p>Computer, Programming</p>
          <p>Computer, Apple
Kitchen, Apple
Programming</p>
          <p>Kitchen</p>
          <p>
            Our goal is to nd correlations between tags, i.e. to nd tags which frequently
appear together, in order to extract those which are not used by one particular
user but which are often used by other users close to him in the social network.
For example, let us consider a dataset in which it occurs that many users who
use the tag Software also employ the tag Java. We aim at extracting a rule
Sof tware ) J ava so that we can enrich the pro les of users who employ the
tag Software but not the tag Java, by the resources tagged with Java. Among
the wide range of algorithms proposed to extract interesting association rules,
we use the one known as Apriori [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ].
          </p>
          <p>Once the rules are extracted, our recommendation system proceeds as follows.
For each extracted rule, we test whether the tags which are in the antecedent
of the rule are used by the current user. If it is the case then the resources
tagged with each tag found in the consequent of the rule are candidate to be
recommended by the system. The e ectiveness of the recommendation depends
on the resolution of tag ambiguity, as explained in the next section.
3.3</p>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>Resolving Tag Ambiguity in Recommendation</title>
        <p>A tag can have several meanings, i.e. refer to several concepts. Therefore, a basic
tag-based recommendation system would equally recommend resources relative
to fruits or to computers for a user searching with the tag apple. The resolution
of tag ambiguity is specially crucial in our approach where some tags which are
used to recommend resources are not directly used by the user but deduced with
association rules. To resolve the problem of tag ambiguity in recommendation,
we propose to measure the similarity between users to identify those who have
similar preferences and therefore adapt the recommendation to user pro les.
Similarity between Users. For each extracted association rule whose
antecedent applies to the user searching for resources, we measure the similarities
between this user and the users of his social network who use the tags
occuring in the consequent of the rule. The resources associated to these tags are
recommended to the user depending on these similarities.</p>
        <p>To measure the similarity between two users u1 and u2, we represent each of
them by the vector of binary numbers representing all his tags (extracted from
matrix U R) and we calculate the cosines of the angle between the two vectors:
sim(u1; u2) = cos(v1; v2) = kv1kv12::vk2v2k2</p>
      </sec>
      <sec id="sec-4-4">
        <title>Similarity between Resources. To avoid the cold start problem which gen</title>
        <p>erally results from a lack of data required by the system in order to make a good
recommendation, when the user of the recommendation system is not yet similar
to other users, we also measure the similarity between the resources which would
be recommended by the system (as related to a tag occuring in the consequent
of an association rule) and those which are already recommended to the user.</p>
        <p>To measure the similarity between two resources r1 and r2, we represent each
of them by the vector of binary numbers representing all its tags (extracted from
matrix T R) and we calculate the cosines of the angle between the two vectors.
Levels of Recommendation. Each resource recommended by the system is
rst associated an initial weight based on the similarities between users. Above
a threshold xed in [0; : : : 1], we qualify the resource as highly recommended.
Under this threshold, we consider the similarity between resources and we similarly
highly recommend the resources which weights calculated on the product matrix
RR = RT T R are above a given threshold. Otherwise, we compute the average
ratio between the number of resources shared by the user of the recommendation
system with his social network and the number of resources used by him. These
numbers are given by the product matrix RR = RU U R. Above a threshold
xed in [0..1], we qualify the resource as highly recommended ; under this
threshold, it is simply recommended or weakly recommended if the similarity is close to
zero.</p>
        <p>Whole Process of Recommendation. The activity diagram in Figure 1 gives
an overview of the whole process of recommendation including the key steps
described above to analyze existing interactions between the di erent elements
of a folksonomy, especially those between users.</p>
        <p>Let us note that our recommendation system is exible, since the user can
interact to accept or reject the recommended resources.</p>
        <p>Let us consider the example of a folksonomy represented through the
following three matrices T U , T R and U R:
Let us now suppose that we have extracted the interesting association rule
computer ) apple. Matrix T U shows that tag computer is used by user U1.
Since apple is in the consequent of the rule, matrix T R shows that resources R3
and R5 are candidates for a recommendation to U1. Matrix U T shows that apple
is used by users U2 and U3. Then we calculate the similarity between U1 and U2
and the similarity between U1 and U5, based on matrix U U = U T T U :
U1 and U2 show higher cosine similarity than U1 and U3 . Then, among the
resources tagged with apple, namely R3 and R5 (see matrix T R), those tagged
by U2 are highly recommended to U1: it is only the case of R3 (see matrix U R).</p>
        <p>U1 and U3 are not similar. Then, among the resources tagged with apple, we
compute the similarity of those tagged by U3, namely R5, with those already
recommended by the system, namely R3. It is based on matrix RR = RT T R:
sim(R3; R5) = cos(RR3; RR5)</p>
        <p>R5 and R3 are not similar. Then R5 is weakly recommended to U1.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiments</title>
      <p>In this section, we describe some experiments over two datasets and we analyze
and discuss our results. We have developped a simple application with a
convivial interface enabling the user to log in and get a personalized ordered list of
recommended resources | depending on his tagging activity and social network.
4.1</p>
      <sec id="sec-5-1">
        <title>Experiment over a subset of the del.icio.us database</title>
        <p>In order to validate our approach, we have conducted a rst experiment with the
del.icio.us database. Our test base comprises 207 tag assignments involving 21
users, 97 tags | some of which are ambiguous |, 92 resources | each having
possibly several tags and several users. Our system has extracted a set of 17
association rules from the analysis of the dataset with a support equal to 0.5
and a con dence equal to 0.6. We have for example the rule news ) sof tware:
60% of the users using the tag news also use the tag sof tware.</p>
        <p>To demonstrate the validity of our approach, we have distinguished two
classes of users: the rst one contains the users who have employed
ambiguous tags and the other one those who did not use those tags. This ambiguity of
tags has been subjectively decided: for instance apple is ambiguous and software
is not.</p>
        <p>Not surprisingly, our experiment has showed that, by applying the extracted
association rules, the resources associated to non ambiguous tags are highly
recommended. It has also showed that, in the case of rules involving ambiguous
tags, our system recommends to the user the resources which are close to his
interests with a high level of recommendation and, on the contrary, those which
are far from his interests with a low level of recommendation.
4.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Evaluation of our Recommendation System over an</title>
      </sec>
      <sec id="sec-5-3">
        <title>Experimental Dataset</title>
        <p>To evaluate the quality of our recommendation system, we used the following
three metrics: recall, precision and F1 metric. Precision measures the ability of
the system to reject all the resources which are not relevant. It is given by the
ratio between the number of the relevant resources recommended and that of the
recommended resources. Recall measures the ability of the system to retrieve all
the relevant resources. It is given by the ratio between the number of relevant
resources recommended and that of all the relevant resources in the database.
F1 is a combination of the two previous metrics; it is de ned by the following
formula:</p>
        <p>F 1 = 2 P recision Recall</p>
        <p>P recision+Recall</p>
        <p>Because the calculation of these metrics requires the knowledge of all
relevant resources for each user in order to compare the results provided by our
recommendation system and those which are actually preferred by each user, we
have built a database by inviting 6 users to participate to an experiment.</p>
        <p>We rst made a prototype of a folksonomy in the form of a website. Then
we asked the users to specify their preferred resources. Finally, we asked each
user to tag a set of resources among 18 ones available on our website, by using
free keywords. Based on this dataset, we extracted 10 association rules with a
support equal to 0.5 and a con dence equal to 0.6. Afterwards we calculated the
three metrics for each participant in our test, for each tag. The following table
presents the average values of the metrics we obtained for our 6 users:
U ser</p>
        <p>U1
U2
U3
U4
U5</p>
        <p>U6
Average</p>
        <p>These are quite encouraging results, showing that our approach of
recommendation adapted to user pro les is truly able to help users when searching for
resources.
We have shown through our experiments that the use of data mining methods
and tools has proved its e ectiveness for folksonomy-based recommendation. The
results of our data sample are optimistic and so we can say that the community
e ect which characterizes folksonomies has showed its power in users pro les
enrichment. This enhancement can signi cantly help to improve recommendation
systems. At the same time that our approach contributes to increase the weights
associated to the relevant resources, it reduces tag ambiguity: every time when
there are shared resources between two users, the system can avoid the trap
of tag ambiguity in the research phase and it will test the similarity between
resources when the users are not similar. The extraction of association rules is
based on tags rather than on resources because we believe that tag popularity
in folksonomies is greater than resource popularity and the meaning of tags in
these systems is more signi cant than that of resources: the same resource can
be used for many di erent purposes.
5</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and Future Works</title>
      <p>In this paper, we have proposed a method to automatically enrich user pro les
with a set of relevant resources, based on social networks and folksonomies. We
exploit association rules extracted from the social relations in a folksonomy to
recommend resources tagged with terms occuring in these rules by other users
close in the social network. Our objective is to create a consensus among users of
a same network in order to teach them how they can organize their web resources
in a correct and optimal manner.</p>
      <p>We have tested our approach on a small amount of data where we have
obtained good results, but the validation of our approach still requires a larger
sample set. In order to continue and improve our work, we aim to address the
problem of scalability of our approach on larger databases. The measure of
similarity we use is based on several products of matrices whose dimensions are the
numbers of resources and tags of a folksonomy. In real scenariis, these
dimensions are usually too large. We are intending to explore matrix factorization and
latent semantic analysis.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>R.</given-names>
            <surname>Agraval</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Imielinski</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Swami</surname>
          </string-name>
          .
          <article-title>Mining association rules between sets of items in large databases</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Int. Conference on Management of Data</source>
          , Washington, USA,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>S.</given-names>
            <surname>Angeletou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sabou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Specia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Motta</surname>
          </string-name>
          .
          <article-title>Bridging the Gap Between Folksonomies and the Semantic Web: An Experience Report</article-title>
          .
          <source>In Proc. of ESWC workshop on Bridging the Gap between Semantic Web and Web</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>M.</given-names>
            <surname>Bu a</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gandon</surname>
          </string-name>
          , G. Ereteo,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sander</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Faron</surname>
          </string-name>
          .
          <article-title>SweetWiki: A semantic Wiki</article-title>
          .
          <source>Journal of Web Semantics.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>P. De Meo</surname>
            , G. Quattrone, and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Ursino</surname>
          </string-name>
          .
          <article-title>A query expansion and user pro le enrichment approach to improve the performance of recommender systems operating on a folksonomy. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>20</volume>
          (
          <issue>1</issue>
          ),
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>J.</given-names>
            <surname>Gemmell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Schimoler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ramezani</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <article-title>Adapting k-nearest neighbor for tag recommendation in folksonomies</article-title>
          .
          <source>In Proc. of 7th Workshop on Intelligent Techniques for Web Personalization &amp; Recommender Systems (ITWP'09)</source>
          , Pasadena, California, USA, in
          <source>conjunction with IJCAI</source>
          <year>2009</year>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>T.</given-names>
            <surname>Gruber. TagOntology -</surname>
          </string-name>
          <article-title>a way to agree on the semantics of tagging data</article-title>
          . http://tomgruber.org/writing/tagontology.htm,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. R. Jaschke, L.B.
          <string-name>
            <surname>Marinho</surname>
            , Hotho A.,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Schmidt-Thieme</surname>
            , and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Tag recommendations in folksonomies</article-title>
          .
          <source>In Proc. of 11th Eur. Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD)</source>
          , Warsaw, Poland, volume
          <volume>4702</volume>
          <source>of LNCS</source>
          . Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>F.</given-names>
            <surname>Limpens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gandon</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Bu</surname>
          </string-name>
          <article-title>a. Collaborative semantic structuring of folksonomies</article-title>
          .
          <source>In Proc. of IEEE/WIC/ACM Int. Conference on Web Intelligence (WI</source>
          <year>2009</year>
          ), Milan, Italy,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>A.</given-names>
            <surname>Mathes</surname>
          </string-name>
          .
          <article-title>Folksonomies - Cooperative Classi cation and Communication Through Shared Metadata</article-title>
          . http://www.adammathes.com/academic/computer-mediatedcommunication/folksonomies.html,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>P.</given-names>
            <surname>Mika</surname>
          </string-name>
          .
          <article-title>Ontologies are us: A uni ed model of social networks and semantics</article-title>
          .
          <source>In Proc. of 4th Int. Semantic Web Conference (ISWC</source>
          <year>2005</year>
          ), Galway, Ireland, volume
          <volume>3729</volume>
          <source>of LNCS</source>
          . Springer,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>J.Z. Pan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , and E. Thomas.
          <article-title>Reducing ambiguity in tagging systems with folksonomy search expansion</article-title>
          .
          <source>In Proc. of 6th Eur. Semantic Web Conference (ESWC</source>
          <year>2009</year>
          ), Heraklion, Greece, volume
          <volume>5554</volume>
          <source>of LNCS</source>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. G.
          <string-name>
            <surname>Piatetsky-Shapiro</surname>
          </string-name>
          .
          <article-title>Discovery, analysis, and presentation of strong rules</article-title>
          .
          <source>In Knowledge Discovery in Databases</source>
          . AAAI/MIT Press,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>C. Schmitz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Hotho</surname>
            , R. Jaschkee, and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Mining association rules in folksonomies</article-title>
          .
          <source>In Proc. of IFCS 2006 Conference: Data Science and Classi cation</source>
          , Ljubljana, Slovenia. Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>L.</given-names>
            <surname>Specia</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Motta</surname>
          </string-name>
          .
          <article-title>Integrating folksonomies with the semantic web</article-title>
          .
          <source>In Proc. of 4th Eur. Semantic Web Conference (ESWC</source>
          <year>2007</year>
          ), Innsbruck, Austria, volume
          <volume>4519</volume>
          of Lecture Notes in Computer Science. Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>