<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards a Multilingual Semantic Folksonomy</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Murad Magableh, Antonio Cau, Hussein Zedan, Martin Ward Software Technology Research Laboratory (STRL) Faculty of Technology De Montfort University The Gateway</institution>
          ,
          <addr-line>Leicester LE1 9BH</addr-line>
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <fpage>2</fpage>
      <lpage>8</lpage>
      <abstract>
        <p>The content of collaborative tagging systems (so-called folksonomies) is generated, consumed, and annotated by the end users. Users annotate and categorise their data using free-keywords, so-called tags. Consequently, several linguistic problems come to the surface in folksonomies such as; synonyms, polysemy, multilinguality, and others which produce ambiguous and inconsistent classification of data. Therefore, relevant results are not retrieved in the user's query. In this paper, we suggest a novel approach to enhance the “social vocabulary” presented in folksonomies with the “controlled vocabulary” presented in Semantic Web ontologies. Therefore, our proposed approach uses the online WordNet lexical ontology in addition to the EuroWordNet multilingual lexical resource. Our approach tries to employ the ontological relations presented in WordNet in the folksonomy, it focuses on the problems of synonyms, tag relations, and multilinguality.</p>
      </abstract>
      <kwd-group>
        <kwd>Social Web</kwd>
        <kwd>Semantic Web</kwd>
        <kwd>Collaborative Tagging System</kwd>
        <kwd>Folksonomy</kwd>
        <kwd>Ontology</kwd>
        <kwd>WordNet</kwd>
        <kwd>EuroWordNet</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>approach in Section 3, followed by a discussion in Section 4. In Section 5, we
review some related work, and conclude in Section 6.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Challenges of Folksonomies</title>
      <p>
        By analysing the current collaborative tagging systems, we can notice that the
main problems are ambiguity, inconsistency, and redundancy problems [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1, 2, 3,
4</xref>
        ]. This is normal since the collaborative tagging systems (by their nature) are
shared by many users. These users came from different backgrounds, cultures,
countries, domains, and tongues. The diversity of the users’ behaviours would
inevitably create inconsistent tags that would give ambiguous identification of
the tagged objects.
      </p>
      <p>
        The ambiguity and inconsistency of the tags in folksonomies emerge mainly
because of linguistics reasons such as; word synonyms [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref5 ref6 ref7">1, 2, 3, 5, 6, 7</xref>
        ], polysemy
(homonym) [
        <xref ref-type="bibr" rid="ref1 ref2 ref5 ref6 ref7">1, 2, 5, 6, 7</xref>
        ], different lexical forms [
        <xref ref-type="bibr" rid="ref2 ref5 ref6 ref7">2, 5, 6, 7</xref>
        ], alternative spellings
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], misspelling errors [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ], and use of different languages [
        <xref ref-type="bibr" rid="ref4 ref8 ref9">4, 8, 9</xref>
        ]. When
searching the folksonomy, these problems cause irrelevant result to be retrieved, and
relevant results not to be retrieved. Our concern in this paper is the latter case.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Our Approach</title>
      <p>
        As aforementioned, we focus in our approach on synonyms, multilinguality, and
initiating relations among tags in folksonomy based on the semantic relations
existing in the ontology. Since all these challenges are lexical ones, the best choice
is to use the lexical ontology WordNet. WordNet is a lexical ontology which has
set of synonym words, called synset, that defines a particular concept. It includes
a lot of lexical and semantic relations between words and synsets. It is restricted
to no specific domain and covers all common parts of speech; nouns, adjectives,
verbs and adverbs [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
3.1
      </p>
      <sec id="sec-3-1">
        <title>Synonyms</title>
        <p>Usually, when a user is tagging, (s)he is not aware of all synonyms for the tags
(s)he uses. If the tagger is English, (s)he will use the word “lift ” whilst the
American one will use the word “elevator ” to describe the lifting device used
to move people from one floor to another in a building. Also, when we want to
express the beauty of something, we will use words (synonyms) like “beautiful ”,
“pretty ”, and maybe “gorgeous”. Always we miss some of the synonyms. In the
first example, if the tag that was used is “lift ”, the future search will retrieve
nothing if we use the word “elevator ” as a search keyword.</p>
        <p>Our idea is to add “system tags” every time the user adds tags. The
system tags will be added automatically by the collaborative tagging system by
consulting the WordNet ontology, these tags are all the existing synonyms in
WordNet for the “user tags”. Figure 1 shows subset of the synonyms set that
can be added by WordNet ontology for the tag “beautiful ”. When the user adds
the tag “beautiful ”, the system will add all related synonyms from the WordNet.
Future search using any of the synonyms added by the system (system tags) will
be able to retrieve the tagged object. Thus, it ensures the retrieval of relevant
results.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Tags Relations</title>
        <p>Imagine if a user tagged a resource as “poultry ”. Poultry is indeed kind of meat
and it is expected to be retrieved when searching using the keyword “meat ”
because it is relevant to the search keyword. Unfortunately, it will not be in
the result set since this word is not in the tags set for that resource. The same
problem is faced again; relevant results are not being retrieved due to lack of
semantics in the folksonomy.</p>
        <p>The WordNet ontology has such a semantic relations among words. Figure 2
shows a part of the WordNet ontology. The system will add the synonyms of the
“poultry ” (gallinacean, fowl ). Also it will add the parent of that word (meat )
and its synonym (flesh) as system tags. Therefore, anyone who searches using
the keyword “meat ” will retrieve the resource originally tagged with “poultry ”.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Multilinguality</title>
        <p>So far, the tagged resource is accessible and visible only if the search keywords
are English words. If a non-English speaker is searching using non-English
keywords, nothing will be retrieved. If an Italian is searching using the word “bello”
(it means: beautiful), the tagged resource in the previous example will seem as
irrelevant and thus will not be retrieved. As humans, we can see clearly that it
is relevant, but the machines do not.</p>
        <p>
          As a solution for multilinguality problem, we will use the EuroWordNet.
EuroWordNet relates and unites WordNets in different European languages (Dutch,
Spanish, Italian, German, French, Czeck, and Estonian) in a single multilingual
lexical resource, and it links them to the English WordNet [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>We propose that EuroWordNet will find the equivalent words for the tag
“beautiful ” in the abovementioned languages using so-called Inter-Lingual-Index
(ILI). These equivalent words (In addition to their synonyms and parent words
as aforementioned) will be added as system tags. This guarantees that future
searches by non-English speakers using their own languages will retrieve the
relevant resource even if these resources were tagged originally by only English tags,
and vice versa.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>The proposed approach requires replicating the WordNet and EuroWordNet
words and storing them in the folksonomy as system tags. This redundancy of
data is justified in the following paragraphs.</p>
      <p>Alternatively, we can avoid adding system tags at tagging time by consulting
and deducing the relations from the lexical resources at search time. In the case
of synonyms in the previous example, when the user uses the keyword “pretty ”
in the search, the system will send it to the WordNet. The WordNet will send
all the found synonyms to the folksonomy, and thus all objects that are tagged
by any of these synonyms will be retrieved (See Figure 1).</p>
      <p>This communication between the folksonomy and the ontology and the
searching process inside the ontology itself is time consuming while the user is waiting
for a response. We have the choice either to save time or to save space. Time is
the critical factor in such a case.</p>
      <p>Our proposal needs a software agent that is responsible of reflecting any
prospective future changes in the online lexical resources on the folksonomy to
keep the system tags in the folksonomy up-to-date.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Related Work</title>
      <p>Many researchers have tried to address the abovementioned challenges of
folksonomies using different approaches. One of these approaches was to use the
power of the Semantic Web in decreasing the ambiguity an inconsistency of
tags. If we have a glance at these attempts, we can see that there are still many
gaps to fill.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], tags are filtered and normalised, then these tags will be adhered to
different domain ontologies’ concepts, and only the terms that appear in the
ontologies will be selected. In this method they remove some users’ tags which
reflect part of the users’ understanding of the tagged object. Moreover, the changes
in the users vocabulary will not be reflected in the semantic ontologies.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], they correct the misspelled tags and group the similar tags together,
and then the tags are mapped to online ontologies. This method then replaces
some tags with corresponding concepts in the online ontologies. We argue that
the interference in users’ tags will conflict with the ethos of folksonomies
(freekeywords).
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], they developed their own folksonomy system using domain-specific
ontology and WordNet ontology. They detect the domain of the most popular
tags, and then they manually build an ontology for that domain. The problem
in this method is the necessity of building the domains ontologies, even worse;
the domain ontology should be built manually.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], they used the WordNet concepts’ relations to show the user an
additional panel on his browser’s interface. This extra visualisation displays related
tags organised according to a semantic criterion to facilitate navigation and
searching in the folksonomy. It is only visualisation nothing more and some tags
were not recognised in the lexicon.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], they map the unstructured tags to more structured domain ontologies.
These ontologies are used for refining the queries to combine results of different
tag-based systems. This method uses an ontology-based navigation interface
allowing the user to retrieve more related results through graphical navigation of
the ontology concepts. This method can not deal with unmatched tags; which
are the tags that do not exist in the domain ontologies.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], they use WordNet and Wikipedia to substitute semantic assertions for
the current tags. These assertions are not simple strings to describe a
particular resource; each semantic assertion describes a specific property of a resource.
Therefore, the possibility of tagging using free words is absent which contradicts
the ethos of folksonomy.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], they apply both syntactic and semantic techniques for connecting
tag to ontologies in order to get more semantics about the tag and provide tag
suggestions for the users. This method, in addition to offering suggestions to the
users, asks the users to give feedback about these suggestions. Hence, we argue
that it puts more effort on the users’ side to improve the quality of the tags
by changing the conventional way by which the users used to interact with the
folksonomy.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Folksonomies lack semantics among users’ tags which causes relevant results not
to be retrieved. Semantic Web ontologies are considered a rich source for
semantic relations that, if exploited properly, will improve the searching process
in folksonomies. Our approach focused on addressing the problems of synonyms,
semantic relations among tags, and multilinguality. It is based on the idea of
adding system tags as complements to the user tags for a wider coverage of
potential future search keywords, therefore, more relevant results will be retrieved.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Future Work</title>
      <p>In the future, this proposal will be implemented therefore more empirical results
will follow.</p>
      <p>EuroWordNet is limited to only some European languages. Our approach is
extendable to other languages by using intermediate online dictionaries. These
dictionaries might be used to translate from one WordNet to another for
languages that are not included in EuroWordNet (e.g. from English WordNet to
Arabic WordNet).</p>
      <p>A unifying architecture for collaborative tagging systems is under
construction. This architecture includes clustering techniques to address the problem of
shorthands usage in tagging. Such tags are written using special words that do
not belong to any language. Therefore, the best choice is to consult the social
networks to predict their meanings.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
          </string-name>
          , S.C.Y.:
          <article-title>Collaborative tagging applications and approaches</article-title>
          .
          <source>Multimedia</source>
          <volume>15</volume>
          (
          <issue>3</issue>
          ) (
          <year>2008</year>
          )
          <fpage>14</fpage>
          -
          <lpage>21</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Marchetti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tesconi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ronzano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosella</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Minutoli</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Semkey: A semantic collaborative tagging system</article-title>
          .
          <source>In: Proceedings WWW 2007 Workshop on Tagging and Metadata for Social Information Organisation</source>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Mathes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Folksonomies-cooperative classification and communication through shared metadata</article-title>
          .
          <source>Computer Mediated Communication - LIS590CMC</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Angeletou</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
          </string-name>
          , E.:
          <article-title>Semantically enriching folksonomies with flor</article-title>
          .
          <source>In: European Semantic Web Conference Workshop: CISWeb</source>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Dix</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levialdi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Malizia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Semantic Halo for collaboration tagging systems</article-title>
          .
          <source>In: Workshop on the Social Navigation and Community based Adaptation Technologies</source>
          . (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Golder</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huberman</surname>
            ,
            <given-names>B.A.</given-names>
          </string-name>
          :
          <article-title>Usage patterns of collaborative tagging systems</article-title>
          .
          <source>Journal of Information Science</source>
          <volume>32</volume>
          (
          <issue>2</issue>
          ) (
          <year>2006</year>
          )
          <fpage>198</fpage>
          -
          <lpage>208</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yong</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          : Ontosonomy:
          <article-title>Ontology-based extension of folksonomy</article-title>
          .
          <source>In: Proceedings of the 2008 IEEE International Workshop on Semantic Computing and Applications</source>
          . (
          <year>2008</year>
          )
          <fpage>27</fpage>
          -
          <lpage>32</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Al-Khalifa</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
          </string-name>
          , H.:
          <article-title>FolksAnnotation: A semantic metadata tool for annotating learning resources using folksonomies and domain ontologies</article-title>
          .
          <source>In: Innovations in Information Technology</source>
          . (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Zamora</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nistal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Visualising tags as a network of relatedness</article-title>
          .
          <source>In: 39th ASEE/IEEE Frontiers in Education Conference</source>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Morato</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marzal</surname>
            ,
            <given-names>M.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Llorns</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moreiro</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>WordNet applications</article-title>
          .
          <source>In: Proceeding of the Second Global WordNet Conference</source>
          . (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Vossen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : WordNet, EuroWordNet and global WordNet. Revue Franaise de Linguistique Appliquee / RFLA 7(
          <issue>1</issue>
          ) (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Ghali</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Sharp</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cristea</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Folksonomies and ontologies in authoring of adaptive hypermedia</article-title>
          .
          <source>In: A3H 6th International Workshop on Authoring of Adaptive and Adaptable Hypermedia Workshop</source>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Laniado</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eynard</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Colombetti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Using WordNet to turn a folksonomy into a hierarchy of concepts</article-title>
          .
          <source>In: Semantic Web Application and Perspectives - Fourth Italian Semantic Web Workshop</source>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Bindelli</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Criscione</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Curino</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Drago</surname>
            ,
            <given-names>M.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eynard</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orsi</surname>
          </string-name>
          , G.:
          <article-title>Improving search and navigation by combining ontologies and social tags</article-title>
          .
          <source>In: 1st International Workshop on Ambient Data Integration</source>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Sluijs</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Houben</surname>
            ,
            <given-names>G.J.</given-names>
          </string-name>
          :
          <article-title>Relating user tags to ontological information</article-title>
          .
          <source>In: Proceedings of 5th International Workshop on Ubiquitous User Modeling</source>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>