=Paper= {{Paper |id=None |storemode=property |title=Towards a Multilingual Semantic Folksonomy |pdfUrl=https://ceur-ws.org/Vol-585/paper6.pdf |volume=Vol-585 }} ==Towards a Multilingual Semantic Folksonomy== https://ceur-ws.org/Vol-585/paper6.pdf
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                         Towards a Multilingual Semantic Folksonomy

                              Murad Magableh, Antonio Cau, Hussein Zedan, Martin Ward
                                  Software Technology Research Laboratory (STRL)
                                               Faculty of Technology
                                               De Montfort University
                                          The Gateway, Leicester LE1 9BH
                                                  United Kingdom
                                     {mmurad, cau, hzedan, mward}@dmu.ac.uk


                            Abstract. The content of collaborative tagging systems (so-called folk-
                            sonomies) is generated, consumed, and annotated by the end users. Users
                            annotate and categorise their data using free-keywords, so-called tags.
                            Consequently, several linguistic problems come to the surface in folk-
                            sonomies such as; synonyms, polysemy, multilinguality, and others which
                            produce ambiguous and inconsistent classification of data. Therefore,
                            relevant results are not retrieved in the user’s query. In this paper, we
                            suggest a novel approach to enhance the “social vocabulary” presented
                            in folksonomies with the “controlled vocabulary” presented in Seman-
                            tic Web ontologies. Therefore, our proposed approach uses the online
                            WordNet lexical ontology in addition to the EuroWordNet multilingual
                            lexical resource. Our approach tries to employ the ontological relations
                            presented in WordNet in the folksonomy, it focuses on the problems of
                            synonyms, tag relations, and multilinguality.
                            Keywords: Social Web, Semantic Web, Collaborative Tagging System,
                            Folksonomy, Ontology, WordNet, EuroWordNet.


                     1    Introduction
                     By introducing Web 2.0 (Social Web), end-users became at the heart of Web
                     content generation and classification processes. In collaborative tagging systems
                     (folksonomies), users generate contents and they use free-text keywords, so-called
                     tags, to classify their contents. Therefore, users create metadata as well as data.
                     This new approach of data categorisation and metadata creation is simple, easy,
                     fast, low cost, and flexible compared to traditional metadata creation process
                     by professionals and authors. Furthermore, it dynamically reflects the emergent
                     vocabulary used among online social communities. Nevertheless, lack of seman-
                     tics among data in such communities represents a real challenge regarding the
                     information retrieval.
                         The ethos of Semantic Web vision is to represent the data in such a way that
                     computers can understand. Thus, Semantic Web ontologies offer an efficient re-
                     source of structured data that can be exploited by the Social Web. Together,
                     Social Web and Semantic Web can produce a harmonised duet.
                         Section 2 is devoted for the challenges of folksonomies. We demonstrate our




7th Extended Semantic Web Conference (ESWC 2010)                                                           Page 51 of 64
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                     approach in Section 3, followed by a discussion in Section 4. In Section 5, we
                     review some related work, and conclude in Section 6.


                     2     Challenges of Folksonomies

                     By analysing the current collaborative tagging systems, we can notice that the
                     main problems are ambiguity, inconsistency, and redundancy problems [1, 2, 3,
                     4]. This is normal since the collaborative tagging systems (by their nature) are
                     shared by many users. These users came from different backgrounds, cultures,
                     countries, domains, and tongues. The diversity of the users’ behaviours would
                     inevitably create inconsistent tags that would give ambiguous identification of
                     the tagged objects.
                          The ambiguity and inconsistency of the tags in folksonomies emerge mainly
                     because of linguistics reasons such as; word synonyms [1, 2, 3, 5, 6, 7], polysemy
                     (homonym) [1, 2, 5, 6, 7], different lexical forms [2, 5, 6, 7], alternative spellings
                     [2], misspelling errors [1, 2], and use of different languages [4, 8, 9]. When search-
                     ing the folksonomy, these problems cause irrelevant result to be retrieved, and
                     relevant results not to be retrieved. Our concern in this paper is the latter case.


                     3     Our Approach

                     As aforementioned, we focus in our approach on synonyms, multilinguality, and
                     initiating relations among tags in folksonomy based on the semantic relations
                     existing in the ontology. Since all these challenges are lexical ones, the best choice
                     is to use the lexical ontology WordNet. WordNet is a lexical ontology which has
                     set of synonym words, called synset, that defines a particular concept. It includes
                     a lot of lexical and semantic relations between words and synsets. It is restricted
                     to no specific domain and covers all common parts of speech; nouns, adjectives,
                     verbs and adverbs [10].


                     3.1   Synonyms

                     Usually, when a user is tagging, (s)he is not aware of all synonyms for the tags
                     (s)he uses. If the tagger is English, (s)he will use the word “lift” whilst the
                     American one will use the word “elevator ” to describe the lifting device used
                     to move people from one floor to another in a building. Also, when we want to
                     express the beauty of something, we will use words (synonyms) like “beautiful ”,
                     “pretty”, and maybe “gorgeous”. Always we miss some of the synonyms. In the
                     first example, if the tag that was used is “lift”, the future search will retrieve
                     nothing if we use the word “elevator ” as a search keyword.
                         Our idea is to add “system tags” every time the user adds tags. The sys-
                     tem tags will be added automatically by the collaborative tagging system by
                     consulting the WordNet ontology, these tags are all the existing synonyms in
                     WordNet for the “user tags”. Figure 1 shows subset of the synonyms set that




7th Extended Semantic Web Conference (ESWC 2010)                                                              Page 52 of 64
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                     Fig. 1. Some Synonyms for The Word “Beautiful ” Obtained from WordNet On-
                     tology.


                     can be added by WordNet ontology for the tag “beautiful ”. When the user adds
                     the tag “beautiful ”, the system will add all related synonyms from the WordNet.
                     Future search using any of the synonyms added by the system (system tags) will
                     be able to retrieve the tagged object. Thus, it ensures the retrieval of relevant
                     results.

                     3.2   Tags Relations
                     Imagine if a user tagged a resource as “poultry”. Poultry is indeed kind of meat
                     and it is expected to be retrieved when searching using the keyword “meat”
                     because it is relevant to the search keyword. Unfortunately, it will not be in
                     the result set since this word is not in the tags set for that resource. The same
                     problem is faced again; relevant results are not being retrieved due to lack of
                     semantics in the folksonomy.
                        The WordNet ontology has such a semantic relations among words. Figure 2
                     shows a part of the WordNet ontology. The system will add the synonyms of the
                     “poultry” (gallinacean, fowl ). Also it will add the parent of that word (meat)
                     and its synonym (flesh) as system tags. Therefore, anyone who searches using
                     the keyword “meat” will retrieve the resource originally tagged with “poultry”.

                     3.3   Multilinguality
                     So far, the tagged resource is accessible and visible only if the search keywords
                     are English words. If a non-English speaker is searching using non-English key-
                     words, nothing will be retrieved. If an Italian is searching using the word “bello”
                     (it means: beautiful), the tagged resource in the previous example will seem as
                     irrelevant and thus will not be retrieved. As humans, we can see clearly that it
                     is relevant, but the machines do not.
                         As a solution for multilinguality problem, we will use the EuroWordNet. Eu-
                     roWordNet relates and unites WordNets in different European languages (Dutch,
                     Spanish, Italian, German, French, Czeck, and Estonian) in a single multilingual
                     lexical resource, and it links them to the English WordNet [11].




7th Extended Semantic Web Conference (ESWC 2010)                                                           Page 53 of 64
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                                    Fig. 2. Part of WordNet for The Word “Poultry”.


                        We propose that EuroWordNet will find the equivalent words for the tag
                     “beautiful ” in the abovementioned languages using so-called Inter-Lingual-Index
                     (ILI). These equivalent words (In addition to their synonyms and parent words
                     as aforementioned) will be added as system tags. This guarantees that future
                     searches by non-English speakers using their own languages will retrieve the rele-
                     vant resource even if these resources were tagged originally by only English tags,
                     and vice versa.


                     4    Discussion

                     The proposed approach requires replicating the WordNet and EuroWordNet
                     words and storing them in the folksonomy as system tags. This redundancy of
                     data is justified in the following paragraphs.
                         Alternatively, we can avoid adding system tags at tagging time by consulting
                     and deducing the relations from the lexical resources at search time. In the case
                     of synonyms in the previous example, when the user uses the keyword “pretty”
                     in the search, the system will send it to the WordNet. The WordNet will send
                     all the found synonyms to the folksonomy, and thus all objects that are tagged
                     by any of these synonyms will be retrieved (See Figure 1).
                         This communication between the folksonomy and the ontology and the search-
                     ing process inside the ontology itself is time consuming while the user is waiting
                     for a response. We have the choice either to save time or to save space. Time is
                     the critical factor in such a case.
                         Our proposal needs a software agent that is responsible of reflecting any
                     prospective future changes in the online lexical resources on the folksonomy to
                     keep the system tags in the folksonomy up-to-date.




7th Extended Semantic Web Conference (ESWC 2010)                                                          Page 54 of 64
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                     5    Related Work

                     Many researchers have tried to address the abovementioned challenges of folk-
                     sonomies using different approaches. One of these approaches was to use the
                     power of the Semantic Web in decreasing the ambiguity an inconsistency of
                     tags. If we have a glance at these attempts, we can see that there are still many
                     gaps to fill.
                         In [8], tags are filtered and normalised, then these tags will be adhered to
                     different domain ontologies’ concepts, and only the terms that appear in the on-
                     tologies will be selected. In this method they remove some users’ tags which re-
                     flect part of the users’ understanding of the tagged object. Moreover, the changes
                     in the users vocabulary will not be reflected in the semantic ontologies.
                         In [12], they correct the misspelled tags and group the similar tags together,
                     and then the tags are mapped to online ontologies. This method then replaces
                     some tags with corresponding concepts in the online ontologies. We argue that
                     the interference in users’ tags will conflict with the ethos of folksonomies (free-
                     keywords).
                         In [7], they developed their own folksonomy system using domain-specific
                     ontology and WordNet ontology. They detect the domain of the most popular
                     tags, and then they manually build an ontology for that domain. The problem
                     in this method is the necessity of building the domains ontologies, even worse;
                     the domain ontology should be built manually.
                         In [13], they used the WordNet concepts’ relations to show the user an addi-
                     tional panel on his browser’s interface. This extra visualisation displays related
                     tags organised according to a semantic criterion to facilitate navigation and
                     searching in the folksonomy. It is only visualisation nothing more and some tags
                     were not recognised in the lexicon.
                         In [14], they map the unstructured tags to more structured domain ontologies.
                     These ontologies are used for refining the queries to combine results of different
                     tag-based systems. This method uses an ontology-based navigation interface al-
                     lowing the user to retrieve more related results through graphical navigation of
                     the ontology concepts. This method can not deal with unmatched tags; which
                     are the tags that do not exist in the domain ontologies.
                         In [2], they use WordNet and Wikipedia to substitute semantic assertions for
                     the current tags. These assertions are not simple strings to describe a particu-
                     lar resource; each semantic assertion describes a specific property of a resource.
                     Therefore, the possibility of tagging using free words is absent which contradicts
                     the ethos of folksonomy.
                         In [15], they apply both syntactic and semantic techniques for connecting
                     tag to ontologies in order to get more semantics about the tag and provide tag
                     suggestions for the users. This method, in addition to offering suggestions to the
                     users, asks the users to give feedback about these suggestions. Hence, we argue
                     that it puts more effort on the users’ side to improve the quality of the tags
                     by changing the conventional way by which the users used to interact with the
                     folksonomy.




7th Extended Semantic Web Conference (ESWC 2010)                                                          Page 55 of 64
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                     6    Conclusion
                     Folksonomies lack semantics among users’ tags which causes relevant results not
                     to be retrieved. Semantic Web ontologies are considered a rich source for se-
                     mantic relations that, if exploited properly, will improve the searching process
                     in folksonomies. Our approach focused on addressing the problems of synonyms,
                     semantic relations among tags, and multilinguality. It is based on the idea of
                     adding system tags as complements to the user tags for a wider coverage of po-
                     tential future search keywords, therefore, more relevant results will be retrieved.


                     7    Future Work
                     In the future, this proposal will be implemented therefore more empirical results
                     will follow.
                         EuroWordNet is limited to only some European languages. Our approach is
                     extendable to other languages by using intermediate online dictionaries. These
                     dictionaries might be used to translate from one WordNet to another for lan-
                     guages that are not included in EuroWordNet (e.g. from English WordNet to
                     Arabic WordNet).
                         A unifying architecture for collaborative tagging systems is under construc-
                     tion. This architecture includes clustering techniques to address the problem of
                     shorthands usage in tagging. Such tags are written using special words that do
                     not belong to any language. Therefore, the best choice is to consult the social
                     networks to predict their meanings.


                     References
                       [1] Li, Q., Lu, S.C.Y.: Collaborative tagging applications and approaches. Multimedia
                           15(3) (2008) 14–21
                       [2] Marchetti, A., Tesconi, M., Ronzano, F., Rosella, M., Minutoli, S.: Semkey: A
                           semantic collaborative tagging system. In: Proceedings WWW 2007 Workshop
                           on Tagging and Metadata for Social Information Organisation. (2007)
                       [3] Mathes, A.: Folksonomies-cooperative classification and communication through
                           shared metadata. Computer Mediated Communication - LIS590CMC (2004)
                       [4] Angeletou, S., Sabou, M., Motta, E.: Semantically enriching folksonomies with
                           flor. In: European Semantic Web Conference Workshop: CISWeb. (2008)
                       [5] Dix, A., Levialdi, S., Malizia, A.: Semantic Halo for collaboration tagging sys-
                           tems. In: Workshop on the Social Navigation and Community based Adaptation
                           Technologies. (2006)
                       [6] Golder, S.A., Huberman, B.A.: Usage patterns of collaborative tagging systems.
                           Journal of Information Science 32(2) (2006) 198–208
                       [7] Lee, S.S., Yong, H.S.: Ontosonomy: Ontology-based extension of folksonomy. In:
                           Proceedings of the 2008 IEEE International Workshop on Semantic Computing
                           and Applications. (2008) 27–32
                       [8] Al-Khalifa, H., Davis, H.: FolksAnnotation: A semantic metadata tool for annotat-
                           ing learning resources using folksonomies and domain ontologies. In: Innovations
                           in Information Technology. (2006)




7th Extended Semantic Web Conference (ESWC 2010)                                                               Page 56 of 64
1st International Workshop on Adaptation, Personalization and REcommendation in the Social-semantic Web (APRESW 2010)




                      [9] Zamora, F., Nistal, M.: Visualising tags as a network of relatedness. In: 39th
                          ASEE/IEEE Frontiers in Education Conference. (2009)
                     [10] Morato, J., Marzal, M.N., Llorns, J., Moreiro, J.: WordNet applications. In:
                          Proceeding of the Second Global WordNet Conference. (2004)
                     [11] Vossen, P.: WordNet, EuroWordNet and global WordNet. Revue Franaise de
                          Linguistique Appliquee / RFLA 7(1) (2002)
                     [12] Ghali, F. Sharp, M., Cristea, A.: Folksonomies and ontologies in authoring of
                          adaptive hypermedia. In: A3H 6th International Workshop on Authoring of Adap-
                          tive and Adaptable Hypermedia Workshop. (2008)
                     [13] Laniado, D., Eynard, D., Colombetti, M.: Using WordNet to turn a folksonomy
                          into a hierarchy of concepts. In: Semantic Web Application and Perspectives -
                          Fourth Italian Semantic Web Workshop. (2007)
                     [14] Bindelli, S., Criscione, C., Curino, C.A., Drago, M.L., Eynard, D., Orsi, G.: Im-
                          proving search and navigation by combining ontologies and social tags. In: 1st
                          International Workshop on Ambient Data Integration. (2008)
                     [15] Sluijs, K., Houben, G.J.: Relating user tags to ontological information. In: Pro-
                          ceedings of 5th International Workshop on Ubiquitous User Modeling. (2008)




7th Extended Semantic Web Conference (ESWC 2010)                                                              Page 57 of 64