<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multilingual Ontology-based User Profile Enrichment</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ernesto William De Luca</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Till Plumbaum</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Je´ roˆ me Kunegis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sahin Albayrak</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>RDF/OWL</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Multilingualism</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>EuroWordNet</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Multilingual Semantic Web, User Modeling</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DAI Lab, Technische Universit a ̈t Berlin Ernst-Reuter-Platz 7</institution>
          ,
          <addr-line>10587 Berlin</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <fpage>41</fpage>
      <lpage>42</lpage>
      <abstract>
        <p>In this paper, we discuss the possibility of enriching user proles with multilingual information. Nowadays, the English language is the de facto standard language of commerce and science, however users can speak and interact also in other languages. This brings up the need of enriching the user proles with multilingual information. Therefore, we propose to combine ontology-based user modeling with the information included in the RDF/OWL EuroWordNet hierarchy. In this way, we can personalize retrieval results according to user preferences, ltering relevant information taking into account the multilingual background of the user.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>At present most of the demand for text retrieval is well
satis ed by monolingual systems, because the English
language is the de facto standard language of commerce and
science. However, there is a wide variety of circumstances in
which a reader might nd multilingual retrieval techniques
useful. Being able to read a document in a foreign language
does not always imply that a person can formulate
appropriate queries in that language as well. Furthermore, dealing
with polysemic words seems to be more di cult in
multilingual than in monolingual retrieval tasks.</p>
      <p>Every text retrieval approach has two basic components:
the rst for representing texts (queries and documents) and
the other for their comparison. This automated process is
successful when its results are similar to those produced by
human comparison between queries and documents. Queries
and documents often di er from its length however. While
the query is often quite short, documents might be up to
hundreds of pages long. Moreover, users frequently adopt a
vocabulary that is not contained in the documents, known
as the paraphrase problem.</p>
      <p>
        Multilingual Retrieval. When working in a multilingual
environment, words have to be disambiguated both in the
native and in the other languages. In this case the
combination of multilingual text retrieval and word sense
disambiguation (WSD) approaches is crucial [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In order to
retrieve the same concept in di erent languages, some
relations between the searched concept and its translations
have to be built. WSD is used to convert relations between
words into relations between concepts; sense
disambiguation can be acquired for words, but it is more di cult for
documents. To have accurate WSD, we need a larger
coverage of semantic and linguistic knowledge than is available
in current lexical resources.
      </p>
      <p>
        Because we focus on multilingual concepts, we decided to
use EuroWordNet [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], a variant of the most well-known
available lexical database WordNet. In previous work, we
extended the RDF/OWL WordNet representation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] for
multilingualism, leading to our own RDF/OWL EuroWordNet
representation [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Ontology-based User Modeling. With the advent of the
Web 2.0 and the growing impact of the Internet on our
every day life, people start to use more and more di erent web
applications. They manage their bookmarks in social
bookmarking systems, communicate with friends on Facebook1
and use services like Twitter2 to express personal opinions
and interests. Thereby, they generate and distribute
personal and social information like interests, preferences and
goals [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. This distributed and heterogeneous corpus of user
information, stored in the user model (UM) of each
application, is a valuable source of knowledge for adaptive systems
like information ltering services. These systems can utilize
such knowledge for personalizing search results, recommend
products or adapting the user interface to user preferences.
Adaptive systems are highly needed, because the amount of
information available on the Web is increasing constantly,
requiring more and more e ort to be adequately managed by
the users. Therefore, these systems need more and more
information about users interests, preferences, needs and goals
and as precise as possible. However, this personal and
social information stored in the distributed UMs usually exists
in di erent languages due to the fact that we communicate
with friends all over the world. Also, today's adaptive
systems are usually part of web applications and typically only
have access to the information stored in that speci c
ap1http://www.facebook.com/
2http://twitter.com/
plication. Therefore, we enhance the user model
aggregation process by adding valuable and important meta-data
which leads to better user models and thus to better
adaptive systems. For this reason, we propose a combination of
RDF/OWL EuroWordNet within ontology-based
aggregation techniques.
      </p>
    </sec>
    <sec id="sec-2">
      <title>PROPOSED SEMANTIC</title>
    </sec>
    <sec id="sec-3">
      <title>USER MODELING AGGREGATION</title>
      <p>RDF/OWL EuroWordNet opens new possibilities for
overcoming the problem of language heterogeneity in di erent
user models and thus allows a better user modeling
aggregation. Therefore, we propose an ontology-based user
modelling approach that combines mediator techniques to
aggregate user models from di erent applications and utilize the
EuroWordNet information to handle the multilingual
information in the models. Based on this idea, we de ne some
requirements that we have to ful ll.</p>
      <p>Requirement 1: Ontology-based pro le aggregation. We
need an approach to aggregate information that is both
application independent and application overarching. This
requires a solution that allows us to semantically de ne
relations and coherences between di erent attributes of di
erent UMs. The linked attributes must be easily accessible by
applications such as recommender and information ltering
systems. In addition, similarity must be expressed in these
de ned relations.</p>
      <p>Requirement 2: Integrating semantic knowledge. A
solution to handle the multilingual information for enriching
user pro les is needed. Hence, we introduce a method to
incorporate information from semantic data sources such as
EuroWordNet and to aggregate complete pro le
information. We decided to use an ontology as the conceptual
basis of our approach to meet the rst requirement explained
above. Therefore a meta-ontology is used to link attributes
of di erent UMs that contain equal or similar content.</p>
      <p>The de nition of a meta-model based on the meta-ontology
can be divided into two steps. First, we de ne a concrete
meta-model for a speci c domain we want to work with, such
as music, movies or personal information. The meta-model
can be an already existing model, like FOAF3 or a
proprietary model that only certain applications understand.
Next, we decribe how to connect multilingual attribute
information stored in di erent user models.</p>
    </sec>
    <sec id="sec-4">
      <title>MULTILINGUAL ONTOLOGY-BASED</title>
    </sec>
    <sec id="sec-5">
      <title>AGGREGATION</title>
      <p>
        To enrich the user model with multilingual information, as
described above, we decided to utilize the knowledge
available in RDF/OWL EuroWordNet [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. We want to leverage
this information and use it for a more precise and
qualitatively better user modeling. We treat the semantic external
resources as a huge semantic pro le that can be used to
enrich the user model and add valuable extra information (see
Figure 1). The aggregation of information into semantic
pro les and user models is performed similarly to the
approach described in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], by using components that mediate
between the di erent models. We extend this approach by
using a combined user model, aggregated with the proposed
ontology.
has !"#$%&amp;(
      </p>
      <p>UserModels
!"#$%&amp;'
!"#$%&amp;)</p>
      <p>AggregatedProfile</p>
      <p>Atrib
ute</p>
      <p>Atrib
ute
Atrib
ute</p>
      <p>Atrib
ute
ride
move drive
condu
icir
cgaabral mover</p>
      <p>Interlingual
Index
rijden
guidare
andare
gaan
berijd
en
caval
care
RDF/OWLEuroWordNet</p>
      <p>To use the information contained in RDF/OWL
EuroWordNet, we developed a framework that allows us to
dene several mediators that take the information from user
models and trigger di erent sources in the Semantic Web
for more information. These mediators are specialized
components that read a user model and collect additional data
from an external source.
4.</p>
    </sec>
    <sec id="sec-6">
      <title>CONCLUSION</title>
      <p>
        In this paper, we presented the possibility of enriching
user pro les with information included in the RDF/OWL
EuroWordNet hierarchy to better lter results during the
search process. This aggregated information can be used in
our multilingual semantic information retrieval system that
has been described in more details in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In this work, we
have shown that we can handle the high heterogeneity of
distributed data, especially concerning multilingual
heterogeneity, using aggregated user pro les that have been
enriched with information contained in the RDF/OWL
EuroWordNet representation. This gives us the possibility to
personalize retrieval results according to user preferences,
ltering relevant information taking into account the
multilingual background of the user.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Shlomo</given-names>
            <surname>Berkovsky</surname>
          </string-name>
          ,
          <article-title>Tsvi Ku ik, and Francesco Ricci. Mediation of user models for enhanced personalization in recommender systems</article-title>
          .
          <source>User Modeling</source>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>18</volume>
          (
          <issue>3</issue>
          ):
          <volume>245</volume>
          {
          <fpage>286</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Ernesto</given-names>
            <surname>William De Luca</surname>
          </string-name>
          .
          <article-title>Semantic Support in Multilingual Text Retrieval</article-title>
          . Shaker Verlag, Aachen, Germany,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Ernesto</given-names>
            <surname>William De Luca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Martin</given-names>
            <surname>Eul</surname>
          </string-name>
          , and
          <article-title>Andreas Nurnberger. Converting EuroWordNet in OWL and extending it with domain ontologies</article-title>
          .
          <source>In Proc. Workshop on Lexical-semantic and Ontological Resources</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Till</given-names>
            <surname>Plumbaum</surname>
          </string-name>
          , Tino Stelter, and
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Korth</surname>
          </string-name>
          .
          <article-title>Semantic web usage mining: Using semantics to understand user intentions</article-title>
          .
          <source>In Proc. Conf. on User Modeling, Adaptation and Personalization</source>
          , pages
          <volume>391</volume>
          {
          <fpage>396</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Mark van Assem</given-names>
            ,
            <surname>Aldo Gangemi</surname>
          </string-name>
          , and Guus Schreiber.
          <article-title>WordNet in RDFS and OWL</article-title>
          .
          <source>Technical report, W3C</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Piek</given-names>
            <surname>Vossen</surname>
          </string-name>
          . Eurowordnet general document, version 3, nal,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>