<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Multilingual Ontology-based User Profile Enrichment</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><roleName>Till Plumbaum, J ér ôme</roleName><forename type="first">Ernesto</forename><forename type="middle">William</forename><surname>De Luca</surname></persName>
							<email>ernesto.deluca@dai-labor.de</email>
							<affiliation key="aff0">
								<orgName type="department">DAI Lab</orgName>
								<orgName type="institution">Technische Universit ät Berlin</orgName>
								<address>
									<addrLine>Ernst-Reuter-Platz 7</addrLine>
									<postCode>10587</postCode>
									<settlement>Berlin</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Sahin</forename><surname>Kunegis</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">DAI Lab</orgName>
								<orgName type="institution">Technische Universit ät Berlin</orgName>
								<address>
									<addrLine>Ernst-Reuter-Platz 7</addrLine>
									<postCode>10587</postCode>
									<settlement>Berlin</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><surname>Albayrak</surname></persName>
							<email>sahin.albayrak@dai-labor.de</email>
							<affiliation key="aff0">
								<orgName type="department">DAI Lab</orgName>
								<orgName type="institution">Technische Universit ät Berlin</orgName>
								<address>
									<addrLine>Ernst-Reuter-Platz 7</addrLine>
									<postCode>10587</postCode>
									<settlement>Berlin</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Multilingual Ontology-based User Profile Enrichment</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">3C1980034FBC4A49489BEF6FC64E43EF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T07:19+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing RDF/OWL</term>
					<term>Web 2.0</term>
					<term>Multilingualism</term>
					<term>EuroWordNet Multilingual Semantic Web</term>
					<term>User Modeling</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we discuss the possibility of enriching user profiles with multilingual information. Nowadays, the English language is the de facto standard language of commerce and science, however users can speak and interact also in other languages. This brings up the need of enriching the user profiles with multilingual information. Therefore, we propose to combine ontology-based user modeling with the information included in the RDF/OWL EuroWordNet hierarchy. In this way, we can personalize retrieval results according to user preferences, filtering relevant information taking into account the multilingual background of the user.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>At present most of the demand for text retrieval is well satisfied by monolingual systems, because the English language is the de facto standard language of commerce and science. However, there is a wide variety of circumstances in which a reader might find multilingual retrieval techniques useful. Being able to read a document in a foreign language does not always imply that a person can formulate appropriate queries in that language as well. Furthermore, dealing with polysemic words seems to be more difficult in multilingual than in monolingual retrieval tasks.</p><p>Every text retrieval approach has two basic components: the first for representing texts (queries and documents) and the other for their comparison. This automated process is successful when its results are similar to those produced by human comparison between queries and documents. Queries and documents often differ from its length however. While the query is often quite short, documents might be up to hundreds of pages long. Moreover, users frequently adopt a vocabulary that is not contained in the documents, known as the paraphrase problem.</p><p>Multilingual Retrieval. When working in a multilingual environment, words have to be disambiguated both in the native and in the other languages. In this case the combination of multilingual text retrieval and word sense disambiguation (WSD) approaches is crucial <ref type="bibr" target="#b2">[2]</ref>. In order to retrieve the same concept in different languages, some relations between the searched concept and its translations have to be built. WSD is used to convert relations between words into relations between concepts; sense disambiguation can be acquired for words, but it is more difficult for documents. To have accurate WSD, we need a larger coverage of semantic and linguistic knowledge than is available in current lexical resources.</p><p>Because we focus on multilingual concepts, we decided to use EuroWordNet <ref type="bibr" target="#b6">[6]</ref>, a variant of the most well-known available lexical database WordNet. In previous work, we extended the RDF/OWL WordNet representation <ref type="bibr">[5]</ref> for multilingualism, leading to our own RDF/OWL EuroWordNet representation <ref type="bibr" target="#b3">[3]</ref>.</p><p>Ontology-based User Modeling. With the advent of the Web 2.0 and the growing impact of the Internet on our every day life, people start to use more and more different web applications. They manage their bookmarks in social bookmarking systems, communicate with friends on Facebook<ref type="foot" target="#foot_0">1</ref> and use services like Twitter<ref type="foot" target="#foot_1">2</ref> to express personal opinions and interests. Thereby, they generate and distribute personal and social information like interests, preferences and goals <ref type="bibr" target="#b4">[4]</ref>. This distributed and heterogeneous corpus of user information, stored in the user model (UM) of each application, is a valuable source of knowledge for adaptive systems like information filtering services. These systems can utilize such knowledge for personalizing search results, recommend products or adapting the user interface to user preferences. Adaptive systems are highly needed, because the amount of information available on the Web is increasing constantly, requiring more and more effort to be adequately managed by the users. Therefore, these systems need more and more information about users interests, preferences, needs and goals and as precise as possible. However, this personal and social information stored in the distributed UMs usually exists in different languages due to the fact that we communicate with friends all over the world. Also, today's adaptive systems are usually part of web applications and typically only have access to the information stored in that specific ap-plication. Therefore, we enhance the user model aggregation process by adding valuable and important meta-data which leads to better user models and thus to better adaptive systems. For this reason, we propose a combination of RDF/OWL EuroWordNet within ontology-based aggregation techniques.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">PROPOSED SEMANTIC USER MODELING AGGREGATION</head><p>RDF/OWL EuroWordNet opens new possibilities for overcoming the problem of language heterogeneity in different user models and thus allows a better user modeling aggregation. Therefore, we propose an ontology-based user modelling approach that combines mediator techniques to aggregate user models from different applications and utilize the EuroWordNet information to handle the multilingual information in the models. Based on this idea, we define some requirements that we have to fulfill.</p><p>Requirement 1: Ontology-based profile aggregation. We need an approach to aggregate information that is both application independent and application overarching. This requires a solution that allows us to semantically define relations and coherences between different attributes of different UMs. The linked attributes must be easily accessible by applications such as recommender and information filtering systems. In addition, similarity must be expressed in these defined relations.</p><p>Requirement 2: Integrating semantic knowledge. A solution to handle the multilingual information for enriching user profiles is needed. Hence, we introduce a method to incorporate information from semantic data sources such as EuroWordNet and to aggregate complete profile information. We decided to use an ontology as the conceptual basis of our approach to meet the first requirement explained above. Therefore a meta-ontology is used to link attributes of different UMs that contain equal or similar content.</p><p>The definition of a meta-model based on the meta-ontology can be divided into two steps. First, we define a concrete meta-model for a specific domain we want to work with, such as music, movies or personal information. The meta-model can be an already existing model, like FOAF 3 or a proprietary model that only certain applications understand. Next, we decribe how to connect multilingual attribute information stored in different user models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">MULTILINGUAL ONTOLOGY-BASED AGGREGATION</head><p>To enrich the user model with multilingual information, as described above, we decided to utilize the knowledge available in RDF/OWL EuroWordNet <ref type="bibr" target="#b3">[3]</ref>. We want to leverage this information and use it for a more precise and qualitatively better user modeling. We treat the semantic external resources as a huge semantic profile that can be used to enrich the user model and add valuable extra information (see Figure <ref type="figure" target="#fig_0">1</ref>). The aggregation of information into semantic profiles and user models is performed similarly to the approach described in <ref type="bibr" target="#b1">[1]</ref>, by using components that mediate between the different models. We extend this approach by using a combined user model, aggregated with the proposed ontology. To use the information contained in RDF/OWL Euro-WordNet, we developed a framework that allows us to define several mediators that take the information from user models and trigger different sources in the Semantic Web for more information. These mediators are specialized components that read a user model and collect additional data from an external source.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">CONCLUSION</head><p>In this paper, we presented the possibility of enriching user profiles with information included in the RDF/OWL EuroWordNet hierarchy to better filter results during the search process. This aggregated information can be used in our multilingual semantic information retrieval system that has been described in more details in <ref type="bibr" target="#b2">[2]</ref>. In this work, we have shown that we can handle the high heterogeneity of distributed data, especially concerning multilingual heterogeneity, using aggregated user profiles that have been enriched with information contained in the RDF/OWL Euro-WordNet representation. This gives us the possibility to personalize retrieval results according to user preferences, filtering relevant information taking into account the multilingual background of the user.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>3Figure 1 :</head><label>1</label><figDesc>Figure 1: Integrating semantic knowledge about multilingual dependencies with the information stored in the user models.</figDesc></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://www.facebook.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://twitter.com/</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title/>
		<author>
			<persName><surname>References</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Mediation of user models for enhanced personalization in recommender systems</title>
		<author>
			<persName><forename type="first">Shlomo</forename><surname>Berkovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tsvi</forename><surname>Kuflik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Francesco</forename><surname>Ricci</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">User Modeling and User-Adapted Interaction</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="245" to="286" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Semantic Support in Multilingual Text Retrieval</title>
		<author>
			<persName><forename type="first">Ernesto</forename><surname>William</surname></persName>
		</author>
		<author>
			<persName><forename type="first">De</forename><surname>Luca</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<publisher>Shaker Verlag</publisher>
			<pubPlace>Aachen, Germany</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Converting EuroWordNet in OWL and extending it with domain ontologies</title>
		<author>
			<persName><forename type="first">Ernesto</forename><surname>William</surname></persName>
		</author>
		<author>
			<persName><forename type="first">De</forename><surname>Luca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Martin</forename><surname>Eul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andreas</forename><surname>Nürnberger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. Workshop on Lexical-semantic and Ontological Resources</title>
				<meeting>Workshop on Lexical-semantic and Ontological Resources</meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Semantic web usage mining: Using semantics to understand user intentions</title>
		<author>
			<persName><forename type="first">Till</forename><surname>Plumbaum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tino</forename><surname>Stelter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alexander</forename><surname>Korth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. Conf. on User Modeling, Adaptation and Personalization</title>
				<meeting>Conf. on User Modeling, Adaptation and Personalization</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="391" to="396" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">WordNet in RDFS and OWL</title>
		<author>
			<persName><forename type="first">Aldo</forename><surname>Mark Van Assem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guus</forename><surname>Gangemi</surname></persName>
		</author>
		<author>
			<persName><surname>Schreiber</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
			<publisher>W3C</publisher>
		</imprint>
	</monogr>
	<note type="report_type">Technical report</note>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">Piek</forename><surname>Vossen</surname></persName>
		</author>
		<title level="m">Eurowordnet general document, version 3, final</title>
				<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
