<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Mobile Location-Driven Associative Search in DBpedia with Tag Clouds</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Bjørnar Tessem</string-name>
          <email>bjornar.tessem@uib.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bjarte Johansen</string-name>
          <email>bjarte.johansen@uib.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Csaba Veres</string-name>
          <email>csaba.veres@uib.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Science and Media Studies</institution>
          ,
          <addr-line>Postbox 7802</addr-line>
          ,
          <institution>University of Bergen</institution>
          ,
          <addr-line>5020 Bergen</addr-line>
          ,
          <country country="NO">Norway</country>
        </aff>
      </contrib-group>
      <fpage>6</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>A primary contextual source for today's context-sensitive mobile phone apps is the user's location. The recent surge in the availability of open linked data can provide location-oriented semantic context, still wanting to be explored in innovative ways. In PediaCloud, the Android tool described here, we show how we can use the associative structure of the Semantic Web at a geographical location, visualize location information with tag clouds, and allow users to follow the associations of the Semantic Web enabled by the tag cloud, with the aim of enabling the users to construct an understanding of the “place” around them. The data we use are found through DBpedia, a project a project aimed to lift the information in WikiPedia into the Semantic Web.</p>
      </abstract>
      <kwd-group>
        <kwd>tag clouds</kwd>
        <kwd>mobile</kwd>
        <kwd>location</kwd>
        <kwd>semantic web</kwd>
        <kwd>DBpedia</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Exploiting location context has become a major theme for research on mobile
technologies and is widely applied in commercial applications. These new technologies have
implications for the realisation of the concept of place, which in sociology is
understood as not only the location itself, but also the physical surroundings, and a persons
attribution of meaning to the surroundings[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The location-based mobile tools guide
the users to information about the surroundings, and enable new ways of constructing
a user’s understanding of place. Examples of such tools are Google maps and Google
places, which in order to enhance the experience of places, connect to located
information from among others Wikipedia, and its linked data extraction, DBpedia1.
      </p>
      <p>A potential technique for presenting located information are tag clouds, which are a
means of visualizing natural language information. In a tag cloud a collection of words
is drawn in a bounded area, each word with a font proportional to a weight computed
from the text collection, often the count of that particular word. The size of the words
gives the user an impression of what topics are important in the text collection.</p>
      <p>With the PediaCloud tool we aim to show how one can create tag clouds of located
information from DBpedia. The goal is to use not only located (primary) resources, but
also to collect DBpedia entities that have some semantic link to the located resources</p>
      <sec id="sec-1-1">
        <title>1 http://dbpedia.org</title>
        <p>(secondary resources), and build the tag clouds from this combined set of information.
This should give users a richer sense of the semantic aspects of a location, the idea
being that both primary and secondary resources contribute to a user’s understanding
of the place. It is also a goal to create a mobile semantic web application without a
dedicated backend for the intermediate organisation of information, instead aiming to
use DBpedia as an example of an existing semantic web resource that can be consulted
directly.
2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>The Workings of PediaCloud</title>
      <p>The DBpedia data collected from Wikipedia articles contain features like location,
abstract, categories, and other relations to many types of entities. For the PediaCloud
application we use Wikipedia articles with a location, and their abstracts, and in addition
the abstracts of the linked secondary resources. The collected abstracts are the source
of the tag clouds we build. The construction of the tag cloud essentially goes through
the following steps:
1. Get user location
2. Get all articles (primary) within fixed radius of location.
3. Get all articles (secondary) that the primary articles link to.
4. Find frequency of each word (wf) in each article’s abstract.
5. Weight the wf of the words from the primary articles as a function of distance,
scaling linearly with weight 1.0 at the user’s location and 0.0 at the radius.
6. Weight the wf of the words from the secondary articles by the cosine similarity to
the primary article it is linked from, multiplied by the weight of the primary article.
7. Add the weighted wf for each word together.
8. Select the highest n scoring words.
9. Use the word score to create tag cloud for current area.</p>
      <p>The tag cloud for the Main square of Graz is shown in Fig. 2a. The user can
select any word in the tag cloud, for example ”CULTURE”. This will show the list of
WikiPedia articles that contain the word ”culture” (Fig. 2b). Now, the user may choose
to look at the Wikipedia article for a resource, but may also select one of these articles
as a focused resource (as an alternative to the user’s location) in a new tag cloud. This
is shown in Fig. 2c where “Nikola Tesla” was chosen as a focal point. Note that the
selected article no longer has a spatial component. We use the same collection of articles
as for the previous tag cloud. However, the weight of the different abstracts will only
be based on the cosine similarity to the focused resource. The word weights in the tag
cloud thus depends on the choice of focused resource, giving the user a sense of what
words are most prominent in the collection given this special focus.</p>
      <p>The effect of using cosine similarity combined with word frequency to weight the
words as opposed to using only word frequency is shown in Table 1, where we show the
ten most prominent words and their weights in two tag clouds generated for the “Nikola
Tesla” resource. We notice that words relating to Nikola Tesla’s achievements
(ELECTRICAL, WIRELESS) get a place in the list when we use a cosine similarity approach
as opposed to when we use word frequency only. The word frequency approach mainly
results in an emphasis on general geographical words and nationalities. With the use of
cosine similarity, larger fonts are given to words that are more informative.</p>
      <p>The query we send to DBpedia’s sparql endpoint2 is a single SELECT with UNION
gathering data for both primary and secondary resources at the same time (steps 2 and
3 in the process). From the returned data the tool is able to compute the tag cloud for</p>
      <sec id="sec-2-1">
        <title>2 http://DBpedia.org/sparql</title>
        <p>Nikola Tesla: Word frequency Nikola Tesla: Word freq. * Cos.sim.</p>
        <p>GRAZ 57.0 TESLA 9.00
UNIVERSITY 44.0 GRAZ 6.78
CITY 30.0 UNIVERSITY 5.42
AUSTRIA 24.0 CITY 4.83
AUSTRIAN 21.0 ELECTRICAL 4.00
FIRST 16.0 WIRELESS 4.00
KNOWN 13.0 WORK 3.81
CROATIAN 13.0 CULTURE 3.60
CAPITAL 12.0 AUSTRIAN 3.32</p>
        <p>WORLD 12.0 AMERICAN 3.23
the mobile screen. A positive effect of doing one single query is that it saves response
time due to less network connection time.</p>
        <p>We would have preferred to run the data gathering as a single CONSTRUCT in
order to store all relevant triples locally, possibly on a triple store, but we ran in to a
memory limit at the DBpedia endpoint when sending the query, so we had to go for
the SELECT version. We considered installing a triple store with a query engine on
the device to support the CONSTRUCT version, but discovered early that even though
many of the triple stores and query engines are written in Java they are currently not
working on the Android platform.</p>
        <p>We are getting interesting results with the current implementation, but there are
weaknesses that we would like to fix. One problem is that some words describing large
enclosing areas (like ”GRAZ” and ”AUSTRIA”) often get very high weights, but may
not be very informative in the sense of getting a cultural and historical overview of a
location. We believe that we can reduce the weight of these words by modifying the
weighting algorithm we use, for instance by using tf-idf in conjunction with the already
implemented cosine similarity.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Related Work and Conclusion</title>
      <p>
        DBpedia has been used as a source of information in DBpedia mobile which is a tool
presenting DBpedia resources close to the user at a map [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Ruta et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] also use
DBpedia data, and combine semantic similarity with location closeness to give DBpedia
sources an overall match to a search criteria. MapXplore [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is a tool that uses DBpedia
to present classification and other factual data about points of interest, to users.
MapXplore uses a category browser for locating relevant points of interest for users, in order
to give them an overall impression of the important concepts at a place. For example,
Bergen in Norway features prominently with the concept ”mountain”, whereas Dubai
features with the category ”skyscraper”. van Aart et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] use data from a variety of
sources and connect non-located resources to a location through links to a located
resource. Ma¨kela¨ et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] do the same and put these data into a backend store and make
them accessible through a mobile application. Paelke et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and Baldauf et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
both use tag clouds on mobiles to present information from resources that are tagged
with location data. Do¨ rk et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] also use tag clouds in a web application allowing
location-based exploratory web searches with visualization tools.
      </p>
      <p>PediaCloud integrates ideas from these related projects as it focuses on located
information from DBpedia, and further, its visualisation through tag clouds. A particular
feature of PediaCloud is the use of secondary resources in constructing the tag cloud,
and that the tag cloud changes depending on the user’s choice of focus, either the user’s
location or a particular DBpedia resource. The weighting of tag cloud words are
computed from word counts combined with cosine similarity and geographical distance,
resulting in a higher emphasis on the more informative tags. PediaCloud also does not
depend on a dedicated backend. The tool gathers information from a main Semantic
Web endpoint, and computes the visual presentation locally.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baldauf</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , Fro¨hlich,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Reichl</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.:</surname>
          </string-name>
          <article-title>The ambient tag cloud: A new concept for topic-driven mobile urban exploration</article-title>
          .
          <source>In: Proceedings of the European Conference on Ambient Intelligence. AmI '09</source>
          , Berlin, Heidelberg, Springer-Verlag (
          <year>2009</year>
          )
          <fpage>44</fpage>
          -
          <lpage>48</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Becker</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Exploring the geospatial semantic web with DBpedia Mobile</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>7</volume>
          (
          <issue>4</issue>
          ) (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Do¨rk,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Williamson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Carpendale</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          :
          <article-title>Navigating tomorrow's web: From searching and browsing to visual exploration</article-title>
          .
          <source>ACM Trans. Web</source>
          <volume>6</volume>
          (
          <issue>3</issue>
          ) (
          <year>October 2012</year>
          )
          <volume>13</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          :
          <fpage>28</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gieryn</surname>
            ,
            <given-names>T.F.</given-names>
          </string-name>
          :
          <article-title>A space for place in sociology</article-title>
          .
          <source>Annual Review of Sociology</source>
          <volume>26</volume>
          (
          <issue>1</issue>
          ) (
          <year>2000</year>
          )
          <fpage>463</fpage>
          -
          <lpage>496</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Ma¨kela¨, E.,
          <string-name>
            <surname>Lindblad</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>Va¨a¨ta¨inen</article-title>
          , J.,
          <string-name>
            <surname>Alatalo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , Hyvo¨nen, E.:
          <article-title>Discovering places of interest through direct and indirect associations in heterogeneous sources - the travelsampo system</article-title>
          .
          <source>In: Terra Cognita</source>
          <year>2011</year>
          :
          <article-title>Foundations, Technologies and Applications of the Geospatial Web</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          , Vol-
          <volume>798</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Paelke</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dahinden</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eggert</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mondzech</surname>
          </string-name>
          , J.:
          <article-title>Location based context awareness through tag-cloud visualization</article-title>
          .
          <source>In: Advances in Geo-Spatial Information Science</source>
          . CRC Press (
          <year>2012</year>
          )
          <fpage>265</fpage>
          -
          <lpage>273</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ruta</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scioscia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Di Sciascio</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piscitelli</surname>
          </string-name>
          , G.:
          <article-title>Location-based semantic matchmaking in ubiquitous computing</article-title>
          .
          <source>In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03. WI-IAT '10</source>
          , Washington, DC, USA, IEEE Computer Society (
          <year>2010</year>
          )
          <fpage>124</fpage>
          -
          <lpage>127</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. van Aart,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Wielinga</surname>
          </string-name>
          , B.,
          <string-name>
            <surname>van Hage</surname>
            ,
            <given-names>W.R.</given-names>
          </string-name>
          :
          <article-title>Mobile cultural heritage guide: location-aware semantic search</article-title>
          . In:
          <article-title>Proceedings of the 17th international conference on Knowledge engineering and management by the masses</article-title>
          .
          <source>EKAW'10</source>
          , Berlin, Heidelberg, Springer-Verlag (
          <year>2010</year>
          )
          <fpage>257</fpage>
          -
          <lpage>271</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Veres</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>MapXplore: Linked data in the app store</article-title>
          . In Sequeda, J.,
          <string-name>
            <surname>Harth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartig</surname>
          </string-name>
          , O., eds.
          <source>: COLD</source>
          . Volume
          <volume>905</volume>
          of CEUR Workshop Proceedings., CEUR-WS.org (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>