<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Providing Personalized Cultural Heritage Information for the Smart Region - A Proposed methodology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Antonino Lo Bue</string-name>
          <email>lobue@pa.icar.cnr.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alan J. Wecker</string-name>
          <email>ajwecker@gmail.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tsvi Kuflik</string-name>
          <email>tsvikak@is.haifa.ac.il</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alberto Machì</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oliviero Stock</string-name>
          <email>stock@fbk.eu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CNR-ICAR Palermo</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>FBK-irst</institution>
          ,
          <addr-line>Trento</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Haifa</institution>
          ,
          <country country="IL">Israel</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Trento</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we present a methodology to provide visitors, in smart regions, additional cultural heritage attractions based on prior museum visits using user models and Linked Open Data. Visitor preferences and behavior are tracked via a museum mobile guide and used to create a visitor model. Semantic models and Linked Open Data support the representation of regional assets as Cultural Objects. The visitor model preferences are exploited using a graph similarity approach in order to identify personalized opportunities for visitors by filtering relevant Cultural Objects.</p>
      </abstract>
      <kwd-group>
        <kwd>Personalization</kwd>
        <kwd>User Models</kwd>
        <kwd>Linked Open Data</kwd>
        <kwd>Smart Regions</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In this short paper we show a blueprint how semantic models and Linked Open Data
(LOD) support the representation of regional assets in order to identify categories of
opportunities for visitors based on different personal characteristics determined by
previous visits. Having a broad infobase from which to cull possibilities is an arduous
task that can benefit from automation. Due to the overwhelming number of
possibilities, it is important to personalize the Cultural Heritage (CH) experience. When
considering what is requires from a smart, personalized system, it becomes clear that the
reasoning process of the system has to focus on identifying opportunities for
intervention. When and how to intervene and what information to deliver/service to offer.
Having a user model, a context model, and a model of the cultural objects are
essential for successful support. These can lead to the interaction of museums and places of
cultural heritage to create mega-tourist experience (similar to Verbke and Rekom [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
concept of the "museumpark") which can have a positive market effect for the region.
      </p>
      <p>We describe our methodology: First we use exhibits in a museum (we use Castle
Buonconsiglio in the Trentino Region as examples throughout this paper) and tag
them using semantic concepts. Then a mobile museum guide is used to track visitors.
Based on this data a user model is developed consisting of characteristics and
preferences. We then use a dataset of Cultural Objects using an ontological representation
of the domain to cull opportunities. Visitor Preferences are used to filter which
Cultural Objects are relevant, and Characteristics are used to determine whether an event
or cultural heritage place is desired. Context is used to filter for proximate locations
weather conditions, opening times, etc. Again characteristics are used to determine
how best to present this information to the visitor.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>In this section we review two technology areas: User Modeling and personalization in
CH, and Linked Open Data and Semantic relatedness.</p>
      <p>
        According to Ardisonno et al[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], for more than 20 years, cultural heritage has been
a favored domain for personalization research and as soon as mobile technology
appeared, it was adopted for delivering context-aware cultural heritage information both
indoors and outdoors. For personalization, a system needs to have a model of its user.
A number of approaches are possible: Overlay, Feature-based, Content based, and
Collaborative filtering. In this proposed methodology we use an implicit content
based approach, where user interests are represented as sets of words occurring in the
textual descriptions of items relevant for the user. Visitors have been observed to
behave in certain stereotypical movement patterns [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]; patterns such as Butterfly,
Grasshopper Ant, and Fish[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The use of personality types to tailor software is not
new. We use the SLOAN Big 5 characterization as it is standard and much research
has been done using it [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. We focus on two traits we believe are connected to the
museum experience: Inquisitiveness, which is a measure of curiosity and Orderliness,
which measures thoroughness and the need for structure. Introversion and
Extroversion could also play a part in group visits, but is not examined in this research. In
addition we posit a connection between movement types and "identity" types
proposed by John Falk [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Preliminary ideas for the connection of movement patterns to
personality types have been proposed [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Public agencies collect organize and manage a vast amount of data. Local and
European projects aims to deliver data as freely available, reusable and distributed
without any restriction, the so call Open Data. As part of these initiatives, tourism and
cultural heritage datasets have been published as Open Data. Semantic Web
technologies and in particular the Linked (Open) Data paradigm, introduced by Sir Tim
Berners-Lee in 2006 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], are opening new ways for data integration and reuse, creating a
method to make data interoperable at a semantic level. Ontologies formally represent
knowledge as a set of concepts and their relationships within a domain. RDF1 and
OWL2 standards enable the formal representation of ontologies as set of triples
(subject, predicate, object). Ontologies are used to express vocabularies of Linked Data
      </p>
      <sec id="sec-2-1">
        <title>1 http://www.w3.org/RDF/</title>
        <p>2 http://www.w3.org/2001/sw/wiki/OWL
triples. On top of RDF and OWL, SPARQL Query Language3 is used to query and
retrieve information stored as triples thus allowing and facilitating access to the so
called Web of Data. DBpedia4, can be seen as the ontological version of Wikipedia,
its the core of the Linked Open Data cloud.</p>
        <p>
          In the Natural Language Processing area, semantic relatedness between terms or
concepts can be computed using two main approaches: (1) defining a topological
graph similarity using ontologies and computing the minimal graph distances between
terms, (2) using statistical methods and word co-occurrence in a corpus and
calculating the correlation between words. “WikiRelate!" [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], measures correlation among
terms using a graph based distance measure on the Wikipedia categories. The system
uses the inverse path length measure as a distance metric for terms correlation. Leal et
al [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] present an approach for computing the semantic relatedness of terms using the
knowledge base of DBpedia, based on an algorithm for finding and weighting a
collection of paths connecting concept nodes. The implemented algorithm defines the
concept of proximity rather than the inverse path length distance as a measure of
relatedness among nodes. Our methodology is based on the inverse path length measure
but we apply this to a graph of ontology terms extracted from DBpedia and used as
annotation for Open Data resources. Moreover, we also take into account the concept
introduced by Moore et al. [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], that evaluates paths calculating the number of
outgoing links of each node, in order to improve the precision of the algorithm.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>System</title>
      <p>The mobile guide, at each position of interest (POI), presents a list of relevant media
assets. The mobile guide system logs: the POI, which assets are chosen how long they
viewed the asset, and in general how long did they stay at the point of interest. We
collect two types of information, the first in order to determine general personal
characteristics and the second in order to determine specific topic interests. In general we
use movement styles, to predict user characteristics (such as personality). We use time
viewing presentations in order to determine user topic preferences.</p>
      <p>In order to characterize the user we make use of his general movement activities.
We use the following statistics: 1) NumberOfPOIsVisted (NPV) – number of
positions where a person stayed more than 9 seconds as detected and logged by the mobile
guide's positioning system. Nine seconds is a number we have used for previous
analysis and has provided good results. 2) POIsWherePresentationsSeen (PPS) – the
number of positions where the visitor viewed at least one media asset connected to
that position as computed from the logs of the mobile guide.
3)NumberOfPresentationSeen (NPS) – the total number of media assets the visitor
viewed as computed from the logs of the mobile guide.</p>
      <sec id="sec-3-1">
        <title>3 http://www.w3.org/TR/sparql11-query/ 4 http://dbpedia.org</title>
        <p>The system uses annotated internal and external information about cultural places and
events. Internal information is taken from catalogues or websites and is used by the
mobile guide app to describe user preferences by storing the relevant topics related to
exhibits the user has visited and liked. External information is imported from
available Open Data about museums and cultural events and enriched in the domain
ontology, using knowledge from the Linked Open Data cloud (DBpedia dataset). Data is
stored using a domain ontology for tourism called eTourism5. The ontology covers
methodological and practical aspect of services (hotels, B&amp;B, etc.), cultural objects
(museum, cultural places, etc.) and events. It is used as a vocabulary model to map
external Open Data into RDF triples validated by the ontology concepts. For the
present work we have developed a specific module of the eTourism ontology named
Cultural Objects Ontology (coo) that covers (1) properties (such as topic, keywords,
geographical information) of museums or events, exploits the semantic identity with
LOD/DBpedia concepts (using owl:sameAs predicates) and implements (2) user
profile types and topics of interests selections.</p>
        <p>For each museum source, we extract - as a first step, keywords from exhibits of the
Castle Buonconsiglio museum. We exploit the semantic relatedness implementing the
graph similarity approach. We annotate keywords - for each description, and we
disambiguate them to DBpedia concepts using DBpedia Spotlight APIs6. We filter out all
the not relevant concepts and we then obtain a bag of concepts (related to cultural
heritage) like the following:
{dbpedia7:Trentino, dbpedia:Prehistory, dbpedia:Ancient_Rome, dbpedia:Middle_Ages,
dbpedia:Hunter-gatherer, dbpedia:Upper_Paleolithic, dbpedia:Bronze_Age}
In DBpedia, each concept is related to a category using the property dcterms:subject,
then each category is part of a hierarchy structure with nodes connected via
skos:broader properties. For example the below two DBpedia concepts have as
dcterms:subject the DBpedia topic categories:
1) Last_glacial_period (dcterms:subject) -&gt;{Climate_history, Glaciology, Holocene, Ice_ages}
2) Ancient_Rome (dcterms:subject) -&gt;{Ancient_history, Ancient_Rome, Civilizations}
5 Currently under development at ICAR-CNR within the framework of the national project
Dicet-InMoto-Orchestra, (http://www.progettoinmoto.it).
6 http://spotlight.dbpedia.org
7 Prefix for http://dbpedia.org/resource/</p>
        <p>For the second step, we extract from the DBPedia SPARQL endpoint, for each
concept, the topic categories of the DBpedia taxonomy. As result we obtain a wider
bag of DBpedia topic categories describing each museum exhibit. Using the
hierarchical structure of categories is thus possible to discover similarities among concepts
that have ancestor categories in common.</p>
        <p>As external sources, we take the Open Data set delivered by the Italian Cultural
Heritage Minister8 (MIBAC) and we map these objects using the coo ontology; then, for
each object, we exploit the same process applied for the internal resources, in order to
annotate and extract the corresponding bag of topics. As a result, we obtain a list of
information for each MIBAC Cultural Object (cultural place or event), as in the
following example:
foaf:name = “Memorie della Grande Guerra”,
coo:mainCategory = http://dbpedia.org/resource/Category:History
Bag of Concepts (dcterms:description) -&gt;
{1918_disestablishments, Aftermath_of_World_War_I, Austria-Hungary, Austria_articles
needing attention, States_and_territories_established_in_1867, Anoxic_waters,
Backarc_basins, Contemporary Italian_history, History_of_Austria-Hungary,
History_of_modern_Serbia, Wars_involving_Italy, World_War_I }
In order to select suitable Cultural Objects candidates for the user, we define a metric
to measure the semantic distance between the user profile tags and the available
cultural objects tags. As a first step, we measure the shortest path distance between each
of the m topic categories in the bag of topics of the user profile and the
coo:mainCategory topic of the suitable candidates (see table 2), and we reduce
candidates cardinality by applying an upper threshold to the distance.
After this step, we refine the result by calculating (via SPARQL queries on the
DBpedia endpoint) the shortest path between the user bag of topics (m) and the suitable
candidates bag of topics (n) on the remaining subset of cultural objects. Its important
to underline that when computing the distance measure between topic categories we
also take into account, for each hop of the shortest path, the number of outgoing links
of the node: the more outgoing links a node has (to other DBpedia taxonomy nodes)
the less it is specific. Broad connected nodes receive low weights while nodes with
less outgoing connection will get higher values. We use each pairwise distance as a
component of a normalized vector of distances, we evaluate, for each museum or
http://dbunico20.beniculturali.it/DBUnicoManagerWeb/#home</p>
        <p>Prefix for http://dbpedia.org/resource/Category:
event an average normalized distance for each m user category and we sum all these
distances to define the relatedness of each cultural object. Again an empirical
threshold on distance is applied to retain a limited number of candidates.
3.2</p>
        <p>Use of characteristics</p>
        <p>Using behavior types we can tailor the amount and presentation of information. For
example for ants and butterflies we can give ten items. For grasshoppers and fish we
may only give two items. Ants and grasshoppers may be given places while
butterflies and fish may be given events. Additional personalization may be possible.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion and Conclusion</title>
      <sec id="sec-4-1">
        <title>The results we get for the four sample users are shown on the table below.</title>
        <p>Our current metric of semantic relatedness doesn't take into account whether the user
profile bag of topics is representative of a sufficiently broad range of museums
categories to cover their cultural preferences. To balance this, when all/most of the user
preferences are of the same topic area (e.g. Prehistory), one or more among suggested
items could be chosen from a minor topic category, to elicit variation in user interests.
Our current research involves, the implementation of the methodology to the Old City
and the Tower of David Museum in Jerusalem, and the evaluation of the user model
and the semantic suggestions results.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Antoniou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lepouras</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lykourentzou</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          et al.:
          <article-title>Connecting Physical Space, Human Personalities and Social Networks</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ardissono</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuflik</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petrelli</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Personalization in Cultural Heritage: The Road Travelled and the One Ahead. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>22</volume>
          (
          <year>2011</year>
          )
          <fpage>1</fpage>
          -
          <lpage>27</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Design issues: Linked data</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Falk</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          :
          <article-title>Identity and the museum visitor experience</article-title>
          . Left Coast Press Walnut Creek, CA (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Higgins</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peterson</surname>
            ,
            <given-names>J. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pihl</surname>
            ,
            <given-names>R. O.</given-names>
          </string-name>
          et al.:
          <article-title>Prefrontal Cognitive Ability, Intelligence, Big Five Personality, and the Prediction of Advanced Academic and Workplace Performance</article-title>
          .
          <source>J. Pers. Soc. Psychol</source>
          .,
          <volume>93</volume>
          (
          <year>2007</year>
          )
          <fpage>298</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Jansen-Verbeke</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Van Rekom</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <source>Scanning Museum Visitors: Urban Tourism Marketing. Ann. Tourism Res.</source>
          ,
          <volume>23</volume>
          (
          <year>1996</year>
          )
          <fpage>364</fpage>
          -
          <lpage>375</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Moore</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steinke</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tresp</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>A novel metric for information retrieval in semantic networks</article-title>
          .
          <source>ESWC 2011 Workshops 65-79</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Strube</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S. P.</given-names>
          </string-name>
          : WikiRelate!
          <article-title>Computing semantic relatedness using Wikipedia</article-title>
          .
          <source>AAAI</source>
          Vol.
          <volume>6</volume>
          , pp.
          <fpage>1419</fpage>
          -
          <lpage>1424</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Leal</surname>
            ,
            <given-names>J. P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodrigues</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Queirós</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Computing semantic relatedness using DBpedia</article-title>
          .
          <source>SLATE</source>
          <year>2012</year>
          , page
          <volume>3519</volume>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Veron</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Levasseur</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : Ethnographie de l'exposition.
          <source>Centre Georges Pompidou</source>
          (
          <year>1983</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Zancanaro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuflik</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boger</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          et al.:
          <article-title>Analyzing Museum Visitors' Behavior Patterns</article-title>
          .
          <source>User Modeling</source>
          <year>2007</year>
          , (
          <year>2007</year>
          )
          <fpage>238</fpage>
          -
          <lpage>246</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>