=Paper= {{Paper |id=None |storemode=property |title=Towards a Context-Aware Photo Recommender System |pdfUrl=https://ceur-ws.org/Vol-889/paper4.pdf |volume=Vol-889 }} ==Towards a Context-Aware Photo Recommender System== https://ceur-ws.org/Vol-889/paper4.pdf

Towards a Context-Aware Photo Recommender System
Fabrício D. A. Lemos1 2, Rafael A. F Carmo 3, Windson Viana1 3, Rossana M. C. Andrade1 2 *
1
Group of Computer Networks, Software Engineering and Systems (GREat)
2
Master and Doctorate in Computer Science (MDCC)
3
UFC Virtual Institute (IUFCV)
Federal University of Ceará (UFC)
{fabriciolemos, rossana}@great.ufc.br
{rafael, windson}@virtual.ufc.br

ABSTRACT improvements and home sensors technologies allow a better
The main challenge of recommender systems is to be able to user’s context characterization and this information can be useful
identify and recommend items that have a greater chance of to improve recommendations.
meeting the interests of their users, which generally have a very In this scenario, this paper presents the Mobile Media to You
subjective and heterogeneous nature. It is imperative, then, that (MMedia2U), a photo recommender system that suggests images
recommender systems, from the identification of each user's previously annotated with contextual information. In order to
profile, could recommend personalized items. However, the user’s execute the recommendation, MMedia2U explores the current
profile is not enough for the system to be able to completely context of the user acquired by his/her mobile device. This
identify the user’s interests. The use of the system in a different domain is interesting since there are available a large number of
context from the usual may cause an unsatisfactory result for the images on sharing sites such as Flickr1 and Picasa Web2. Many of
recommendation, requiring it to be adapted to a new context. This these pictures are captured by mobile devices that store the
paper presents the MMedia2U, a prototype of a mobile photo location, date, time, and other contextual information which can
recommender system that exploits the user’s context and the be explored for recommendation.
context when the photo was created as a means to improve the
recommendation. Three context dimensions area exploited: 2. Context-Aware Recommender Systems
spatial, social and temporal. We describe the similarity measures
used for each dimension and the results of the system evaluation
(CARS)
With the dissemination of ubiquitous computing concepts,
by 13 users following a Gold Standard approach.
context-awareness has become a very important research field. Its
Keywords ideas have been used to increase efficiency and usability of
Recommender Systems, Context-Awareness, Ubiquitous Information Systems, particularly, those systems accessed from
Computing. mobile devices [5]. Context-awareness in recommender system
has been motivated from research, which recognizes the
1. INTRODUCTION dependence of user long-term needs on time, location, and any
One of the most important challenges in Information Systems is information about the physical environment surrounding the user
information overload. Recommender Systems try to cope with this [6][10]. Context-awareness introduces an additional level of
problem by helping people in retrieving information (ex: videos, personalization since it takes into account the influence of the
TV programs, routes, images, people, etc.) that may match their external environment of the user on his/her appreciation of the
preferences and intentions. Recommender Systems try to identify products or items. Recommender systems can take benefits from
items, from the information corpus, that have a greater chance to the context-awareness by considering not only the characteristics
meet the wishes of its users [4][6]. However, the characterization intrinsic to each item and user, but also the characteristics of their
of user’s preferences and intentions is a complex task that, as rule, current situations (both user and item). For example, gathering
needs user intervention to be fulfilled correctly. Another issue of context, a restaurant recommender system will be able to adapt its
Recommender Systems is related to user's context. The use of the recommendations for restaurants that are next to the user, open
system in a different context than usual may cause an and that have available seats for the amount of people who are
unsatisfactory result for the recommendation, since preferences with the user (e.g., his/her family).
and intentions can be influenced by the user's context (location, In the last years, some researchers have showed the feasibility of
trajectory, time, activity, etc.). Context-awareness refers exactly to implementing CARS (e.g., news, movies, music, and services).
the capacity that a system has to detect user’s situation and guide Adomavicius et al. [6], for instance, implement a recommender
the system behaviour accordingly [5]. Nowadays, mobile devices system of movies that takes into account the user context (e.g., if
Permission to make digital or hard copies of all or part of this work for the user is going to watch the movie at home or in the movie
personal or classroom use is granted without fee provided that copies are theatre) and it has attested improvements in the precision and
not made or distributed for profit or commercial advantage and that recall of recommendations. The authors also propose a
copies bear this notice and the full citation on the first page. To copy classification of context-aware systems into two categories. The
otherwise, or republish, to post on servers or to redistribute to lists, first category contains systems that use contextual information as
requires prior specific permission and/or a fee.
CARS-2012, September 9, 2012, Dublin, Ireland.
Copyright is held by the authors. 1
www.flickr.com/
2
http://picasaweb.google.com/lh/explore

*Sponsored by CNPq under Productivity Scholarship DT-2
criterion for filtering items. For instance, COMPASS [1] is a a context model, in which are established the elements that
tourism guide that recommends Points of Interest (POI) taking compose its description and how it should be represented (e.g.,
into account the current context of the user. Kaialeido Photo [2], using ontologies, XML, objects). Fig 1 shows our context model
in turn, performs contextual annotation of images and allows the represented as OWL-DL ontology. Our model has four
user to filter the albums according to preset categories (e.g., an dimensions: spatial (location and points of interest), social (e.g.,
event where the photo was created). In this system, the user has to personal information and activity being performed), temporal
explicitly specify the filters to be used. Columbus [11] is simpler; (date and time) and computational (mobile device).
it is a mobile application that displays georeferenced photos taken
near the user’s location.
The second category comprises systems that use the contextual
information at the time that the user evaluates an item. In addition
to the ratings and characteristics of items and users, systems of
this category also take into account contextual information (e.g.,
location) during recommendation. The work described by
Adomavicius et al. [6] is an example of a recommender system in
this category. The system records time that a user watched a
movie, stores both the score given by the user to the film and the
context in which the movie was watched (e.g., at home, with his
wife). Thus, for example, a film that was well rated by users in a
given context, are more likely to be recommended to a user who is
in a similar context. In the domain of multimedia content, the
system C2_Music [7] incorporates contextual information on
music recommendation. In general, the behaviour of this system is
similar to the movie recommender system, in which the song Fig 1- Our Context Model.
evaluation is enriched with the context in which it was heard (e.g.,
In MMedia2U, these dimensions were explored in the acquisition
day of week, weather conditions).
of knowledge about users and photos. These dimensions are
Such systems depend, however, on a historical database already exploited in context-aware management of photos [3].
describing which items were evaluated by others users (or the user They have been proved to be relevant in organizing and finding
himself/herself) in similar contexts to the current user’s context. personal photos, which is an indicator that can also be exploited in
Another problem occurs when a new item is added to the recommender systems of this type of multimedia document. In
collection. As this item has not yet been used, it will be difficult to MMedia2U, the location attribute is extracted from the user’s
recommend it in accordance with recommendation techniques mobile device (e.g., GPS). Other attributes such as place
such as collaborative filtering (so-called cold start problem). The description (e.g., shopping, beach, etc.) can be derived from freely
work described in this paper differs from the studies available web services such as GeoNames3 e WikiMapia.4 Date
aforementioned since it takes into account both the context of and time considered are the time of use of the system. In the
users and the context in which the items were created. The current version, the activity needs to be informed by the user and
hypothesis is that photos taken in a given context c may be of can be chosen from among the options presented or reported
interest to users who are in similar contexts to c. Then, we do not manually. Examples of activities are: sports, festivals, and
need, in a first moment, of a historical database of landscapes.
recommendation evaluations. MMedia2U uses a knowledge-based
recommendation method trying to avoid the cold start problem of 3.2 Similarity measure
collaborative filtering [8]. Similarity measure is used in the system in order to retrieve those
photos created in contexts more similar to the user's current
3. MOBILE MEDIA TO YOU (MMedia2U) context. The algorithm developed is an adaptation of traditional
In the system presented in this paper, users receive knowledge-based techniques [13], which uses the user context as
recommendations of photos created in contexts similar to current indicative of their preferences and the context of items (i.e.,
users’ context. This similarity computes three contextual photos) as a representation of its features. In our system, the
dimensions (spatial, social, and temporal). The system has as context of items is the context in which the photos were created.
target two types of users. The first type are those who are in an The similarity is calculated between the context of the user U and
unusual context (e.g., visiting a tourist sight for the first time) and the context of an item I using the following formula:
they can enjoy the pictures recommended to have references to
activities or new places to explore. The second group contains
users that have already been in this similar context and that the (1)
recommended photos may give a new vision and perspective of In Formula 1, the similarity is calculated without the need of
the situation they find themselves. Rost et al.[11] have noted that training data. In this case, c is an attribute belonging to the
georeferenced images can influence in a positive and playful way dimensions of the context model (e.g., location); wc is the weight
the exploration of space by these two categories of users. of influence of attribute c (e.g., location has a weight of 50%) and
3.1 Context Modelling simc is the similarity function for attribute c. Those pictures that
A fundamental part in the development of a context-aware system have the highest value of similarity are the ones recommended to
is the definition of what information should compose the
“context”, since elements that describe the contextual information 3
http://www.geonames.org/
depend on the system tasks, and on the system capacity to observe 4
this information. This definition is associated with the creation of http://wikimapia.org/
user U. The function simc is particular to each type of context and provided by our recommender system, and it informs the user's
application domain. current context. MMedia2U server receives the request, and
Each context model dimension must have a method to calculate its performs an enrichment of user context data. The metadata stored
similarity. The location similarity, for instance, can be calculated in the repository of photos are scanned and compared with the
by measuring the distance between the place where the picture current user’s context. In step 5, MMedia2U computes a photo
was taken and the current user’s location. The similarity for ranking according to the results of similarity measures. The
activity is calculated by comparing the activity or occasion that ranking contains the photos URLs and their metadata.
the user is found and the activity or occasion in which the image 3.4 Photo Corpus
was generated. Some of the activities mapped in image shoots and MMedia2U has a repository of photos for recommendation. These
their similarities are shown in Table 1. photos need to be associated with contextual information in order
Shopping Party Leisure Sports to be compared with the current context of the mobile users. The
photos should have as metadata the location where they were
Shopping 1 0 0 0 taken and the activity of the photo’s author at the time of their
Party 0 1 0,5 0 creation. At first, we expect to use photos from Web 2.0
applications, such as Flickr and Picasa Web. However, we find
Leisure 0 0,5 1 0,5 many errors in the metadata of these photos (e.g., time, location).
In order to evaluate the recommendation method without
Sports 0 0 0,5 1
annotation errors, we built a repository of photos from images of
Tab. 1 – Activity Similarity Picasa Web. Manually, we corrected and increased the metadata
Formula 2 is used with numeric context attributes. The similarity returned by the Picasa Web Service. Then, we incorporated the
is defined by how close two values are. new metadata into the photo file by using IPTC and EXIF
headers. Examples of enrichment are the inference of the activity
V c (U )− V c (I ) from the photos description, and the day of week according to the
sim c (U,I )= 1−
max (c )− min (c ) shot date. We hope the evolution of multimedia content
(2) management systems, such as CoMMeDiA [3], will reduce the
In Formula 2, Vc(U) and Vc(I) represent, respectively, the values effort to enrich this kind of image metadata.
of context c for the user and the item; max(c) and min(c)
represent, respectively, the maximum and minimum values for the
3.5 MMedia2U Mobile Application
compared attribute of context c (e.g., for c = hour of the day, The mobile application was developed for the Android5 platform,
min(c) = 0, and max(c) = 12). The similarity between dates can compatible with devices that have version 2.2 or higher. Fig. 3
compare the various attributes related to the moment the photo shows an execution flow of the mobile application.
was shot and the moment the user is found. Some attributes
compared were: the hour of the day, day of the week and month of
the year. The date attributes were compared individually to
analyze the influence of each one (e.g., Has the similarity between
hour of the day a greater influence on the choice of the user than
the similarity of the day of the week?). The similarity between the
months of the year can be calculated by Formula 2, adapting it to
cyclic values (e.g., the distance between January and December is
1, instead of 11).

3.3 System Architecture Fig 3. Execution flow of the mobile application.
We designed the MMedia2U following a client-server architecture The user chooses the "activity/interest," then, the system captures
for mobile computing that is based on RESTful Web Services. the current context (location and date/time), and it sends to the
This design allows mobile devices, even those with low server. This, in turn, returns a list of recommended images (third
processing capabilities, make use of our recommender service. Fig screen). If the user selects a photo, he can see its position and the
2 presents the execution flow of the system. distance between him and the place where the photo was captured.
Clicking on the picture located on the map, its content is displayed
in full screen, and its metadata can be also viewed.
4. EXPERIMENTS
Evaluating a recommendation system is a hard task, due to the
property that an item’s relevancy has a strong personal nature and
is complex to be measured. This difficulty is enhanced when exist
a lack of historical evaluation data, which makes large-scale
studies very costly and difficult to be run. In the case of CARS,
the complexity is even bigger, since we need to range the possible
Fig 2. Execution flow of the MMedia2U recommendation. contexts of real situations (i.e., places, daily situations, etc.). As
In step 1, a mobile application is responsible for gathering user’s we did not have a historical data about recommended photos, we
context. Some types of information (e.g., location) can be created a Gold Standard, which consisted of photos evaluated by
acquired from sensors (e.g., GPS). Other types rely on information
passed by the user (e.g., current activity). In the second step, the
5
mobile application accesses, from a HTTP call, the Web-Service http://developer.android.com/
users in a certain context. The objective was both to use the Gold algorithm, the average precision was 0.28. This precision is
Standard to compare the performance of our recommendation and relatively high since some users have chosen more than 30 photos
to use it as historical data. for a specific context (e.g., a user in particular has selected half
While building this Gold Standard, it was asked for a group of 13 part of the corpus).
users to evaluate photos from 8 different contexts, each one The last two rows of the table show the average precision when
consisting of a stage of evaluation. In each stage, one context using the calculation of similarity of only one of the contextual
(e.g., shopping in some stores on the seaside of the city of dimensions. Combination I, which got the best results, was the
Fortaleza6 during the evening) was presented to the user, who had one we assign twice the importance of Activity in relation to
to visualize a set of photos and choose those that seemed to be Location and four times in relation to temporal attributes.
more appealing for him/her, taking into consideration the context Comparing the results obtained without training (Combination I
he/she was suggested. The photos chosen by the users were and Equal Weights) in relation to random experiments, one can
included in the Gold Standard and provided the historical base in see that it is possible to have much higher precision than random
which the recommendation of MMedia2U was evaluated. The choices (agreeing with the hypothesis H-II and H-I). Moreover,
degree of success on recommendations was then evaluated by the the gain over the random method was not big when considering
ratio of chosen photos that are in the Gold Standard (e.g., if in a only one contextual dimension (e.g., only location or only
given combination of user and context, 10 photos are activity), leading us to believe that a model of full context is
recommended and 6 of these are in the Gold Standard of the same essential for a good context-aware recommendation (hypothesis
combination, then the recommendation precision is 0.6). H-III).
In each stage of evaluation, a mean of 100 photos were visualized Fig 4 shows the F measure (harmonic mean) analysis for the six
by the users. 20 were taken in similar contexts to the one showed recommendation algorithms. Equal weights and Combination I
to the user and 80 were different in some dimensions of the had the better results for Top 3. Regarding the Top 5, Top 10 and
context (e.g., same activity but very distant location). Five of 13 Top 20 lists, Combination I and Least Square were the most
users didn’t know the place chosen as the location for the users effective. For instance, Combination I (0,410 for Top 20) was two
(the seaside of Fortaleza). Ten users were Computer Science times better than Random algorithm (0,201).
graduate students aging from 23 to 28. All of them use mobile
phones every day. For some of the users, we have presented 8 We used the Student t-test in the precision values for the Top 3
photo collections, while for others we have only presented a obtained by the algorithms compared to the results obtained by
subset of it, performing 66 simulations. random selection of photos. This showed that the combined use of
contextual attributes is significantly better than randomization
4.1 QUANTITATIVE RESULTS (95% degree of confidence, probability < 0.0001). In addition, the
The objective of the experiment was to evaluate the following test showed that the use of only one attribute (activity or location,
hypotheses: for example) is not significantly better than the random method.
(H-I) It is possible to make satisfactory recommendations of Finally, the comparison with the results of Combination I and
georeferenced photos without prior knowledge of the user profile, Least Squares resulted in performance differences not statistically
considering only its current context; significant, which may not indicate the need for training to
improve the precision of this kind of CARS.
(H-II) The context in which the photos were taken is relevant in
making recommendations; and Top 3 Top 5 Top 10 Top 20
(H-III) The usage of a context model considering various Equal weights 0.56 0.54 0.45 0.42
contextual dimensions may lead to an improved recommendation Least Squares 0.54 0.56 0.51 0.44
comparing to the result of one which uses only one context
attribute (e.g., location). Combination 1 0.56 0.56 0.51 0.47

In order to verify these hypotheses, first, the algorithm was run Random 0.29 0.28 0.26 0.25
with different weights for each dimension without a previous Localization 0.28 0.30 0.30 0.29
training data. Another implementation used the weights obtained
Activity 0.35 0.33 0.33 0.34
by training the algorithm using a 7-fold method (same way of
adjusting the parameters) [5]. Weights adjustment in the training Table 2. Mean values of obtained precisions.
data was performed by linear regression using the least squares
method. We compared the precision changing the amount of
recommended photos. For this analysis, we also used random Activity
choices as base statement since we were unable to find other Localization
photo recommendation algorithms that use context information. Top 20
Table 2 shows the average precision of our recommendation
Random
Top 10
algorithm in relation to four sizes of recommendation lists (Top 3, Combination I
Top 5, Top 10 and Top 20). The average precision of the Top 5
algorithm without training, assigning equal weights to all Least Squares
similarity measures, was 0.54 for the Top 5 (5 recommended Top 3
Equal weights
items). Using the weights obtained by the least squares, the
precision of the Top 5 was 0.55. Recommending pictures at 0,0000 0,5000
random, without the ranking generated by the recommendation
Fig 4- Mean values of f-measure
6
http://en.wikipedia.org/wiki/Fortaleza
4.2 USERS QUESTIONNAIRE ACKNOWLEDGEMENTS
At each stage of the experiments, a questionnaire was applied to This work is a partial result of the UbiStructure project supported
the users so that relevant factors were investigated in the by CNPq (MCT/CNPq 14/2011 - Universal) under grant number
implementation of CARS for photos. One of the factors to be 481417/2011-7.
investigated was the relevance of a mobile system for photo
recommendation and, in the case of existence of such system, if it 6. REFERENCES
would be interesting to recommend photos taken into account the [1] Van Setten, M., Pokraev, S. and Koolwaaij, J. 2004. Context-
current user’s context. He/she was asked whether, in this specific aware recommendations in the mobile tourist application
context, the user would like to receive recommendations of photos compass. In Adaptive Hypermedia and Adaptative Web-
taken in a similar context. 74% of users answered “yes” for this Based Systems, 235–244
question. When asked whether the recommendation of pictures [2] Xia, S., Gong, X., Wang, W., Tian, Y., Yang, X., and Ma, J.
generated in similar contexts would be interesting, the level of 2010. Context-Aware Image Annotation and Retrieval on
interest was 100%. Another point investigated was the relative Mobile Device. In Second International Conference on
weight of each contextual dimension. Each user was asked what Multimedia and Information Technology, 111-114.
contextual dimension, including location, activity, date and time;
[3] Viana, W., Miron, A., Moisuc, B., Gensel, J., Villanova-
they would prefer to be taken into consideration to build the set of
Oliver M. and Martin, H. 2011. Towards the semantic and
photos. Eight of the users said they think the proximity of the
context-aware management of mobile multimedia. In
location of the photo is the most important factor to increase their
Multimedia Tools and Applications, v. 53, 391-429
interest. Five of the users responded that the activity associated
with the picture portrayed by the activity, which he/she is playing [4] Jannach, D., Zanker, M., Felfernig, A. and Friedrich, G.
at the moment is the most important factor. No users found the 2010. Recommender Systems, an Introduction. In Cambridge
similarity between date and time the most important factor. University Press, first edition

5. CONCLUSION AND FUTURE WORK [5] Fabiana G. Marinho, Rossana M.C. Andrade, Cláudia
Werner, Windson Viana, Marcio E.F. Maia, Lincoln S.
This work presented MMedia2U, CARS for contextual photos.
Rocha, Eldânae Teixeira, João B. Ferreira Filho, Valéria L.L.
During the system development, we specify a context model to
Dantas, Fabrício Lima, Saulo Aguiar, MobiLine: A Nested
cope this application domain and we adapted a knowledge-based
Software Product Line for the domain of mobile and context-
technique to incorporate the context information. MMedia2U
aware applications, Science of Computer Programming,
allows recommendation of photos even those that have never been
Available online 18 May 2012, ISSN 0167-6423,
evaluated by users. The recommendation is performed only from
10.1016/j.scico.2012.04.009.
the context in which the photos were created. This allows users,
without a history of use, to receive recommendations based on [6] Adomavicius, G. and Tuzhilin, A. 2010. Context-Aware
their current context. The recommendation mechanism was Recommender Systems. In Recommender Systems
validated by the construction of a Gold Standard. The average Handbook: A Complete Guide for Research Scientists and
precision achieved by the algorithm allowed us to conclude that, Practitioners, Springer Press, first edition
for the data used, context-awareness can bring gains in the photo [7] Lee, J. S. and Lee, J. C. 2007. Context awareness by case-
recommendation compared to a random list. It is important to note based reasoning in a music recommendation system. In
that even weight combinations without training phases (the Comb Proceedings of the 4th international conference on
I and equal weights) achieved satisfactory results. This strategy Ubiquitous computing systems
can be used to reduce the drawback of cold start problem. In
addition, the user’s survey suggests that systems of this nature are [8] Burke, R. 2000. Knowledge-Based Recommender Systems.
interesting to users. In Encyclopedia of Library and Information Systems, no. 32,
180-200
In this moment, we cannot generalize our performance results
(dependence of users and the images corpus). However, it serves [9] Albanese, M., d'Acierno, A., Moscato, V., Persia, F. and
as a good indication of the quality of the prototype Picariello, A. 2011. A Multimedia Semantic Recommender
recommendation. Analysis of these results was limited to the size System in Cultural Heritage Applications. In Semantic
of the corpus (655 photos, with 335 from two tourist areas of Computing (ICSC), 2011 Fifth IEEE International
Fortaleza, city in Brazil). We think the increase of the database Conference on , pp.403-410, 18-21
could decrease the precision of the algorithm or present [10] Boutemedjet , S. and Ziou , D. 2010. Using Images in
concentration of photos in certain places. In such cases, new Context Aware Recommender Systems, In Proc of 1st Int'l
search filters and clustering algorithms should be used to solve Workshop on Adaptation, Personalization and
these challenges. Nevertheless, the results are already encouraging Recommendation in the Social-semantic Web
the construction of context-aware photo corpus of city sights (e.g.,
[11] Rost, M., Cramer, H., and Holmquist, L. E. 2011. Mobile
points of interest in host cities of 2014 World Cup).
exploration of geotagged photographs. In Personal and
As future work, we want to increase the image corpus by using an Ubiquitous Computing
evolution of the CoMMeDiA system [3]. We aim to ensure the
[12] Siersdorfer, S. and Sizov, S. 2009. Social recommender
accuracy of contextual information, and take benefits from the
systems for web 2.0 folksonomies. In Proceedings of the
automatic context acquisition. We also expect to evaluate
20th ACM conference on Hypertext and hypermedia (HT '09)
MMedia2U with users (tourist or otherwise) in a real mobile
situation. In a new version of the system, we would like to [13] Mcsherry, D. 2003. Similarity and Compromise. In
integrate clustering algorithms in the rank results, and allow users Proceedings of the Fifth International Conference on Case-
to add words of interest in order to refine the recommendations. Based Reasoning.