<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>R.Fakhfakh et al, p.</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>REGIMvid at ImageCLEF2012: Concept-based Query Refinement and Relevance-based Ranking Enhancement for Image Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rim Fakhfakh</string-name>
          <email>fakhfakhrima@yahoo.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ghada Feki</string-name>
          <email>ghada.feki@yahoo.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amel Ksibi</string-name>
          <email>amel.ksibi@ieee.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anis Ben Ammar</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chokri Ben Amar</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>REGIM: REsearch Group on Intelligent Machines, University of Sfax</institution>
          ,
          <addr-line>ENIS, BP W, 3038, Sfax</addr-line>
          ,
          <country country="TN">Tunisia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2012</year>
      </pub-date>
      <volume>1</volume>
      <issue>2012</issue>
      <abstract>
        <p>In this paper, we present our proposed approach that deals with two important steps in the retrieval process, which are query analysis and relevance-based ranking. First, query analysis takes into account two forms of queries: textual and visual which present the form of queries provided by ImageCLEF in the taskVis“ual concept detection, annotation, and retrieval using Flickr photos ”. The main idea in the query analysis process is to interpret the textual and visual queries and select the most appropriate concepts according to the query to concept mapping. Even the list of concepts is retained, we perform a query refinement process to improve the quality of the interpretation of the query. Second, we try to achieve a high-quality relevance-based result not only with choosing an adequate similarity measure but also with enhancing obtained scores by elaborating a random walk with restart, which is performed over inter-images semantic similarity graph.</p>
      </abstract>
      <kwd-group>
        <kwd>Concept based image retrieval</kwd>
        <kwd>Query to concept mapping</kwd>
        <kwd>Relevance based ranking</kwd>
        <kwd>Inter-concept similarity graph</kwd>
        <kwd>Inter-images semantic similarity graph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        This paper presents the second participation of our group REGIMvid, within
REGIM research unit, in the “Visual concept detection, annotation, and
retrieval using Flickr photos ” task[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>The aim in this task is to analyze a collection of Flickr photos in terms of their
visual and/or textual features in order to detect the presence of one or more
concepts. The detected concepts can then be used to automatically annotate</p>
      <p>images or to retrieve the best matching images to a given concept-oriented
query. The main challenge for this task is to select the most appropriate
concepts related to a query and so to ensure a good relevance-based result
ranking. For the selection of the most adequate concepts from a query, we
execute a query to concept mapping requiring a calculation of the semantic
similarity measures between query keywords and concepts. Concerning the
ranking step, we try to achieve a high-quality relevance-based result not only
with choosing an adequate similarity measure but also with enhancing
obtained scores by elaborating a random walk with restart. This refinement
step is based on information, which are occurred from the inter-images
semantic similarity graph.</p>
      <p>The rest of the paper is organized as follows. Section 2 describes our
proposed relevance-based approach; Section 3 discusses the experimental
results.
2</p>
      <p>Relevance-based approach: Visual concept retrieval using Flickr
photos task
Relevance computing in our approach implies 3 phases : First, we analyze the
query to enhance the user request understanding . Second, we compare
query concepts with concepts that annotate each image from the collection.
Finally, scores obtained in the previous step are refined by the means of
random walk with restart.</p>
      <p>The following notations will be used. Given a set of query concepts =
{ , ,.., }, denote by = { , ,.., } the collection of images that
are associated with the set of query concepts . This collection, which is a
part of the large collection , is obtained by the inverted
file construction in an off-line stage.</p>
      <p>For image , denote by = { , ,.., }the set of its associated concepts.
The relevance scores of all images in D are represented in a vector
of image</p>
      <p>, whose element denotes the relevance score
with respect to the set of query concepts .</p>
      <p>Relevance score reflects the degree of the existence of a given concept
in the image . This score is normalized that we range it from 0 to 1.
2.1</p>
      <p>Query analysis
Query analysis in a retrieval process is a fundamental and critical step. The
main challenge is to enhance the interpretation of user request in order to
improve retrieved result quality. The proposed process takes into account
the query format: title, description and images (fig2). Firstly, a textual
analy</p>
      <p>sis is performed on title and description components in order to deduct the
main concepts within text. Secondly, a matching process is performed based
on a concept within query and image data. The obtained results are, thirdly,
merged. Finally, a refinement process is executed.</p>
      <p>The following figure shows the query analysis process which goal is the
selection of the most appropriate concepts related to each query.</p>
      <p>Title : grass field recreation
Description : The user is looking
for photos showing one or more
people on a grass field.</p>
      <p>Images :</p>
      <p>Query
grass outdoor happy
two baby tree
male sports_recreation forest_park
female desert child
day three teenager
one small_group elderly
adult big_group plant
family_friends no_blur calm
portrait funny</p>
      <p>Concepts list</p>
      <p>Textual query analysis .</p>
      <p>
        The proposed treatments within textual analysis inherit from natural
language processing standard. First we extract the representative information
by the elimination of non informative terms called stoplist [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Keywords
extracted from the initial query are divided into positive and negative terms
by following a linguistic study: the positive keywords represent what is
required in the query and the negative ones represent some details that should
be avoided. Fig3 shows an example for a query keywords classification into
positive and negative.
Each retained keyword in the query representation is projected on the
concept space; a query to concept mapping. Using WordNet [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] as taxonomy, we
evaluate the similarity values between a keyword and the 94 proposed
concepts. We retain the most related concept to each query term.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Visual query analysis .</title>
      <p>Regarding the initial query textual description, we distinguish between
positive and negative images. Image is considered as positive if it translates the
textual description and negative if there are some particularities which are
not congruent to the textual description.</p>
      <p>
        Alike the textual treatment, we deduce a set of positive and negative
concepts. The related concepts per image are extracted from the annotation file
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>In some query, the same concept appears in positive and negative images. In
this particular case, we retain this concept as positive.</p>
    </sec>
    <sec id="sec-3">
      <title>Textual and visual concepts fusion .</title>
      <p>Textual and visual analyses bring out a set of positive and negative concept
per each modality. The obtained sets are firstly merged. We compute a
weight value for each positive concept.</p>
      <p>Regarding the concept frequency in textual and visual component, the
associated weight is computed as follows:
When the same concept appears in the positive and negative sets, we decide
to maintain it as positive or negative as follows.</p>
      <p>Let’s note:
 w: the weight of the redundant concept
 moy: the average of all weights of positive concepts
 nb: the number of positive concept
 : the standard deviation between weight of positive concept.
According to the following formula we decide if either the redundant
concept is positive or negative.</p>
      <p>If w [ moy- , moy+ ], the concept is considered as positive
and keeps the same weight else, we cancel its weight value.</p>
    </sec>
    <sec id="sec-4">
      <title>Query refinement.</title>
      <p>The final stage of the proposed process is an inter-concept relation study
which goal is to enhance the query interpretation. Two possible scenarios
can occur:
 Query expansion: when adding new concepts to the previous set.
 Concepts reweighting: when modifying existing concepts weights.</p>
      <p>We built an inter-concepts semantic graph where concepts are
interconnected between them. Weights within edges represent the semantic
similarity value computed according to the WordNet hierarchy.</p>
      <p>Fig 5 shows a partial view of the semantic graph:
0.75
0.64
0.25
trees
0.12
0.73
flowers</p>
      <p>0.44
0.67
garden
water
0.57
sky
0.62</p>
      <p>0.6
0.2
0.2
0.44
plants
0.22
0.12</p>
      <p>
        0.25
outdoor
We perform a random walk over the semantic graph to refine our concepts’
list either by reweighting concepts or by adding new concepts according to
the semantic similarity measure between concepts. We attempt to use
another type of graph called contextual graph [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to bring the best refinement.
We also use another graph called contextual one [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to bring the best
refinement. This last, is built based on the contextual information
interconcepts dealing with the co-occurrence by exploring Flicker resources as a
largest public available multimedia corpus.
      </p>
      <p>
        To more refine our query after random walk step and the addition of new
relevant concepts, we try to manage the semantic conflict [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] between the
selected concepts: exclusion and implication. An exclusion conflict traduces a
contradiction between selected concepts, i.e concepts “indoor” andt- “ou
door” cannot simultaneously occur in the same query. For casusec,h we
have to decide which one to eliminate. The implication case is implied when
two concepts are closely related (according to the semantic similarity value).
In such case, concepts in relation with theses in the query are added to this
last.
2.2
      </p>
      <p>
        Query-Image similarity
Based on experimental semantic similarity measures study [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], we decide to
adopt an approach for semantic similarity between a given query and an
image that is analogous to Cosine similarity measure. The semantic similarity
between and , which are respectively the sets of query concepts and
image concepts, is defined as:
This semantic similarity is computed between the query concepts and each
concept sets of images that belong to the sub-collection relative to this
query. For other images belonging to , their similarity scores are equal to
zero.
      </p>
      <p>Instead of searching through a large collection, a user query is concerned in
only a part of selected image results from the whole collection. An inverted
file is a data structure where images are indexed by (concept × image)
structure instead of the conventional (image × concept) structure. It allows saving
much time since the algorithm search concerns only the sub-collections.
We denote by the vector of semantic similarity between a query and the
collection . It is defined as follows:</p>
      <p>This vector will be used as an input for refinement phase, which is provided
thanks to a random walk with restart.
2.3</p>
      <p>
        Result refinement
We try to achieve a high-quality relevance-based result not only with
choosing an adequate similarity measure but also with improving obtained scores.
Therefore, elaborating a random walk [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] is an attempt for enhancing results.
In this section, we denote by a similarity matrix whose element
indicates the similarity between images and and the elements on the
diagonal are null.
      </p>
      <p>
        Semantic similarity scores vector according to a given query and the
interimages semantic similarity graph are the input of random walk process
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. It is defined as: consider a random image that starts from node i, the
image iteratively transmits to its neighborhood with the probability that is
proportional to their edge weights. Also at each step, it has some probability
c to return to the node i.
      </p>
      <p>After some iteration we have an optimal scores vector .</p>
      <p>The inter-images semantic similarity graph illustrates the semantic
relationship between images in the collection. It informs on the semantic
similarity degree between each pair of images. We adopt an approach for semantic
similarity between tow images that is analogous to Jaccard similarity. The
similarity between and , which are respectively the sets of image
concepts and image concepts, is defined as:
The semantic similarity graph consists in the matrix , which must be
normalized. In fact, there are several ways to normalize the weighted matrix .</p>
      <p>The most natural way might be the row normalization. Complementarily, we
normalize W using the normalized graph Lapalician, as follows:
With the vector is defined as:
The refinement method consists in a random walk process, and it will
converge to a fixed point. We use the graph that illustrates relations between
images for the random walk since the relevance scores of semantic similar
images should be close. The convergence condition consists in comparing the
difference between two successive scores vectors resulting by the Random
Walk with a fixed parameter ɛ.
3</p>
    </sec>
    <sec id="sec-5">
      <title>Experimental results</title>
      <p>We describe the experimental study conducted to evaluate the proposed
approach within the relevance computing. The experiments were performed
on ImageCLEF12 benchmark.</p>
      <p>The relevance-based approach for image retrieval is experimented with the
Visual concept retrieval using Flickr photos1 task. It consists in analyzing a
Flickr photos collection. We look for extracting concepts from the visual and
textual features. These concepts are used in image retrieval.</p>
      <p>The global results of our system are relatively bad. This fact is explained by
the huge number of selected concepts related to each query. This flow of
information can deviate the real meaning of the initial query.</p>
      <p>For example the treatment of the query n°29 “sleeping baby”with the
description ”The user is looking for photos showing a sleeping (calm, quiet)
ba</p>
      <p>by” , provides a list of concepts composed from 11 items which are: one,
baby, partial_blur, day, no_blur ,portrait, indoor, happy, calm, overlay,
smoke.</p>
      <p>We notice that the appearance of wrong concepts is the consequence of the
query to concept mapping shortage. For example, the keywords “sleeping”
and “quiet” appearing in the previous textual query are respectively
projected to “smoke” and “day” after performing the step of query to concept
mapping.</p>
      <p>For the query n°5 “hot air ballon”, selected concepts after query analysis
process are far from the user request. In fact, these concepts are textually
close to the query (air ballon, airplane) but semantically different.</p>
      <p>The deep study of queries individually shows that our approach can perform
well in some cases. for expamle , the query n°25 “grasscrfeiealtdionr”e
previously mentioned in Fig2. In effect, the majority of extracted keywords
from the textual query belong to the concepts set. (grass is a concept)
Finally, we notice that the setting of the parameter in the random walk has
relatively minor impact on the ordering of the images in the result list.
4</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>In this work, we present our proposed approach that deals with two
important steps in the retrieval process, which are query analysis and
relevance-based ranking. Query analysis takes into account two forms of queries:
textual and visual which present the form of queries provided by ImageCLEF.
Relevance-based result is based on choosing an adequate similarity measure
and enhancing obtained scores by elaborating a random walk with restart,
which is performed over inter-images semantic similarity graph.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Amel</given-names>
            <surname>Ksibi</surname>
          </string-name>
          ,
          <article-title>Anis Ben Ammar and Chokri Ben AmarREGIMvid at ImageCLEF2011: Integrating Contextual Information to Enhance Photo Annotation and Concept-based Retrieval,2011</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>G.</given-names>
            <surname>Salton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mcgill</surname>
          </string-name>
          ,
          <article-title>Introduction to Modern Information Retrieval</article-title>
          .
          <string-name>
            <surname>McGraw-Hill</surname>
          </string-name>
          ,
          <year>1983</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Christiane</given-names>
            <surname>Fellbaum WordNet: An Electronic Lexical Database</surname>
          </string-name>
          . Cambridge, MA: MIT Press,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>M.</given-names>
            <surname>Zarka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Elleuch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.B.</given-names>
            <surname>Ammar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.M.</given-names>
            <surname>Alimi</surname>
          </string-name>
          ,
          <article-title>A Fuzzy Ontology-Based Framework for reasoning in Visual Video Content Analysis and Indexing</article-title>
          ,
          <source>In the intl Workshop on Multimedia Data Mining</source>
          ,
          <year>2011</year>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Amel</given-names>
            <surname>Ksibi</surname>
          </string-name>
          , Anis Ben Ammar, Chokri Ben Amar, Integrating Contextual Information to Enhance
          <source>Photo Annotation and Concept-based Retrieval</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Sabrina</given-names>
            <surname>Tollari</surname>
          </string-name>
          , Marcin Detyniecki, Christophe Marsala, Ali Fakeri Tabrizi,
          <string-name>
            <surname>Massih-Reza</surname>
            <given-names>Amini</given-names>
          </string-name>
          , Patrick Gallinari,
          <article-title>"Exploiting Visual Concepts to Improve Text-Based Image Retrieval"</article-title>
          ,
          <source>European Conference on Information Retrieval (ECIR)</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. S.
          <string-name>
            <surname>-H. Cha</surname>
          </string-name>
          , Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions,
          <source>International Journal Of Mathematical Models And Methods In Applied Sciences,Issue</source>
          <volume>4</volume>
          , Vol
          <volume>1</volume>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>H.</given-names>
            <surname>Tong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Faloutsos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-Yu</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <article-title>Random walk with restart: fast solutions and applications</article-title>
          ,
          <source>KnowlInfSyst</source>
          (
          <year>2008</year>
          )
          <volume>14</volume>
          ,
          <fpage>327</fpage>
          -
          <lpage>346</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>K.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.-Sheng</given-names>
            <surname>Hua and H.-J. Zhang</surname>
          </string-name>
          ,
          <article-title>Social Image Search with Diverse Relevance Ranking</article-title>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Boll</surname>
          </string-name>
          et al. (Eds.):
          <source>MMM</source>
          <year>2010</year>
          , LNCS
          <volume>5916</volume>
          ,
          <fpage>174</fpage>
          -
          <lpage>184</lpage>
          , Springer-Verlag Berlin Heidelberg
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>