<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic query expansion for fuzzy proximity information retrieval model</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Author Bissan AUDEH</string-name>
          <email>audeh@emse.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Supervisors Philippe BEAUNE</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michel BEIGBEDER</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olivier BOISSIER</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Affiliation École Nationale Supérieure des Mines de Saint-Etienne</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Aims and Objectives of the Research Our research aim is to ameliorate the recall of the Fuzzy Proximity Information Retrieval Model (FPIRM) of Beigbeder &amp; Mercier (2005), their approach is very “precise” when evaluating internationally agreed upon collections of documents used for benchmarking. The precision in this case for each query is the ratio of relevant documents retrieved over documents returned. However, the recall is weak. By recall we mean the ratio of relevant documents retrieved over all relevant documents in the collection (Rijsbergen, 1979). FPIRM approach evaluates the relevance between a document and a query by using a fuzzy function that takes into account the distance between the occurrences of query terms in a document. Our research studies the use of semantic query expansion to increase the recall of their model.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Our testbed is the collection INEX3 2009, which contains all English articles of Wikipedia
2008 (2’600’000 documents). These documents are annotated using YAGO. We are
considering the use of this ontology to find new terms that are semantically related to query
terms.</p>
      <p>
        Research Results to Date
During our study, we’ve experimented with the use of the lexical thesaurus WordNet, where
we looked up all synonyms of each term in the query. This trivial use of WordNet produced
the query drift. This is a common issue related to query expansion and caused by the
“alteration of the focus of a search topic”
        <xref ref-type="bibr" rid="ref3">(Mitra, Singhal, &amp; Buckley, 1998)</xref>
        . Figure 1 shows
how query expansion using only WordNet synonyms affected the performance of the fuzzy
proximity model.
3 INEX (INitiative for the Evaluation of XML retrieval) an information retrieval evaluation forum
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Beigbeder</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Mercier</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>An information retrieval model using the fuzzy proximity degree of term occurences</article-title>
          .
          <source>Proceedings of the 2005 ACM symposium on Applied computing - SAC '05</source>
          ,
          <fpage>1018</fpage>
          . New York, USA: ACM Press.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raghavan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Schütze</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2008</year>
          ). Introduction to Information Retrieval. Introduction to Information Retrieval (pp.
          <fpage>177</fpage>
          -
          <lpage>194</lpage>
          ). Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singhal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Buckley</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>Improving automatic query expansion</article-title>
          .
          <source>Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '98</source>
          ,
          <fpage>206</fpage>
          -
          <lpage>214</lpage>
          . New York, USA: ACM Press.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Rijsbergen</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          (
          <year>1979</year>
          ).
          <source>INFORMATION RETRIEVAL. Butterworth-Heinemann</source>
          <year>1979</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Croft</surname>
            ,
            <given-names>W. B.</given-names>
          </string-name>
          (
          <year>1996</year>
          ).
          <article-title>Query expansion using local and global document analysis</article-title>
          .
          <source>Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '96</source>
          ,
          <fpage>4</fpage>
          -
          <lpage>11</lpage>
          . New York, USA: ACM Press.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>