<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ub-botswana participation to CLEF eHealth IR challenge 2017: Task 3 (IRTask1 : ad-hoc search)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Edwin Thuma</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nkwebi Motlogelwa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tebo Leburu-Dingalo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Botswana</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, we describe the methods deployed in the different runs submitted for our participation to the CLEF eHealth 2017 Task 3: Patient-Centered Information Retrieval, IRTask 1: ad-hoc search. Speci cally, we deploy DPH term weighting model with explicit relevance feedback, where the expansion terms are selected from documents which were previously identi ed as relevant by assessors for each query. As improvement we deployed proximity search using both Full Dependence (FD) and Sequential Dependence (SD) variants of the Markov Random Fields and the Divergence From Randomness (DFR) based dependence models to re-rank documents, which have query terms in close proximity. In another approach, we deploy pseudo relevance feedback, where the expansion terms are selected from the top 3 ranked documents after a rst pass retrieval. In addition, we deploy proximity search using the SD variant of the DFR based dependence model.</p>
      </abstract>
      <kwd-group>
        <kwd>Explicit relevance Feedback</kwd>
        <kwd>Proximity Search</kwd>
        <kwd>Pseudo Relevance Feedback</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In this paper, we describe the methods used for our participation to the CLEF
eHealth 2017 Task 3: Patient-Centered Information Retrieval, IRTask 1:
adhoc search. Detailed task description is available in the overview paper of Task
3 [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. This task is a continuation of the previous CLEF eHealth Information
Retrieval (IR) task that ran in 2013 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], 2014 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], 2015 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and 2016 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The CLEF
eHealth task aims to evaluate the e ectiveness of information retrieval systems
when searching for health related content on the web, with the objective to
foster research and development of search engines tailored to health information
seeking [
        <xref ref-type="bibr" rid="ref5 ref6">6, 5</xref>
        ]. The CLEF eHealth Information Retrieval task was motivated by
the problem of users of information retrieval systems formulating circumlocutory
queries, using colloquial language instead of medical terms as studied by Zuccon
et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and Stanton et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In their studies, they found that modern search
engines are ill-equipped to handle such queries; only 3 out of the to 10 results
were highly useful for self diagnosis. In this paper, we attempt to tackle this
problem by using explicit relevance feedback in order to improve the retrieval
e ectiveness. In addition, we deploy proximity search to further improve the
retrieval e ectiveness of our system. Moreover, we investigate whether pseudo
relevance feedback, where the expansion terms are selected from the top 3 ranked
documents after a rst pass retrieval can improve the retrieval e ectiveness. This
paper is structured as follows. Section 2 contains a background on algorithms
used. Section 3 describes the experimental environment. In Section 4, we describe
the experimental the 5 runs submitted by team ub-botswana. Section 5 presents
and discusses results on training data.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>In this section, we begin by presenting a brief but essential background on the
di erent algorithms used in our experimental investigation and evaluation. We
start describing the DPH term weighting model in Section 2.1. We then describe
the Bose-Einstein 1 (Bo1) model for query expansion in Section 2.2.
2.1</p>
      <sec id="sec-2-1">
        <title>DPH Term Weighting Model</title>
        <p>
          For all our experimental investigation and evaluation we used the
parameterfree DPH term weighting model from the Divergence from Randomness (DFR)
framework [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. The DPH term weighting model calculates the score of a
document d for a given query Q as follows:
scoreDPH(d; Q) = Pt2Q qtf norm tf log((tf avlg l ) ( tNfc )) + 0:5 log(2
tf (1 tMLE))
(1)
where qtf , tf and tf c are the frequencies of the term t in the query Q , in the
document d and in the collection C respectively. N is number of documents in
the collection C, avg l is the average length of documents in the collection C
and l is the length of the document d. tMLE = tlf and norm = (1 tMLE)2 .
tf+1
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Bose-Einstein 1 (Bo1) Model for Query Expansion</title>
        <p>
          In our experimental investiagtion and evaluation, we used the Terrier-4.0
Divergence from Randomness (DFR) Bose-Einstein 1 (Bo1) model to select the
most informative terms from the topmost documents after a rst pass document
ranking. The DFR Bo1 model calculates the information content of a term t in
the top-ranked documents as follows [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]:
w(t) = tf x log2
        </p>
        <p>+ log2(1 + Pn(t))
1 + Pn(t)</p>
        <p>Pn(t)
Pn(t) =
tf c
N
(2)
(3)
where Pn(t) is the probability of t in the whole collection, tf x is the frequency
of the query term in the top x ranked documents, tf c is the frequency of the
term t in the collection, and N is the number of documents in the collection.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experimental Setting</title>
      <p>FAQ Retrieval Platform: For all our experimental evaluation, we used
Terrier4.2, an open source Information Retrieval (IR) platform. All the documents
(ClueWeb 12 B13) used in this study were rst pre-processed before indexing
and this involved tokenising the text and stemming each token using the full
Porter stemming algorithm. Stopword removal was enabled and we used Terrier
stopword list. The index was created using blocks to save positional information
with each term. For pseudo relevance feedback, we used Terrier-4.2 DFR
BoseEinstein 1 (Bo1) model for query expansion to select the 10 most informative
terms from the top 3 ranked documents.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Description of the Di erent Runs</title>
      <p>Term Weighting Model: For all our runs, we used the parameter-free DPH
Divergence From Randomness term weighting model in Terrier-4.2 IR platform to
score and rank the documents in the ClueWeb 12 B13 document collection.
ub-botswana IRTask1 run1: We ranked the documents using DPH DFR term
weighting. As improvement, we deployed explicit relevance feedback, where we
selected expansion terms from the top 3 documents that were explicitly marked
relevant by assessors for each query. We used the Terrier-4.2 DFR Bose-Einstein
1 (Bo1) model for query expansion to select the 10 most informative terms from
these documents. In addition, we deployed the Full Dependence (FD) variant
of the Markov Random Fields for terms dependence. Full Dependence assumes
all query terms are in some way dependent on each other. In this work, we
experimentally selected a window size of 15, which yielded the highest retrieval
performance on the training data.
ub-botswana IRTask1 run2: We performed a rst pass retrieval using DPH DFR
term weighting model. As improvement, we deployed explicit relevance feedback,
where we deployed DFR Bo1 model for query expansion to select the expansion
terms.
ub-botswana IRTask1 run3: We produced an initial ranking using DPH DFR
term weighting. As improvement, we deployed explicit relevance feedback and
used the DFR Bo1 model for query expansion to select the expansion terms.
In addition, we deployed the Sequential Dependence (SD) variant of the
Divergence from Randomness based dependence model. Sequential Dependence only
assumes a dependence between neighbouring query terms. In this work, we
experimentally selected a window size of 15, which yielded the highest retrieval
performance on the training data.
ub-botswana IRTask1 run4: We used the parameter-free DPH DFR term
weighting model to produce and initial ranking. As improvement, we deployed a simple
pseudo-relevance feedback on the local collection. We used the Bo1 model for
query expansion to select the expansion terms. We then performed a second pass
retrieval on the local collection with the new expanded query.
ub-botswana IRTask1 run5: We used ub-botswana IRTask1 run4 as the baseline
system. As improvement, we deployed the Sequential Dependence (SD) variant
of the Divergence from Randomness based term dependence model. Sequential
Dependence only assumes a dependence between neighbouring query terms. In
this work, we experimentally selected a window size of 15, which yielded the
highest retrieval performance on the training data.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Results and Discussion</title>
      <p>These working notes were compiled and submitted before the relevance
judgments were released. Below we present the results of our runs using the 2016
query relevance judgments. Please note that the o cial results to be released
will be di erent because new query relevance judgments will be released
Table 1 presents our results on the training data. From this table, we see
a degradation in performance when we incorporate term dependence only in
our ranking (ub-botswana IRTask1 run5 ). However, when we deploy pseudo
relevance feedback (ub-botswana IRTask1 run4 ), we see an improvement in the
retrieval performance in terms of precision at 5 (P@5), precision at 10 (P@10)
and recall (rel ret). Moreover signi cant improvement in the recall is obtained
when explicit relevance feedback is deployed ((ub-botswana IRTask1 run1 ),
(ubbotswana IRTask1 run2 ) and (ub-botswana IRTask1 run3 )). In addition, we
obtain mixed results when we incorporate proximity search after deploying explicit
relevance feedback. For example, there was an improvement in the retrieval
performance in terms of P@5 and P@10 when we deploy the FD variant of the
Markov Random Fields for term dependence using a window size of 15
(ubbotswana IRTask1 run1 ). In contrast, we obtain a degradation in the retrieval
performance in terms of P@5 and P@10 when we deploy the SD variant of the
Divergence from Randomness based term dependence model using a window size
of 15 (ub-botswana IRTask1 run5 ).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>G.</given-names>
            <surname>Amati</surname>
          </string-name>
          .
          <article-title>Probabilistic Models for Information Retrieval based on Divergence from Randomness</article-title>
          . University of Glasgow,UK,
          <source>PhD Thesis</source>
          , pages
          <volume>1</volume>
          {
          <fpage>198</fpage>
          ,
          <year>June 2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>G.</given-names>
            <surname>Amati</surname>
          </string-name>
          , E. Ambrosi,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bianchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gaibisso</surname>
          </string-name>
          , and
          <string-name>
            <surname>G. Gambosi. FUB</surname>
          </string-name>
          , IASICNR and University of Tor Vergata at
          <article-title>TREC 2007 Blog Track</article-title>
          .
          <source>In Proceedings of the 16th Text REtrieval Conference (TREC-2007)</source>
          , pages
          <fpage>1</fpage>
          {
          <fpage>10</fpage>
          ,
          <string-name>
            <surname>Gaithersburg</surname>
          </string-name>
          , Md., USA.,
          <year>2007</year>
          . Text REtrieval Conference (TREC).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>L.</given-names>
            <surname>Goeuriot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.J.F</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kelly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Leveling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , H. Muller, S. Salantera,
          <string-name>
            <given-names>H.</given-names>
            <surname>Suominen</surname>
          </string-name>
          , and G. Zuccon.
          <source>ShARe/CLEF eHealth Evaluation Lab</source>
          <year>2013</year>
          ,
          <article-title>Task 3: Information Retrieval to Address Patients' Questions when Reading Clinical Reports</article-title>
          .
          <source>In CLEF 2013 Online Working Notes</source>
          , volume
          <volume>8138</volume>
          .
          <string-name>
            <surname>CEUR-WS</surname>
          </string-name>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>L.</given-names>
            <surname>Goeuriot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kelly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Palotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pecina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.J.F</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>H.</given-names>
            <surname>Mueller</surname>
          </string-name>
          . Share/clef ehealth Evaluation
          <source>Lab</source>
          <year>2014</year>
          ,
          <article-title>Task 3: UserCentred Health Information Retrieval</article-title>
          .
          <source>In CLEF 2014 Online Working Notes. CEUR-WS</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Liadh</given-names>
            <surname>Kelly</surname>
          </string-name>
          , Lorraine Goeuriot, Hanna Suominen, Aurelie Neveol,
          <source>Joa~o Palotti, and Guido Zuccon. Overview of the CLEF eHealth Evaluation Lab</source>
          <year>2016</year>
          , pages
          <fpage>255</fpage>
          {
          <fpage>266</fpage>
          . Springer International Publishing, Cham,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>J.</given-names>
            <surname>Palotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Goeuriot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kelly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.J.F.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pecina</surname>
          </string-name>
          .
          <article-title>CLEF eHealth Evaluation Lab 2015 task 2: Retrieving Information about Medical Symptoms</article-title>
          .
          <source>In CLEF 2015 Online Working Notes. CEUR-WS</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>J.</given-names>
            <surname>Palotti</surname>
          </string-name>
          , G. Zuccon, Jimmy,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pecina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Goeuriot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kelly</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          .
          <article-title>CLEF 2017 Task Overview: The IR Task at the eHealth Evaluation Lab</article-title>
          . In In Working Notes of Conference and
          <article-title>Labs of the Evaluation (CLEF) Forum</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>I.</given-names>
            <surname>Stanton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ieong</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Mishra</surname>
          </string-name>
          .
          <article-title>Circumlocution in Diagnostic Medical Queries</article-title>
          .
          <source>In Proceedings of the 37th international ACM SIGIR conference on Research &amp; development in information retrieval</source>
          , pages
          <volume>133</volume>
          {
          <fpage>142</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Koopman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Palotti</surname>
          </string-name>
          .
          <article-title>Diagnose This If You Can: On the Effectiveness of Search Engines in Finding Medical Self-Diagnosis Information</article-title>
          .
          <source>In Advances in Information Retrieval (ECIR</source>
          <year>2015</year>
          ), pages
          <fpage>562</fpage>
          {
          <fpage>567</fpage>
          . Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>