<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Toward a Cloud-Based Integration of IR Tools</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Allan Hanbury</string-name>
          <email>hanbury@ifs.tuwien.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mihai Lupu</string-name>
          <email>lupu@ifs.tuwien.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Software Technology and Interactive Systems Vienna University of Technology Favoritenstraße 9-11/188 A-1040 Vienna</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <fpage>13</fpage>
      <lpage>16</lpage>
      <abstract>
        <p>This position paper presents the case for creating a cloud-based infrastructure for IR. The infrastructure should provide access to multiple components of IR systems and a method to easily integrate them in various configurations, as well as data and parameters used in IR experiments. Workflow tools are promising for flexible integration of components and sharing of configurations, and have already been adopted in multiple areas of computational science. This infrastructure should lead to better reproducibility of IR experiments, easier take-up of research results by industry and more effective IR evaluation.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>cloud. The use of standardised datasets made available on the cloud would enhance the ease of reproducibility
of experiments.</p>
      <p>Workflows have already gained acceptance in other branches of science as a method for distributing the setup
of a computational process, and should also be considered for adoption in IR. The use of workflows is discussed
briefly in Section 2. The availability of these multiple IR services would also revolutionise IR evaluation, as
discussed in Section 3. Finally, an advantage of making IR components more accessible is the possibility for
them to be easily taken up and experimented with by companies potentially wishing to use them in search
systems. Section 4 presents considerations on the cloud infrastrutures to use for all aspects of the proposed
approach.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Workflows</title>
      <p>Scientific workflows have recently emerged as a paradigm for representing and managing complex distributed
scientific computations. They orchestrate the dataflow across the individual data transformation and analysis
steps, as well as the mechanisms to execute them in a distributed environment [GDE+07, GdR09]. Workflows
have the potential to function as a unifying framework for the integration of IR tools.</p>
      <p>Initial steps to using workflows in IR and annotation were taken by Corubolo et al. [CWH07], who developed
a framework using the Kepler workflow tool combined with the Chesire3 search engine. This work is however
no longer accessible, and was done at an early stage in the development of scientific workflow tools, when more
primitive tools were available.</p>
      <p>In order to advance the use of workflows in IR, it will be necessary to create components for the workflow
implementing commonly used approaches in the IR pipeline (preferably based on open source software, although
executable components without source code are also conceivable). The creation of workflows for IR applications
has the following advantages:
• Connection with existing IR tools
• Sharing of workflows leading to better reproducibility of experiments
• Facilitation of component-based evaluation
• Rapid prototype development</p>
      <p>A number of open source workflow tools are available [DGST09]. A promising candidate is Taverna1, as it
is the most widely used tool on the myExperiment portal for sharing scientific workflows2. Taverna also has
the advantage that it has been integrated with the U-Compare UIMA-based text mining and natural language
processing system3 [KDN+10].</p>
      <p>With workflow techniques, it will be possible to set up IR experiments consisting of multiple standard
components rapidly. When investigating domain-specific search problems, the flexibility will be available to adapt
the workflow to the domain based on domain-specific data and knowledge. The rapid prototyping capabilities
provided by the proposed infrastructure will also be useful in rapid design and testing of user interfaces, in
particular those interfaces designed specificially for users performing highly-specialised search tasks.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Evaluation</title>
      <p>While evaluation is well-established in IR research, it suffers from a number of drawbacks in the way it is
currently implemented [Rob08]. The drawbacks include only evaluating full systems [HM10], poor reproducibility
of experiments and poor choice of baselines for comparison of the results of experiments [AMWZ09]. The
availability of multiple executable components, potentially linked into a workflow system, along with data and
experimental parameters stored in the cloud, would revolutionise IR evaluation [ABB+12]. At a basic level, it
would simplify research on the effect of constituent components of an IR system on the performance of the full
system [KE11]. Furthermore, it would be possible to relatively easily recreate an IR system published in a paper,
and then begin further research from this baseline. It is even conceivable that an automated evaluation using
multiple configurations of components could run continuously, providing a flow of experimental results on the
1http://taverna.org.uk
2http://www.myexperiment.org
3http://u-compare.org
performance of many component configurations analogous to how large experiments in physics (e.g. the Large
Hadron Collider) continuously produce data.</p>
      <p>However, evaluation will also have to be done from another point of view. For a researcher or developer
implementing an IR system for a specific set of data in a specific domain, it would be useful to be able to
evaluate which of the many components of a certain type available are optimal for that specific task. For
example, given a dataset to index, a description of the expected queries and results, which of the available
stemmers would provide the best performance? Being able to answer this type of question requires evaluation
guidelines going beyond the approaches currently standard in evaluation campaigns.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Cloud-based Experimentation</title>
      <p>The cloud has innovated a number of aspects of computing, as it provides the appearance of infinite computing
resources available on demand, eliminates up-front commitment by cloud users and provides the ability to pay
for the use of computing resources on a short-term basis as needed [AFG+10]. The abilities necessary for the
approach described in this paper are:
• Provide the ability to centrally store and make available large datasets — Cloud providers already provide
this service. For example, Amazon hosts public datasets free of charge4.
• Allow multiple users to process the stored data without requiring the data to be transfered elsewhere — this
is done through linking virtual storage drives to computing instances as required. For example, Amazon
public datasets are accessed in this way.
• Allow users to share executable components implemented in computing instances with other users — An
approach for doing this will have to be developed.</p>
      <p>There are a number of challenges in developing a suitable approach for the final point above. One can imagine
that snapshots of computing instances containing executable components are made available in a type of “app
store” for re-use. For open source software, the snapshots should also allow access to the code. In order to
encourage the use of the components in research, they should be made available free-of-charge for this purpose
(i.e., researchers would have to pay for computing time, but no license fees). However, if a company decides to
adopt a component in a system used commercially, then researchers should receive compensation of some kind.</p>
      <p>A further important consideration is who should provide this service. Commercial cloud providers are
already able to provide it, but choosing a single commercial provider could result in a “lock-in” of IR research
to a single provider, due to incompatibilities between services provided by different companies. Potentially, a
publicly-funded cloud infrastructure would be good for running IR research and evaluation experiments.
However, this would make the take-up by industry more complex as commercial services could not be run on this
infrastructure. A possible solution is an interface from the publicly-funded infrastructure to multiple commercial
cloud infrastructures allowing researchers to transfer components to these infrastructures for take-up by industry.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>There is currently a push to make the results of computational science more reproducible. If IR wishes to remain
at the forefront of this development, then it is necessary to implement an experimental infrastructure for IR.
Making executable components for IR systems as well as data and experimental parameters available on a
cloudbased infrastructure, along with tools such as a workflow infrastructure, is a good step toward fully reproducible
IR experiments. A first step toward carrying out evaluation on the cloud is being taken in the recently started
VISCERAL project [HML+12]. In VISCERAL, the focus is on taking advantage of the cloud to allow evaluation
to be done on multiple Terabytes of data, avoiding the necessity of first downloading the data (i.e. bringing the
algorithms to the data instead of the data to the algorithms). However, the creation of the experimental IR
infrastructure discussed in this paper has the ability to revolutionise IR experimentation beyond what is being
considered in the VISCERAL project.</p>
      <p>Acknowledgements
The research leading to these results has received funding from the European Union Seventh Framework
Programme (FP7/2007-2013) under grant agreements 318068 (VISCERAL) and 258191 (PROMISE).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [ABB+12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Agosti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Berendsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Bogers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Braschler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Buitelaar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Di Nunzio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Forner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Friberg Heppin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hansen</surname>
          </string-name>
          , A. J¨arvelin,
          <string-name>
            <given-names>B.</given-names>
            <surname>Larsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupu</surname>
          </string-name>
          , I. Masiero, H. Mu¨ller,
          <string-name>
            <given-names>S.</given-names>
            <surname>Peruzzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Petras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Piroi</surname>
          </string-name>
          , M. de Rijke, G. Santucci, G. Silvello, and
          <string-name>
            <given-names>E.</given-names>
            <surname>Toms</surname>
          </string-name>
          .
          <source>PROMISE Retreat Report: Prospects and Opportunities for Information Access Evaluation. PROMISE Network of Excellence</source>
          ,
          <year>2012</year>
          . http://www.promise-noe.
          <source>eu/promise-retreat-report-2012/.</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [ACMS12]
          <string-name>
            <given-names>James</given-names>
            <surname>Allan</surname>
          </string-name>
          , Bruce Croft, Alistair Moffat, and
          <string-name>
            <given-names>Mark</given-names>
            <surname>Sanderson</surname>
          </string-name>
          .
          <article-title>Frontiers, challenges, and opportunities for information retrieval: Report from SWIRL 2012</article-title>
          .
          <source>ACM SIGIR Forum</source>
          ,
          <volume>46</volume>
          (
          <issue>1</issue>
          ):
          <fpage>2</fpage>
          -
          <lpage>32</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [AFG+10]
          <string-name>
            <surname>Michael</surname>
            <given-names>Armbrust</given-names>
          </string-name>
          , Armando Fox, Rean Griffith,
          <string-name>
            <given-names>Anthony D.</given-names>
            <surname>Joseph</surname>
          </string-name>
          , Randy Katz, Andy Konwinski,
          <string-name>
            <given-names>Gunho</given-names>
            <surname>Lee</surname>
          </string-name>
          , David Patterson,
          <string-name>
            <given-names>Ariel</given-names>
            <surname>Rabkin</surname>
          </string-name>
          , Ion Stoica, and
          <string-name>
            <given-names>Matei</given-names>
            <surname>Zaharia</surname>
          </string-name>
          .
          <article-title>A view of cloud computing</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>53</volume>
          (
          <issue>4</issue>
          ):
          <fpage>50</fpage>
          -
          <lpage>58</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>[AMWZ09] Timothy</surname>
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Armstrong</surname>
            , Alistair Moffat,
            <given-names>William</given-names>
          </string-name>
          <string-name>
            <surname>Webber</surname>
            , and
            <given-names>Justin</given-names>
          </string-name>
          <string-name>
            <surname>Zobel</surname>
          </string-name>
          .
          <article-title>Improvements that don't add up: ad-hoc retrieval results since 1998</article-title>
          .
          <source>In CIKM '09: Proceeding of the 18th ACM conference on Information and knowledge management</source>
          , pages
          <fpage>601</fpage>
          -
          <lpage>610</lpage>
          . ACM,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [CWH07] [FS12] [GdR09] [HM10]
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Corubolo</surname>
          </string-name>
          , Paul Watry,
          <string-name>
            <surname>and John Harrison. Location</surname>
          </string-name>
          <article-title>and format independent distributed annotations for collaborative research</article-title>
          .
          <source>In Research and Advanced Technology for Digital Libraries</source>
          , volume
          <volume>4675</volume>
          , pages
          <fpage>495</fpage>
          -
          <lpage>498</lpage>
          . Springer Berlin / Heidelberg,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [DGST09]
          <string-name>
            <given-names>Ewa</given-names>
            <surname>Deelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Dennis</given-names>
            <surname>Gannon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Matthew</given-names>
            <surname>Shields</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Ian</given-names>
            <surname>Taylor</surname>
          </string-name>
          .
          <article-title>Workflows and e-science: An overview of workflow system features and capabilities</article-title>
          .
          <source>Future Generation Computer Systems</source>
          ,
          <volume>25</volume>
          (
          <issue>5</issue>
          ):
          <fpage>528</fpage>
          -
          <lpage>540</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Juliana</given-names>
            <surname>Freire and Claudio</surname>
          </string-name>
          <string-name>
            <given-names>T.</given-names>
            <surname>Silva</surname>
          </string-name>
          .
          <article-title>Making computations and publications reproducible with VisTrails</article-title>
          .
          <source>Computing in Science &amp; Engineering</source>
          ,
          <volume>14</volume>
          (
          <issue>4</issue>
          ):
          <fpage>18</fpage>
          -
          <lpage>25</lpage>
          ,
          <year>August 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [GDE+07]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gil</surname>
          </string-name>
          , E. Deelman,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ellisman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Fahringer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gannon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Goble</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Livny</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Moreau</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Myers</surname>
          </string-name>
          .
          <article-title>Examining the challenges of scientific workflows</article-title>
          .
          <source>Computer</source>
          ,
          <volume>40</volume>
          (
          <issue>12</issue>
          ):
          <fpage>24</fpage>
          -
          <lpage>32</lpage>
          , dec.
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>C.</given-names>
            <surname>Goble</surname>
          </string-name>
          and D. de Roure.
          <article-title>The impact of workflow tools on data-centric research</article-title>
          . In Tony Hey,
          <string-name>
            <given-names>Stewart</given-names>
            <surname>Tansley</surname>
          </string-name>
          , and Kristin Tolle, editors,
          <source>The fourth paradigm: data-intensive scientific discovery</source>
          , pages
          <fpage>137</fpage>
          -
          <lpage>145</lpage>
          . Microsoft Research,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>In Multilingual and Multimodal Information Access Evaluation</source>
          , volume
          <volume>6360</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>124</fpage>
          -
          <lpage>135</lpage>
          . Springer,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [HML+12]
          <string-name>
            <surname>Allan</surname>
            <given-names>Hanbury</given-names>
          </string-name>
          , Henning Mu¨ller, Georg Langs, Marc Weber,
          <string-name>
            <given-names>Bjoern</given-names>
            <surname>Menze</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Fernandez</surname>
          </string-name>
          .
          <article-title>Bringing the algorithms to the data: Cloud-based benchmarking for medical image analysis</article-title>
          .
          <source>In Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics</source>
          , volume
          <volume>7488</volume>
          <source>of LNCS</source>
          , pages
          <fpage>24</fpage>
          -
          <lpage>29</lpage>
          . Springer,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>[IHGC12] Darrel</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Ince</surname>
          </string-name>
          , Leslie Hatton, and John Graham-Cumming.
          <article-title>The case for open computer programs</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Nature</surname>
          </string-name>
          ,
          <volume>482</volume>
          (
          <issue>7386</issue>
          ):
          <fpage>485</fpage>
          -
          <lpage>488</lpage>
          ,
          <year>February 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [KDN+10]
          <string-name>
            <surname>Yoshinobu</surname>
            <given-names>Kano</given-names>
          </string-name>
          , Paul Dobson, Mio Nakanishi,
          <string-name>
            <surname>Jun'ichi Tsujii</surname>
            , and
            <given-names>Sophia</given-names>
          </string-name>
          <string-name>
            <surname>Ananiadou</surname>
          </string-name>
          .
          <article-title>Text mining meets workflow: linking u-compare with taverna</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>26</volume>
          (
          <issue>19</issue>
          ):
          <fpage>2486</fpage>
          -
          <lpage>2487</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [KE11]
          <article-title>[Rob08] [Sto09] Jens Ku¨rsten and Maximilian Eibl. A large-scale system evaluation on component-level</article-title>
          .
          <source>In Advances in Information Retrieval</source>
          , volume
          <volume>6611</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>679</fpage>
          -
          <lpage>682</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          Springer Berlin / Heidelberg,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Stephen</given-names>
            <surname>Robertson</surname>
          </string-name>
          .
          <article-title>On the history of evaluation in IR</article-title>
          .
          <source>Journal of Information Science</source>
          ,
          <volume>34</volume>
          (
          <issue>4</issue>
          ):
          <fpage>439</fpage>
          -
          <lpage>456</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <source>Computing in Science &amp; Engineering</source>
          ,
          <volume>11</volume>
          (
          <issue>1</issue>
          ):
          <fpage>35</fpage>
          -
          <lpage>40</lpage>
          ,
          <year>February 2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>