<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The KIMERA Infrastructure: Shifting from Evaluation-as-a-Service to Evaluation-in-the-Cloud</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrea Pasin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicola Ferro</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Padua</institution>
          ,
          <addr-line>Padua</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Experimental evaluation plays a key role in Information Retrieval (IR), and Evaluation-as-a-Service (EaaS) was proposed as a viable approach for eficiently running experiments without distributing experimental collections. We now introduce Kubernetes Infrastructure for Managed Evaluation and Resource Access (KIMERA), a cloud-based platform implemented with Kubernetes that advances EaaS toward Evaluation-in-the-Cloud (EitC), enabling researchers to develop and run IR systems directly through a web interface. KIMERA ensures scalability, reproducibility, and fairness across experiments, and it can integrate easy access to external services such as Large Language Models and Quantum Computing via APIs. It supports detailed resource tracking for a comprehensive evaluation of efectiveness and eficiency.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Infrastructure</kwd>
        <kwd>Kubernetes</kwd>
        <kwd>Docker</kwd>
        <kwd>Evaluation</kwd>
        <kwd>Reproducibility</kwd>
        <kwd>Quantum Computing</kwd>
        <kwd>Large Language Models</kwd>
        <kwd>Information Retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>These limitations became evident during the QuantumCLEF lab [20, 22, 19], which required
using real quantum computers [17, 10], available via APIs, for solving IR and Recommender
Systems (RS) tasks. Participants could not use our quantum resources because API keys cannot
be disclosed. Additionally, the proposed tasks involved the evaluation of both efectiveness and
eficiency, thus requiring accurate resource usage monitoring, which is a feature that is usually
not available in traditional EaaS platforms.</p>
      <p>To overcome these challenges, we introduce Kubernetes Infrastructure for Managed
Evaluation and Resource Access (KIMERA), an open-source infrastructure built with Docker and
Kubernetes to support reproducible, secure, and fair evaluation. KIMERA allows participants to
code and run systems directly from the browser, without local installations or containerization
expertise. Organizers can control resource allocation and track all submissions for enhanced
comparative analysis. While tailored for Quantum Computing (QC), KIMERA can be employed
for other scenarios as well, including LLM-based pipelines, thus shifting from EaaS to a more
lfexible Evaluation-in-the-Cloud (EitC) model.</p>
      <p>This work is organized as follows. Section 2 details our approach. Section 3 describes the
experimental setup and the results. Section 4 presents conclusions and potential future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methodology</title>
      <p>The KIMERA infrastructure is designed to provide a scalable, reproducible, and accessible IR
evaluation environment using modern containerization and orchestration technologies. KIMERA
is implemented with Docker and Kubernetes. It grants participants access to computational
resources (e.g., real quantum computers) without requiring them to manage API agreements
or infrastructure setup. Users interact with browser-based workspaces that resemble the
Visual Studio Code Integrated Development Environment (IDE), preconfigured for Python
(with support for other languages via extensions), and can code and run experiments even
from smartphones or tablets. Quotas and executions are monitored through dashboards and
logged for reproducibility and analysis. The system supports both efectiveness and eficiency
evaluation by collecting detailed runtime metrics.</p>
      <sec id="sec-2-1">
        <title>2.1. Infrastructure Components</title>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Key Characteristics</title>
        <p>Scalability. KIMERA supports deployment on single or multi-node clusters. Kubernetes
allows horizontal scaling (e.g., replicating Dispatcher/WebApp under high load). To ensure fair
eficiency evaluation, all workspace nodes should have identical hardware.</p>
        <p>Error-handling. Kubernetes monitors all components, making sure to auto-restart pods when
crashing (e.g., after Out-Of-Memory errors), and redistribute workloads upon node failure,
ensuring high availability.</p>
        <p>Resource monitoring. Resource limits are enforced per component. This guarantees fairness,
comparability, and reproducibility of runtime measurements.</p>
        <p>Security. The communication with and within KIMERA is secured through the HTTPS protocol;
only necessary APIs are exposed via the nginx component. Workspaces are protected with
passwords, and confidential configuration data (e.g., API keys) is isolated via Kubernetes Secrets.
Accessibility. Unlike traditional EaaS platforms requiring local development, KIMERA allows
browser-based coding and execution of experiments, lowering the barrier for participants
lacking advanced hardware or containerization skills.</p>
        <p>Shared
components
dns-service</p>
        <p>dnsdeployment</p>
        <p>DNS
ConfigMap
metadata
Secret
password
+ tokens
database-pvc
databasestorage
database
files</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <p>We demonstrate the applicability of KIMERA through two real-world use cases: the
QuantumCLEF 2024 and 2025 shared task and tutorials at ECIR 2024 and SIGIR 2024. Additionally, we
highlight the infrastructure’s potential for broader IR evaluation tasks beyond QuantumCLEF.</p>
      <sec id="sec-3-1">
        <title>3.1. QuantumCLEF</title>
        <p>KIMERA supported the first and second edition of the QuantumCLEF shared task [ 21, 23, 24],
which involved a total of 12 teams developing quantum and traditional solutions for IR and
RS tasks. In total, 8059 submissions were processed, with quantum executions taking about 5
minutes and traditional ones over 16 hours. Submission activity peaked near the deadline of
each edition, marking these as high-load periods.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Quantum Computing Tutorials</title>
        <p>
          KIMERA was also used at ECIR 2024 [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and SIGIR 2024 [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] to support hands-on quantum
computing tutorials. Since access to quantum hardware typically requires individual API keys
and contracts, KIMERA provided a practical workaround by ofering pre-configured,
browserbased workspaces. Participants, many of whom were engaging with quantum resources for
the first time, gained hands-on experience in a shared, accessible environment, demonstrating
KIMERA’s value for education and training.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Broad Applicability</title>
        <p>KIMERA can be extended for broader use. Individual researchers could leverage it for
reproducible experiments in pre-configured environments. To support this, we plan to introduce
a component for automatic evaluation, although this will not be available in shared tasks to
prevent overfitting on test sets.</p>
        <p>
          Additionally, as shared tasks increasingly involve LLMs [
          <xref ref-type="bibr" rid="ref7">14, 7</xref>
          ], KIMERA can be adapted
to manage quota-based API access to such models with minimal architectural changes. Its
user-friendly design and minimal hardware requirements make it a versatile platform that can
broaden participation and improve the quality and diversity of shared task submissions.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions</title>
      <p>This paper presents KIMERA, an infrastructure designed to transition from EaaS to EitC in
IR, providing monitored, quota-based access to API-based computational resources such as
quantum computers and LLMs. Used in the QuantumCLEF shared task, KIMERA enhanced
reproducibility and comparability while lowering the entry barrier. Although originally tailored
for QuantumCLEF, KIMERA can serve as a general-purpose evaluation platform for shared tasks
or individual research. Its support for quota monitoring and user-friendly, browser-accessible
workspaces makes it well-suited for experimental research where resource access is provided
via API keys. In the future, we plan to enhance KIMERA with features for easier shared task
management and automated evaluation.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author did not use any AI tool.
Retrieval, ECIR 2023, Dublin, Ireland, April 2-6, 2023, Proceedings, Part III, volume 13982 of
Lecture Notes in Computer Science, pages 236–241. Springer, 2023.
[10] Muhammed Golec, Emir Sahin Hatay, Mustafa Golec, Murat Uyar, Merve Golec, and
Sukhpal Singh Gill. Quantum cloud computing: Trends and challenges. Journal of Economy
and Technology, 2024.
[11] D. K. Harman. Information Retrieval Evaluation. Morgan &amp; Claypool Publishers, USA,
2011.
[12] D. K. Harman and E. M. Voorhees, editors. TREC. Experiment and Evaluation in Information</p>
      <p>Retrieval. MIT Press, Cambridge (MA), USA, 2005.
[13] Frank Hopfgartner, Allan Hanbury, Henning Müller, Ivan Eggel, Krisztian Balog,
Torben Brodt, Gordon V. Cormack, Jimmy Lin, Jayashree Kalpathy-Cramer, Noriko Kando,
Makoto P. Kato, Anastasia Krithara, Tim Gollub, Martin Potthast, Evelyne Viegas, and
Simon Mercer. Evaluation-as-a-service for the computational sciences: Overview and
outlook. ACM J. Data Inf. Qual., 10(4):15:1–15:32, 2018.
[14] Jussi Karlgren, Luise Dürlich, Evangelia Gogoulou, Liane Guillou, Joakim Nivre, Magnus
Sahlgren, Aarne Talman, and Shorouq Zahra. Overview of ELOQUENT 2024 - shared
tasks for evaluating generative language model quality. In Lorraine Goeuriot, Philippe
Mulhem, Georges Quénot, Didier Schwab, Giorgio Maria Di Nunzio, Laure Soulier, Petra
Galuscáková, Alba García Seco de Herrera, Guglielmo Faggioli, and Nicola Ferro, editors,
Experimental IR Meets Multilinguality, Multimodality, and Interaction - 15th International
Conference of the CLEF Association, CLEF 2024, Grenoble, France, September 9-12, 2024,
Proceedings, Part II, volume 14959 of Lecture Notes in Computer Science, pages 53–72.</p>
      <p>Springer, 2024.
[15] Jimmy Lin and Miles Efron. Evaluation as a service for information retrieval. SIGIR Forum,
47(2):8–14, 2013.
[16] Sean MacAvaney, Andrew Yates, Sergey Feldman, Doug Downey, Arman Cohan, and Nazli
Goharian. Simplified data wrangling with ir_datasets. In Fernando Diaz, Chirag Shah,
Torsten Suel, Pablo Castells, Rosie Jones, and Tetsuya Sakai, editors, SIGIR ’21: The 44th
International ACM SIGIR Conference on Research and Development in Information Retrieval,
Virtual Event, Canada, July 11-15, 2021, pages 2429–2436. ACM, 2021.
[17] Nicolas Maring, Andreas Fyrillas, Mathias Pont, Edouard Ivanov, Eric Bertasi, Mario
Valdivia, and Jean Senellart. One nine availability of a photonic quantum computer on
the cloud toward HPC integration. In Brian La Cour, Lia Yeh, and Marek Osinski, editors,
IEEE International Conference on Quantum Computing and Engineering, QCE 2023, Bellevue,
WA, USA, September 17-22, 2023, pages 112–116. IEEE, 2023.
[18] Henning Müller and Allan Hanbury. Eaas: Evaluation-as-a-service and experiences from
the VISCERAL project. In Nicola Ferro and Carol Peters, editors, Information Retrieval
Evaluation in a Changing World - Lessons Learned from 20 Years of CLEF, volume 41 of The
Information Retrieval Series, pages 161–173. Springer, 2019.
[19] Andrea Pasin, Maurizio Ferrari Dacrema, Paolo Cremonesi, Washington Cunha,
Marcos André Gonçalves, and Nicola Ferro. Quantumclef 2025 - the second edition of the
quantum computing lab at clef. In Advances in Information Retrieval - 47th European
Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings,
Part V, volume 15576 of Lecture Notes in Computer Science, 2025.
[20] Andrea Pasin, Maurizio Ferrari Dacrema, Paolo Cremonesi, and Nicola Ferro. qclef: A
proposal to evaluate quantum annealing for information retrieval and recommender systems.
In Avi Arampatzis, Evangelos Kanoulas, Theodora Tsikrika, Stefanos Vrochidis, Anastasia
Giachanou, Dan Li, Mohammad Aliannejadi, Michalis Vlachos, Guglielmo Faggioli, and
Nicola Ferro, editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction
- 14th International Conference of the CLEF Association, CLEF 2023, Thessaloniki, Greece,
September 18-21, 2023, Proceedings, volume 14163 of Lecture Notes in Computer Science,
pages 97–108. Springer, 2023.
[21] Andrea Pasin, Maurizio Ferrari Dacrema, Paolo Cremonesi, and Nicola Ferro. Overview
of quantumclef 2024: The quantum computing challenge for information retrieval and
recommender systems at CLEF. In Lorraine Goeuriot, Philippe Mulhem, Georges Quénot,
Didier Schwab, Giorgio Maria Di Nunzio, Laure Soulier, Petra Galuscáková, Alba
García Seco de Herrera, Guglielmo Faggioli, and Nicola Ferro, editors, Experimental IR Meets
Multilinguality, Multimodality, and Interaction - 15th International Conference of the CLEF
Association, CLEF 2024, Grenoble, France, September 9-12, 2024, Proceedings, Part II, volume
14959 of Lecture Notes in Computer Science, pages 260–282. Springer, 2024.
[22] Andrea Pasin, Maurizio Ferrari Dacrema, Paolo Cremonesi, and Nicola Ferro. Quantumclef
- quantum computing at CLEF. In Nazli Goharian, Nicola Tonellotto, Yulan He, Aldo Lipani,
Graham McDonald, Craig Macdonald, and Iadh Ounis, editors, Advances in Information
Retrieval - 46th European Conference on Information Retrieval, ECIR 2024, Glasgow, UK,
March 24-28, 2024, Proceedings, Part V, volume 14612 of Lecture Notes in Computer Science,
pages 482–489. Springer, 2024.
[23] Andrea Pasin, Maurizio Ferrari Dacrema, Paolo Cremonesi, and Nicola Ferro.
Quantumclef 2024: Overview of the quantum computing challenge for information retrieval and
recommender systems at CLEF. In Guglielmo Faggioli, Nicola Ferro, Petra Galuscáková,
and Alba García Seco de Herrera, editors, Working Notes of the Conference and Labs of
the Evaluation Forum (CLEF 2024), Grenoble, France, 9-12 September, 2024, volume 3740 of
CEUR Workshop Proceedings, pages 3032–3053. CEUR-WS.org, 2024.
[24] Andrea Pasin, Maurizio Ferrari Dacrema, Washington Cuhna, Marcos Andrè Gonçalves,
Paolo Cremonesi, and Nicola Ferro. Overview of quantumclef 2025: The second quantum
computing challenge for information retrieval and recommender systems at CLEF. In Jorge
Carrillo-de-Albornoz, Julio Gonzalo, Laura Plaza, Alba García Seco de Herrera, Josiane
Mothe, Florina Piroi, Paolo Rosso, Damiano Spina, Guglielmo Faggioli, and Nicola Ferro,
editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings
of the Sixteenth International Conference of the CLEF Association (CLEF 2025), Lecture Notes
in Computer Science, 2025.
[25] Andrea Pasin and Nicola Ferro. Kimera: From evaluation-as-a-service to
evaluation-inthe-cloud. In Proceedings of the 48th International ACM SIGIR Conference on Research and
Development in Information Retrieval, SIGIR 2025, Padova, Italy, July 13-18, 2025. ACM,
2025.
[26] Will Reese. Nginx: the high-performance web server and reverse proxy. Linux Journal,
2008(173):2, 2008.
[27] Stephen Robertson. On the history of evaluation in IR. J. Inf. Sci., 34(4):439–456, 2008.
[28] T. Sakai, D. W. Oard, and N. Kando, editors. Evaluating Information Retrieval and Access
Tasks – NTCIR’s Legacy of Research Impact, volume 43 of The Information Retrieval Series.</p>
      <p>Springer International Publishing, Germany, 2021.
[29] Ellen M. Voorhees, Jimmy Lin, and Miles Efron. On run diversity in evaluation as a service.</p>
      <p>In Shlomo Geva, Andrew Trotman, Peter Bruza, Charles L. A. Clarke, and Kalervo Järvelin,
editors, The 37th International ACM SIGIR Conference on Research and Development in
Information Retrieval, SIGIR ’14, Gold Coast , QLD, Australia - July 06 - 11, 2014, pages
959–962. ACM, 2014.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C. W.</given-names>
            <surname>Cleverdon</surname>
          </string-name>
          .
          <source>The Cranfield Tests on Index Languages Devices. Aslib Proceedings</source>
          ,
          <volume>19</volume>
          (
          <issue>6</issue>
          ):
          <fpage>173</fpage>
          -
          <lpage>194</lpage>
          ,
          <year>1967</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Christian</surname>
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Collberg</surname>
            and
            <given-names>Todd A.</given-names>
          </string-name>
          <string-name>
            <surname>Proebsting</surname>
          </string-name>
          .
          <article-title>Repeatability in computer systems research</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>59</volume>
          (
          <issue>3</issue>
          ):
          <fpage>62</fpage>
          -
          <lpage>69</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Di</surname>
          </string-name>
          Cosmo and
          <string-name>
            <given-names>Stefano</given-names>
            <surname>Zacchiroli</surname>
          </string-name>
          .
          <article-title>Software heritage: Why and how to preserve software source code</article-title>
          . In Shoichiro Hara, Shigeo Sugimoto, and Makoto Goto, editors,
          <source>Proceedings of the 14th International Conference on Digital Preservation</source>
          , iPRES
          <year>2017</year>
          , Kyoto, Japan,
          <source>September 25-29</source>
          ,
          <year>2017</year>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Maurizio</given-names>
            <surname>Ferrari</surname>
          </string-name>
          <string-name>
            <surname>Dacrema</surname>
          </string-name>
          , Andrea Pasin, Paolo Cremonesi, and
          <string-name>
            <given-names>Nicola</given-names>
            <surname>Ferro</surname>
          </string-name>
          .
          <article-title>Quantum computing for information retrieval and recommender systems</article-title>
          . In Nazli Goharian, Nicola Tonellotto, Yulan He, Aldo Lipani,
          <string-name>
            <surname>Graham</surname>
            <given-names>McDonald</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Craig</given-names>
            <surname>Macdonald</surname>
          </string-name>
          , and Iadh Ounis, editors,
          <source>Advances in Information Retrieval - 46th European Conference on Information Retrieval</source>
          ,
          <string-name>
            <surname>ECIR</surname>
          </string-name>
          <year>2024</year>
          , Glasgow, UK, March
          <volume>24</volume>
          -28,
          <year>2024</year>
          , Proceedings,
          <string-name>
            <surname>Part</surname>
            <given-names>V</given-names>
          </string-name>
          , volume
          <volume>14612</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>358</fpage>
          -
          <lpage>362</lpage>
          . Springer,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Maurizio</given-names>
            <surname>Ferrari</surname>
          </string-name>
          <string-name>
            <surname>Dacrema</surname>
          </string-name>
          , Andrea Pasin, Paolo Cremonesi, and
          <string-name>
            <given-names>Nicola</given-names>
            <surname>Ferro</surname>
          </string-name>
          .
          <article-title>Using and evaluating quantum computing for information retrieval and recommender systems</article-title>
          .
          <source>In Grace Hui Yang</source>
          ,
          <string-name>
            <surname>Hongning</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Sam Han, Claudia Hauf, Guido Zuccon, and Yi Zhang, editors,
          <source>Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024</source>
          ,
          <string-name>
            <surname>Washington</surname>
            <given-names>DC</given-names>
          </string-name>
          , USA, July
          <volume>14</volume>
          -
          <issue>18</issue>
          ,
          <year>2024</year>
          , pages
          <fpage>3017</fpage>
          -
          <lpage>3020</lpage>
          . ACM,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          and C. Peters, editors.
          <source>Information Retrieval Evaluation in a Changing World - Lessons Learned from 20 Years of CLEF</source>
          , volume
          <volume>41</volume>
          <source>of The Information Retrieval Series</source>
          . Springer International Publishing, Germany,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Nicola</given-names>
            <surname>Ferro</surname>
          </string-name>
          , Julio Gonzalo, Jussi Karlgren, and
          <string-name>
            <given-names>Henning</given-names>
            <surname>Müller</surname>
          </string-name>
          .
          <article-title>The CLEF 2024 monster track: One lab to rule them all</article-title>
          . In Nazli Goharian, Nicola Tonellotto, Yulan He, Aldo Lipani,
          <string-name>
            <surname>Graham</surname>
            <given-names>McDonald</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Craig</given-names>
            <surname>Macdonald</surname>
          </string-name>
          , and Iadh Ounis, editors,
          <source>Advances in Information Retrieval - 46th European Conference on Information Retrieval</source>
          ,
          <string-name>
            <surname>ECIR</surname>
          </string-name>
          <year>2024</year>
          , Glasgow, UK, March
          <volume>24</volume>
          -28,
          <year>2024</year>
          , Proceedings,
          <string-name>
            <surname>Part</surname>
            <given-names>VI</given-names>
          </string-name>
          , volume
          <volume>14613</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>11</fpage>
          -
          <lpage>18</lpage>
          . Springer,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Maik</given-names>
            <surname>Fröbe</surname>
          </string-name>
          , Jan Heinrich Reimer,
          <string-name>
            <surname>Sean</surname>
            <given-names>MacAvaney</given-names>
          </string-name>
          , Niklas Deckers, Simon Reich, Janek Bevendorf, Benno Stein, Matthias Hagen, and
          <string-name>
            <given-names>Martin</given-names>
            <surname>Potthast</surname>
          </string-name>
          .
          <article-title>The information retrieval experiment platform</article-title>
          .
          <source>CoRR, abs/2305.18932</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Maik</given-names>
            <surname>Fröbe</surname>
          </string-name>
          , Matti Wiegmann, Nikolay Kolyada, Bastian Grahm, Theresa Elstner, Frank Loebe, Matthias Hagen, Benno Stein, and
          <string-name>
            <given-names>Martin</given-names>
            <surname>Potthast</surname>
          </string-name>
          .
          <article-title>Continuous integration for reproducible shared tasks with tira.io</article-title>
          . In Jaap Kamps, Lorraine Goeuriot, Fabio Crestani, Maria Maistro, Hideo Joho, Brian Davis, Cathal Gurrin, Udo Kruschwitz, and Annalina Caputo, editors,
          <source>Advances in Information Retrieval - 45th European Conference on Information</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>