<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Madrid, Spain
* Corresponding author.
†These authors contributed equally.
$ tramirez@centrogeo.edu.mx (T. Ramirez-delreal); dmoctezuma@centrogeo.edu.mx (D. Moctezuma); luis.ruiz@infotec.mx
(G. Ruiz); mario.graf@infotec.mx (M. Graf); eric.tellez@infotec.mx (E. Tellez)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Infotec+CentroGEO at Touché: MCIP, CLIP and SBERT as Retrieval Score</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tania Ramirez-delreal</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniela Moctezuma</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guillermo Ruiz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mario Graf</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eric Tellez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centro de Investigación e Innovación en Tecnologías de la Información y Comunicación (INFOTEC)</institution>
          ,
          <addr-line>Aguascalientes, Ags., 20213</addr-line>
          ,
          <country country="MX">México</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Centro de Investigación en Ciencias de Información Geoespacial (CentroGeo)</institution>
          ,
          <addr-line>Aguascalientes, Ags., 20213</addr-line>
          ,
          <country country="MX">México</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>SECIHTI, Secretaría de Ciencia, Humanidades, Tecnología e Innovación, Benito Juárez, Ciudad de México</institution>
          ,
          <addr-line>03940</addr-line>
          ,
          <country country="MX">México</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>This manuscript presents the Infotec+CentroGEO solution for the Image Retrieval/Generation for Arguments challenge at Touché-2025. This shared task asks for systems that can retrieve an image from a given dataset to support a given argument; these arguments include only a single claim without supporting premises. The team's solutions included the usage of MCIP, CLIP, and SBERT to obtain a value for the representations of the images and claims.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Image Retrieval</kwd>
        <kwd>MCIP</kwd>
        <kwd>CLIP</kwd>
        <kwd>SBERT</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Visual information plays a crucial role in communication, often enhancing the impact and memorability
of textual messages. This phenomenon, known as the picture superiority efect [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], suggests that
images are processed and recalled more efectively than words due to their distinct perceptual and
conceptual qualities. Consequently, images are powerful tools for constructing arguments and improving
comprehension and persuasiveness.
      </p>
      <p>
        Hung et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] studied the relevance judgment of journalists when searching for photographs to
support news stories; several key criteria for image search and selection were identified, highlighting
the importance of associated textual information and personal feelings. On the other hand, Wang
et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] indicate how specialized applications have concentrated on examining the link between text
messages and images in political campaigns circulated on social networks; studying the relationship
between text, objects, and colors to comprehend the propaganda strategy. The work of Kiesel et al.
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] on image retrieval employs computational techniques that use argument mining, justify the use of
images as evidence, and improve information retrieval and comprehension in argumentative contexts.
      </p>
      <p>
        As described in [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ], the task of Touché 2025 involves selecting the best image to support an
argument. During our data analysis and exploration, we saw that several images could equally support
a single argument; nevertheless, the competition organizer did not share any ground truth data, so it
was not easy to assess our solutions automatically.
      </p>
      <p>So, we think the task proposed in the Touché 2025 competition is very complex because the dataset
has many images to pair with each claim, some of which are very similar. A solution cannot apply
supervised learning directly due to the lack of a training set to help learning models; therefore, tasks
require heuristics that replicate human image search and selection strategies. The response solutions are
manually evaluated through a human committee. So, in this manuscript, we describe all the approaches
we used trying to select the best possible pair of image and argument to finally submit our best solutions
to the human evaluation coordinated by the organizers of the Touché 2025 competition. In our case, only
a solution based on image retrieval was submitted because although we tested some image generation
models, we considered not submitting a generation image approach because of our limitations in
computational resources and the time required.</p>
      <p>This manuscript is organized as follows. Section 2 describes the task in which the team participated
and the dataset the organizers provided. Section 3 shows the approaches proposed for retrieving images
for arguments. In Section 4, the evaluation of the results is done with our tiny labeled dataset. Finally,
Section 5 concludes the manuscript and discusses future directions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Task description</title>
      <p>
        The Image Retrieval/Generation for Arguments challenge corresponds to Task 3 in Touché 2025 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
Given a claim, the objective is to find relevant images from a provided image database and rank them
according to how much these images can be used as an argument to support that claim.
      </p>
      <p>Each image includes metadata automatically extracted from the picture using AI models. The metadata
includes the text, labels, and keywords extracted from the image using Google Cloud Vision and a
caption provided by LLaVA. Human experts will evaluate submissions according to defined relevance
criteria. The results were uploaded to the TIRA platform [7].</p>
      <p>The dataset is composed of images depicting scenes or visual concepts, as well as their associated
texts, organized into reference captions representing the content of each image. It contains 32,339
images, with 128 claims derived from 27 diferent topics. Each claim is uniquely recorded in an XML
ifle, although a topic may appear in multiple arguments. Each image includes its respective metadata,
and in this study, text and captions were utilized to improve the representation of the images.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>We tried some state-of-the-art models such as CLIP [8], MCIP [9], and Sentence BERT [10]. In the
following sections, each approach will be explained in detail.</p>
      <sec id="sec-3-1">
        <title>3.1. A tiny labeled dataset for guiding our approach</title>
        <p>We have to manually classify a small subset with 13 claims, from 3 topics, containing 995 images, to
compare and determine whether our strategies improve our results quantitatively. We decided to build
this ground truth data to have a guide to assess our eforts. The 13 claims (see Table 1) provided in the
ifrst stage were used to select among 995 images (also provided in the first stage of the competition, in
a dataset called tiny version) the top three that most support each claim, so for each claim, we selected
the intersection of the top three selected by two humans, to establish the final top three images per
each argument. So, in this labeling produced at least two evaluations per image concerning claims, and
then we selected the five best images for each argument.</p>
        <p>After this manual annotation, diferent techniques for retrieving the images were evaluated. The
architectures used are explained in the following subsections. The score was computed as if the top
 images provided by the model were in the top three images manually selected; it was considered a
success. The score has a higher probability when  is also higher. Although we did not measure the
inter-annotator agreement, we consider it high enough because both persons have a high intersection
between their selected images.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Multimodal encoding models</title>
        <sec id="sec-3-2-1">
          <title>CLIP</title>
          <p>Contrastive Language-Image Pretraining (CLIP) [8] is a multimodal model that is used to predict whether
a pair, image and text is related or not. CLIP was trained on text paired with images on the internet,
and one important limitation mentioned by its authors is the presence of social biases in the model
because this pair of image-text was used unfiltered and uncurated; nevertheless, the results achieved
by CLIP are impressive most of the time. Specifically, we used the CLIP version of OpenAI, 1 with the
ViT-L-14-336 model. We used the pair, image, and text, and obtained the similarity between them, so
we sorted these similarities and chose the top five for the submission.</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>MCIP</title>
          <p>Schall et al. [9] introduce the Multi-Caption-Image-Pairing (MCIP), while it is similar to CLIP, here the
image encoder is trained with a diferent loss function and a collection of related captions per image.
The text encoder is frozen to maintain alignment of the text and image embeddings.</p>
          <p>Argumentative information is extracted from the claims, which constitutes a textual proposition.
Each image could be a good illustration of the claim; the model used is the same as that used for the
caption. This transforms each claim into a normalized semantic vector that can be compared with both
the image and the reference captions.</p>
          <p>Then, we derived textual propositions (arguments) from claims. Each image may efectively
demonstrate the claim, using the same model employed for captions. This process converts each claim into a
standardized semantic vector, enabling comparison with both the image and the associated captions;
thus, we are able to evaluate semantic similarity by comparing the image vector with both the claim
and the caption embeddings. This is done by applying cosine similarity, which evaluates how close the
vectors are in the shared semantic space.</p>
          <p>This methodology aims to determine the relevance of a claim relative to the visual content of an image
by using reference captions as a context. A score is generated to measure the semantic compatibility of
each claim concerning the image and its caption. Ultimately, results are organized that associate each
image with a list of scores that reflect the similarity between the image and all claims. These scores can
be ranked to identify the top five images with the highest score, which implies a stronger semantic
association between the claim, the image, and its caption.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Text encoders</title>
        <p>Sentence BERT (SBERT) [10] is a language model tailored for tasks involving similarity. Built on the
foundation of a BERT model, SBERT is further optimized to transform text into vectors in such a way
that semantically similar text points correspond within the target latent space. Training of SBERT
involves semantically related pairs and triplets, utilizing a siamese neural architecture that encloses the
base BERT model, along with latent-space pooling and a transformation to align vectors with similar
text sentences.</p>
        <p>A diferent approach we tried was to use text-based information only. Instead of using actual
images, we concatenated the text within the image with the caption provided as input to the Sentence
BERT model; in particular, we used the multi-qa-mpnet-base-dot-v1 variant. So we obtained an
embedding for each image that captures its semantic information. We then concatenated the topic with
the claim and obtained the corresponding embeddings. We normalized all the embeddings from the
images and claims. Finally, for each claim, we used cosine similarity to rank the image embeddings and
retrieved the top five.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>As mentioned above, we did our own tagged dataset with a subset of the data provided by the competition
organizers. This internal evaluation provided some information on the results of our eforts.</p>
      <p>Table 2 shows the results of this evaluation. In the headings, we have all the diferent claims that
are used to develop our models (which means the argument IDs). MCIP, CLIP, and SBERT refer to the
corresponding models explained before and  is indicated, for which the top values  generated by the
model were considered.</p>
      <p>The score was calculated as follows; if one of the  best results is at least one of the three solutions
manually established, it is considered a success (represented by 1), otherwise it is an error (represented
by 0).</p>
      <p>The  values are related to the number of images returned by the model, so the higher this  higher
the probability of being (at least in one element) in our three manually chosen images. So, the score has
the range value 0-1, 0 being the lowest possible result and 1 the contrary case. Then, the best results
were obtained by SBERT  = 5, 3, 1.</p>
      <p>Although SBERT reached the best result, we decided to send as the final solution the three outputs
produced by CLIP, MCIP, and SBERT, respectively. The main reason we decided that is because MCIP
and CLIP are very similar in their results and both considered image-text; on the other hand, SBERT
is the unique solution text-based only and achieved our best results, so, as we had the opportunity to
submit more than one solution, we took advantage of that.</p>
      <p>According to the final results, our best performance was obtained by our CLIP solution, beating the
results achieved by SBERT.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>This work presented our proposals to tackle Touché in the CLEF2025 competition, a task asking language
and vision models to find better images that support a specific claim (text).</p>
      <p>Our methodology involved creating a tiny subset of arguments and images to guide our research
eforts. In terms of models, we evaluated two multimodal models, a text-based language model and
one text-based only model. Our most successful strategy used SBERT to evaluate and align images
(their textual descriptions) with claims, allowing us to identify the top five images related to each claim.
This undertaking is particularly complex due to the necessity for human assessment, which presents
considerable obstacles in refining models to achieve specific objectives.</p>
      <p>Our results suggest that the retrieval problem presents significant challenges and necessitates further
research to address and meet individuals’ informational needs efectively.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used Grammarly and Writefull’s model for grammar
and spelling checks. After using these services, the authors reviewed and edited the content as needed
and assumed full responsibility for the content of the publication.
[7] M. Fröbe, M. Wiegmann, N. Kolyada, B. Grahm, T. Elstner, F. Loebe, M. Hagen, B. Stein, M. Potthast,
Continuous Integration for Reproducible Shared Tasks with TIRA.io, in: J. Kamps, L. Goeuriot,
F. Crestani, M. Maistro, H. Joho, B. Davis, C. Gurrin, U. Kruschwitz, A. Caputo (Eds.), Advances
in Information Retrieval. 45th European Conference on IR Research (ECIR 2023), Lecture Notes
in Computer Science, Springer, Berlin Heidelberg New York, 2023, pp. 236–241. doi:10.1007/
978-3-031-28241-6_20.
[8] A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin,
J. Clark, et al., Learning transferable visual models from natural language supervision, in:
International conference on machine learning, PmLR, 2021, pp. 8748–8763.
[9] K. Schall, K. U. Barthel, N. Hezel, K. Jung, Optimizing CLIP models for image retrieval with
maintained joint-embedding alignment, in: E. Chávez, B. Kimia, J. Lokoc, M. Patella, J. Sedmidubsk
(Eds.), Similarity Search and Applications - 17th International Conference, SISAP 2024, Providence,
RI, USA, November 4-6, 2024, Proceedings, volume 15268 of Lecture Notes in Computer Science,
Springer, 2024, pp. 97–110. doi:10.1007/978-3-031-75823-2_9.
[10] N. Reimers, I. Gurevych, Sentence-bert: Sentence embeddings using siamese bert-networks,
in: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing,
Association for Computational Linguistics, 2019, pp. 3982–3992. URL: https://aclanthology.org/
D19-1410.pdf.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>E. B.</given-names>
            <surname>Thomas</surname>
          </string-name>
          ,
          <article-title>Visual superiority efect in memory (pictures are worth a thousand words)</article-title>
          ,
          <source>in: Proceedings of the Human Factors Society Annual Meeting</source>
          , volume
          <volume>27</volume>
          ,
          <string-name>
            <given-names>SAGE</given-names>
            <surname>Publications Sage</surname>
          </string-name>
          <string-name>
            <surname>CA</surname>
          </string-name>
          : Los Angeles, CA,
          <year>1983</year>
          , pp.
          <fpage>714</fpage>
          -
          <lpage>714</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.-Y.</given-names>
            <surname>Hung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zoeller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lyon</surname>
          </string-name>
          ,
          <article-title>Relevance judgments for image retrieval in the field of journalism: A pilot study</article-title>
          ,
          <source>in: Digital Libraries: Implementing Strategies and Sharing Experiences: 8th International Conference on Asian Digital Libraries, ICADL</source>
          <year>2005</year>
          , Bangkok, Thailand,
          <source>December 12-15</source>
          ,
          <year>2005</year>
          . Proceedings 8, Springer,
          <year>2005</year>
          , pp.
          <fpage>72</fpage>
          -
          <lpage>80</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>M.-H. Wang</surname>
            ,
            <given-names>W.-Y.</given-names>
          </string-name>
          <string-name>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.-H. Kuo</surname>
          </string-name>
          , K.-Y. Tsai,
          <article-title>Analyzing image-based political propaganda in referendum campaigns: from elements to strategies</article-title>
          ,
          <source>EPJ Data Science</source>
          <volume>12</volume>
          (
          <year>2023</year>
          )
          <fpage>29</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kiesel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Reichenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <article-title>Image retrieval for arguments using stance-aware query expansion</article-title>
          ,
          <source>in: Proceedings of the 8th workshop on argument mining</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>36</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kiesel</surname>
          </string-name>
          , Ç. Çöltekin,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gohsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Heineking</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Heinrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fröbe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Aliannejadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Erjavec</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kopp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ljubešić</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Meden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Mirzakhmedova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Morkevičius</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Scells</surname>
          </string-name>
          , I. Zelch,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          , Overview of Touché 2025:
          <article-title>Argumentation Systems</article-title>
          , in: J.
          <string-name>
            <surname>C. de Albornoz</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Plaza</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>García Seco de Herrera</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Piroi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Spina</surname>
          </string-name>
          , G. Faggioli, N. Ferro (Eds.),
          <source>Experimental IR Meets Multilinguality, Multimodality, and Interaction. 16th International Conference of the CLEF Association (CLEF</source>
          <year>2025</year>
          ), Lecture Notes in Computer Science, Springer, Berlin Heidelberg New York,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>B.</given-names>
            <surname>Ionescu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.-C.</given-names>
            <surname>Stanciu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-G.</given-names>
            <surname>Andrei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Radzhabov</surname>
          </string-name>
          , Prokopchuk, Ştefan, Liviu-Daniel, M.-G. Constantin,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dogariu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kovalev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Damm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rückert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Ben</given-names>
            <surname>Abacha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>García Seco de Herrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Friedrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bloch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Brüngel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Idrissi-Yaghir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Schäfer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. M. G.</given-names>
            <surname>Pakull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bracke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Pelka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Eryilmaz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Becker</surname>
          </string-name>
          , W.-W. Yim,
          <string-name>
            <given-names>N.</given-names>
            <surname>Codella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Novoa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Malvehy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Dimitrov</surname>
          </string-name>
          ,
          <string-name>
            <surname>Dimitar Das</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>H. M.</given-names>
          </string-name>
          <string-name>
            <surname>Shan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Nakov</surname>
            , I. Koychev,
            <given-names>S. A.</given-names>
          </string-name>
          <string-name>
            <surname>Hicks</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Gautam</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Riegler</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Thambawita</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Halvorsen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Fabre</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Macaire</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Lecouteux</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Schwab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Heinrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Kiesel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Stein</surname>
          </string-name>
          , Overview of imageclef 2025:
          <article-title>Multimedia retrieval in medical, social media and content recommendation applications</article-title>
          , in: J.
          <string-name>
            <surname>C. de Albornoz</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Plaza</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>García Seco de Herrera</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Piroi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Spina</surname>
          </string-name>
          , G. Faggioli, N. Ferro (Eds.),
          <source>Experimental IR Meets Multilinguality, Multimodality, and Interaction. 16th International Conference of the CLEF Association (CLEF</source>
          <year>2025</year>
          ), Lecture Notes in Computer Science, Springer, Berlin Heidelberg New York,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>