<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SSNdhanyadivyakavitha at MEDIQA-Sum 2023: Medical Dialogue Summarization using Linear Support Vector Classification Technique</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dhanya Krishnan</string-name>
          <email>dhanya2010402@ssn.edu.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Divya Srinivasan</string-name>
          <email>divya2010335@ssn.edu.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kavitha Srinivasan</string-name>
          <email>kavithas@ssn.edu.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science Engineering, Sri Sivasubramaniya Nadar College of Engineering</institution>
          ,
          <addr-line>Rajiv Gandhi Salai, Kalavakkam- 603110</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This research paper proposes a working model for the “ImageCLEFmed MEDIQA-Sum task” of clinical note summarization in healthcare using machine learning algorithms. The model is developed using a “Linear Support Vector Classification (SVC) algorithm” with TF-IDF features to perform text classification on the doctor-patient conversation snippets given in the dataset. A training accuracy of 0.99 and a validation accuracy of 0.69 were obtained. Linear SVC results in improved accuracy when the classes are distinguishable and separated into the respective section classes. In addition, TF-IDF is used to efficiently convert and extract the information given in the dataset. The model also employs several preprocessing techniques to improve the accuracy and random oversampling is performed to combat the heavy imbalance between classes. Other models using CART and CNN are also analyzed. Moreover, this paper discusses the importance and need for text summarization in the medical field with the scope for improving diagnosis and treatment.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        ImageCLEF, an evaluation platform originally proposed by Mark Sanderson and Paul Clough
from the Department of Information Studies, University of Sheffield seeks to provide a cross-language
annotation and retrieval [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This forum creates the necessary infrastructure for the evaluation of visual
information retrieval systems operating in monolingual, cross-language, and language-independent
contexts. ImageCLEF's main objective is to support the development in the field of visual media analysis,
indexing, classification, and retrieval [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This objective was motivated by the requirement to serve
multilingual users from a global community who wanted to access the constantly expanding visual data
and text data to build efficient models for real-time data analysis.
      </p>
      <p>
        Text summarization is the practice of reducing lengthy texts into manageable paragraphs or
sentences. Text summary seeks to extract key details while keeping the overall meaning of the paragraph
intact. Different methods exist in the literature for keyword and semantic-based text summarization, where
the medical data in the healthcare domain plays a vital role in the analysis [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        The clinical summary can be defined as a process of collecting, reviewing, and analyzing patient
data to assist in the conclusion of different clinical needs. Existing automated text summarization techniques
are particularly relevant and useful for medical teams and researchers when there is a need for research
evidence of the COVID pandemic [
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ]. When analyzing patient data, clinicians are frequently given
abundant data from several sources that must be examined independently. Patients have the chance to
elaborate on the story by employing natural language processing techniques to paraphrase the text, which
also improves and increases accuracy. Text summarisation helps in summarizing medical records and
improving diagnosis. Scaling up to large collections and diversifying applications, text summarization is a
useful tool to be incorporated [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The task chosen from the CLEF forum is ImageCLEFmed
MEDIQASum 2023. This task aims to generate a summarization of clinical notes between the clinician and the patient
via three subtasks, out of which we have participated in Subtask A and the implementation with results are
submitted for the same. The volume of published medical research continues to grow rapidly and staying
updated with the best available evidence is a challenge for clinicians [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Many text summarization systems
are difficult to customize or are deployed in low-resource settings. This paper aims at an efficient yet
accurate summarisation approach. Given the number and complexity of medical text records, it has been
realized that there is no added value for the larger quantities of data. Easier access to required information
through models that facilitate information retrieval increases the scope of research and the medical domain
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. A model taking user interests into account, and presenting findings about a user model based on an
existing patient record is highly recommendable [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Researchers examined a great variety of techniques
and applied the same in different domains to yield practical research[10].
      </p>
      <p>Subtask A: Dialogue2Topic Classification. This task aims at identifying the topic given a snippet of the
conversation between the doctor and the patient. The topics have to be identified from the given list of 20
topics or section headers (e.g. Assessment, Diagnosis, Exam, Medications, Past Medical History).
Subtask B: Dialogue2Note Summarization. In this task, the conversation snippet between a doctor and
patient with a section header is given.</p>
      <p>Subtask C: Full-Encounter Dialogue2Note Summarization. Given a full conversation between a doctor and
patient, participants are tasked to find the complete clinical note summarizing the conversation.</p>
      <p>The remaining part of the paper spans across following subsections. Section 2 of this paper
describes the dataset provided by the organizers. Further in Section 3, the proposed methodologies
including Linear Support Vector Machine, Classification and Regression Trees and Convolutional Neural
Networks are described in detail. The results section of the paper analyzes the differences in accuracy
between the different models. Section 5 provides a brief description of the System Specifications required
to implement the developed models. A summary of the inferences is presented in Section 6. The conclusion
and future work are summarized at the end.</p>
      <p>Researchers and healthcare professionals often need to stay up-to-date with the latest medical
literature. Text summarization algorithms can summarize complex research articles, enabling researchers
to quickly note the key findings, methodologies, and implications without reading the entire paper. Overall,
text summarization in healthcare has the potential to save time, improve information retrieval, support
evidence-based decision-making, and enhance patient care by distilling vast amounts of textual data into
concise and meaningful summaries.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Dataset Description</title>
      <p>The dataset is analyzed and a text summarization algorithm is used to create a system that can
automatically produce brief overviews of doctor-patient conversations, to gain valuable insights and save
the doctor's time.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed Methodologies</title>
      <p>The ImageCLEFmed MEDIQA-Sum 2023 aims to simplify the tedious task of clinical note-writing
and summarizing clinician-patient conversations. In subtask A, categorizing the given snippets of
doctorpatient conversation into appropriate headers falls under the problem of Text Classification. Several
algorithms can be used for this purpose, including but not limited to Naive-Bayes Algorithm, Support
Vector Machines, ID3 Algorithm, Latent Derelict Allocation, etc.</p>
      <p>A flow diagram for the developed model is given in Fig. 2. Since there is only a limited training
dataset (of 1201 samples), this study has chosen to apply the following methodologies to achieve the best
results:</p>
    </sec>
    <sec id="sec-4">
      <title>3.1. Model 1: Linear SVC with Random Oversampling</title>
      <p>Linear SVC is a classification algorithm, used for text classification, image classification, natural
language processing, etc. It aims to find a linear decision boundary which effectively separates the different
classes present in the data set.</p>
      <p>This model uses a Linear Support Vector algorithm along with TF-IDF features (Term Frequency
Inverse Document Frequency). It employs several pre-processing techniques, including removing digits,
punctuations, and stopwords from NLTK's stopword corpus, as well as the conversion of the text to
lowercase. The TF-IDF vectorizer class from Scikit-learn converts text data into a numerical representation.</p>
      <p>Due to severe imbalances between the different classes, the accuracy of the model might be
affected. To combat this, this technique proposes to use the RandomOverSampler class from the
imbalanced-learn library, which helps even out the imbalances between classes and improves the
performance of the model. The Linear SVC is then trained on the training dataset, and validated using the
validation set. Finally, we use the model to predict the “Section Headers” for the conversation snippets in
the test data.</p>
    </sec>
    <sec id="sec-5">
      <title>3.2. Model 2: CART with Logistic Regression</title>
      <p>Classification and Regression Tree (CART) is a decision tree-based algorithm. It recursively
partitions the dataset based on features to construct a tree structure wherein, each internal node represents
a feature test, each branch describes a possible outcome of the test, and each leaf node depicts a class label.
A CART model with logistic regression was developed for the subtask. The data is first preprocessed using
lemmatization which helps in standardising words to their root form. This is followed by label encoding
which assigns a numerical value to each class. Countvectorizer then represents the text as a matrix of word
counts. The model is then trained and validated on the provided datasets and used to predict section headers
of the test dataset.</p>
    </sec>
    <sec id="sec-6">
      <title>3.3. Model 3: Convolutional Neural Networks</title>
      <p>The data is preprocessed using the Tokenizer class and pad_sequences function from the Keras
library to help with standardising it. The embedding layer learns dense vector representations for each word
index in the input text, capturing semantic meaning and relationships. It uses 100-dimensional vectors to
represent the words. The MaxPooling1D layer downsamples the feature maps by taking the maximum value
from each local region, reducing dimensionality while preserving important information. A dropout layer
is added to prevent overfitting by randomly setting a fraction of input units to 0 during training, promoting
regularization. The Flatten layer converts the feature maps into a 1D vector, followed by a Dense layer with
softmax activation for final classification. Softmax assigns probabilities to each class, indicating the model's
confidence for each label. The developed model is then trained and validated.</p>
    </sec>
    <sec id="sec-7">
      <title>4. Result and Performance Analysis</title>
      <p>The Linear SVC model is trained on the training set provided by the organizer. The model displayed
a commendable training accuracy of 0.99. This suggests that the model learned the features in the training
set effectively. However, there is a drop in validation accuracy, with 0.69. This drop in performance when
evaluated on unseen data may imply that the model is overfitted, ie, the model does not generalize well to
new data. Methods to address this issue are discussed in Section 5.</p>
      <p>Test accuracies are considered to be representational of how the model performs in real-world situations.
Analysis of the model’s performance on classifying the conversation snippets in the test dataset shows that
it obtained a test accuracy of 0.72. It is also noteworthy that the model trains quickly, in 5 seconds. The
current standing of this model after the evaluation of the MEDIQA-Sum subtask A is at rank 10. The results
published by the organizers are displayed in Table 4.</p>
    </sec>
    <sec id="sec-8">
      <title>5. System Specifications</title>
      <p>The hardware and software requirements for the MEDIQA-Sum subtask A of medical dialogue
summarization are as follows : (i) Dual-core Intel Core i5, clocked between 2.3GHz and 3.5GHz, 8GB of
2133MHz LPDDR3 onboard memory and 256GB PCIe-based onboard SSD. (ii) MacOS Ventura 13.4
operating system, Python 3.7 package with required libraries like tensorflow, torch, sklearn, nltk, pickle,
etc.</p>
    </sec>
    <sec id="sec-9">
      <title>6. Inference</title>
      <p>The scope for improvement in the task of medical dialogue summarization using Machine Learning
models is abundant. The main strengths of our model are that it has high training accuracy and it uses
techniques to even out imbalance between classes. Furthermore, it is very time efficient. However, there is
reason to believe that this particular model might be overfitted since there is a drop in its validation
accuracy.</p>
      <p>To address the issue of overfitting, additional training data with more samples, if available, can be
used to train the model. Regularization is also an option to be considered, as it adds a penalty to the loss
function of the model. It also reduces the impact of less relevant features on the prediction. Early stopping
can also be used to identify the optimal point in training. Addressing overfitting could potentially lead to
better performance.</p>
      <p>It has also been established that there is an imbalance between classes in the training data with
some headings having merely 2-8 samples. To mitigate this, random oversampling has been used. However,
other techniques such as data augmentation and class weighting can be explored. In data augmentation, new
samples for the minority classes are obtained by transforming the existing samples. This may help even out
the imbalance. Class weighting may also prove to be of use, as minority classes can be given more
importance during the optimization process.</p>
      <p>With further research and experimentation, it seems plausible that accuracy can be improved in the
future. The task of manually summarizing clinical notes is a mammoth task. But by implementing the
proposed technique, this can be completed in a matter of seconds, which highlights the indispensability of
Machine Learning in the medical field.</p>
    </sec>
    <sec id="sec-10">
      <title>7. Conclusion</title>
      <p>The ImageCLEFmed MEDIQA-Sum 2023, subtask A is implemented and the results are analyzed.
The following observations were made in the dataset: There are 1200 samples in the training set. There is
an imbalance between the classes, where the maximum number of samples in one class is 351, whereas the
minimum is 2. Several models were trained and tested on the given data sets and further analyzed to improve
accuracy.</p>
      <p>The Linear SVC model developed showed an improvement of 3% in test data when compared to
the CART model with Logistic Regression. A model using CNN was also experimented with but needs
further refining as it only has an accuracy of 0.66. Thus, this study concludes that the Linear SVC model is
the most suitable for the given task out of the models experimented with the given dataset.</p>
    </sec>
    <sec id="sec-11">
      <title>8. Acknowledgements</title>
      <p>We would like to thank the ImageCLEF 2023 organisers for providing us with the dataset, which
was imperative for this study. We would also like to express our gratitude to our professors for extending
their support throughout.</p>
    </sec>
    <sec id="sec-12">
      <title>9. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W.</given-names>
            <surname>Yim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Ben</given-names>
            <surname>Abacha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Snider</surname>
          </string-name>
          , G. Adams,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Yetisgen, Overview of the MEDIQA-Sum Task at ImageCLEF 2023: Summarization and Classification of Doctor-Patient Conversations in CLEF 2023 Working Notes</article-title>
          , CEUR Workshop Proceedings, CEUR-WS.org, Thessaloniki, Greece,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Nicolson</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dowling</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Koopman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2022</year>
          ,
          <year>August 25</year>
          ).
          <article-title>ImageCLEF 2021 Best ofLabs: The Curious Case of Caption Generation for Medical Images</article-title>
          .
          <article-title>ImageCLEF 2021 Best of Labs: The Curious Case of Caption Generation for Medical Images|SpringerLink</article-title>
          . https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -13643-6_
          <fpage>15</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Guglielmo</given-names>
            <surname>Faggioli</surname>
          </string-name>
          , Nicola Ferro, Allan Hanbury,
          <article-title>Martin Potthast, Overview of the MEDIQASum Task at ImageCLEF 2023: Summarization and Classification of Doctor-Patient Conversations CEUR-WS</article-title>
          .org/Vol-3180
          <string-name>
            <surname>-</surname>
          </string-name>
          (
          <year>2022</year>
          ,
          <article-title>August 9).</article-title>
          . http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Alberto</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          , Giovanni Da San Martino, Mirko Degli Esposti, Fabrizio Sebastiani,
          <source>CLEF 2023 Working Notes Experimental IR Meets Multilinguality, Multimodality, and Interaction, Lecture Notes in Computer Science (LNCS</source>
          , volume
          <volume>13390</volume>
          ). https://link.springer.com/book/10.1007/978-3-
          <fpage>031</fpage>
          -13643-6
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Alberto</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          , Giovanni Da San Martino, Mirko Degli Esposti, Sebastiani, Craig Macdonald, Gabriella Pasi, Allan Hanbury, Martin Potthast, Guglielmo Faggioli, Nicola Ferro,
          <source>Experimental IR Meets Multilinguality, Multimodality, and Interaction 13th International Conference of the CLEF Association, CLEF</source>
          <year>2022</year>
          , Bologna, Italy, September 5-
          <issue>8</issue>
          ,
          <year>2022</year>
          . https://link.springer.com/book/10.1007/978-3-
          <fpage>031</fpage>
          -13643-6
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Adhika</given-names>
            <surname>Pramita</surname>
          </string-name>
          <string-name>
            <given-names>Widyassari</given-names>
            , Supriadi Rustad, Guruh Fajar Shidik, Edi Noersasongko, Abdul Syukur, Affandy Affandy,
            <surname>De Rosal Ignatius Moses Setiadi</surname>
          </string-name>
          ,
          <article-title>A Review of automatic text summarization techniques &amp; methods</article-title>
          . (
          <year>2020</year>
          , May 20). Volume
          <volume>34</volume>
          , issue
          <volume>4</volume>
          (Pages
          <fpage>1011</fpage>
          -
          <lpage>1624</lpage>
          ).
          <article-title>Review of automatic text summarization techniques &amp; methods - ScienceDirect</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Sarker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Y. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Al-Garadi</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Abbas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2020</year>
          , November 13).
          <article-title>A Light-Weight Text Summarization System for Fast Access to Medical Evidence</article-title>
          .
          <source>BRIEF RESEARCH REPORT article Front. Digit. Health, 04 December</source>
          <year>2020</year>
          ,
          <string-name>
            <given-names>Sec</given-names>
            <surname>Health</surname>
          </string-name>
          <string-name>
            <surname>Informatics</surname>
          </string-name>
          , Volume
          <volume>2</volume>
          - 2020. https://doi.org/10.3389/fdgth.
          <year>2020</year>
          .585559
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Stergos</given-names>
            <surname>Afantenos Vangelis Karkaletsis Panagiotis Stamatopoulos</surname>
          </string-name>
          ,
          <article-title>Summarization from medical documents: a survey (2004, December)</article-title>
          .
          <source>Summarization From Medical Documents Artificial Intelligence in Medicine</source>
          , Volume
          <volume>33</volume>
          ,
          <string-name>
            <surname>Issue</surname>
            <given-names>2</given-names>
          </string-name>
          ,
          <string-name>
            <surname>February</surname>
            <given-names>2005</given-names>
          </string-name>
          , Pages
          <fpage>157</fpage>
          -
          <lpage>177</lpage>
          . https://doi.org/10.1016/j.artmed.
          <year>2004</year>
          .
          <volume>07</volume>
          .017
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Noemie</given-names>
            <surname>Elhadad</surname>
          </string-name>
          ,
          <article-title>User-sensitive text summarization: Application to the medical domain</article-title>
          . Columbia University, ProQuest Dissertations Publishing,
          <year>2006</year>
          . 3203749. https://www.proquest.com/openview/c0c29a6f5b3229c21418f355b1c57c28/1?pq-origsite=gscho
          <string-name>
            <given-names>B.</given-names>
            <surname>Ionescu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Drăgulinescu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Ben</given-names>
            <surname>Abacha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Snider</surname>
          </string-name>
          , G. Adams,
          <string-name>
            <given-names>M.</given-names>
            <surname>Yetisgen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rückert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Garcıa Seco de Herrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Friedrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bloch</surname>
          </string-name>
          , R. Brün- gel, A.
          <string-name>
            <surname>Idrissi-Yaghir</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Schäfer</surname>
            ,
            <given-names>S. A.</given-names>
          </string-name>
          <string-name>
            <surname>Hicks</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Riegler</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Thambawita</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Storås</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Halvorsen</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Papachrysos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Schöler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Jha</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Andrei</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Radzhabov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Coman</surname>
            , V. Ko- valev, A. Stan, G. Ioannidis,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Manguinhas</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Ştefan</surname>
            ,
            <given-names>M. G.</given-names>
          </string-name>
          <string-name>
            <surname>Constantin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dogariu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Deshayes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Popescu</surname>
          </string-name>
          , Overview of ImageCLEF 2023:
          <article-title>Multimedia retrieval in medical, social media and recommender systems applications</article-title>
          , in Experimental IR Meets Multilinguality, Multimodality, and
          <string-name>
            <surname>Interaction</surname>
          </string-name>
          ,
          <source>Proceedings of the 14th International Conference of the CLEF Association (CLEF</source>
          <year>2023</year>
          ), Springer Lecture Notes in Computer Science LNCS, Thessaloniki, Greece,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>