<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>NL4AI 2024: Overview of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Giovanni Bonetta</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claudiu Daniel Hromei</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lucia Siciliani</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Antonio Stranisci</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Enterprise Engineering, University of Rome Tor Vergata</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Fondazione Bruno Kessler</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Bari Aldo Moro</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Turin</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The Natural Language for Artificial Intelligence (NL4AI) workshop serves as a platform to explore the area situated at the intersection between Natural Language Processing (NLP) and Artificial Intelligence (AI), with a special emphasis on recent activities carried out in both fields in Italy. The eighth edition of the workshop had 18 submissions, of which 16 were accepted. The submissions span a broad spectrum of topics, encompassing foundational NLP research, applied NLP, and works that bridge the realms of NLP and AI. This edition exhibited a strong international presence, featuring contributions from authors representing 6 countries. The submissions also reflect a diversity of languages (e.g., English, Italian) and modalities (e.g., text, vision), underscoring the workshop's commitment to inclusivity and comprehensive exploration.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] investigated LLMs’ potential to automatically fill clinical questionnaires using patient records,
achieving promising results in extracting relevant medical information.
      </p>
      <p>
        Expanding the scope of AI multi-agent systems, Gosmar et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] introduce an extension to the
MultiAgent Interoperability framework, improving the coordination of AI agents in multiparty conversations.
This work introduce roles like Floor Manager and Convener Agent, along with mechanisms for handling
interruptions and uninvited agents, which enhance agent collaboration and ensure eficient, structured
multiparty exchanges. Brenna et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] focused on proactivity in task-oriented dialogues, proposing the
"last utterance proactivity prediction" task. Their research consists in instructing a model to detect when
participants provide proactive, unrequested information, in dialogue snippets. This approach opens
avenues for models capable of naturally generating proactive contributions, akin to human dialogue
behavior.
      </p>
      <p>
        Several authors have advanced domain-specific applications of AI, addressing key areas such as
clinical data handling, legal text processing, educational tools, mental health support, and sign language
generation. For instance, Styll et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] introduced an NLP pipeline to automate the extraction of clinical
data from free-text admission notes, using Named Entity Recognition (NER), for eficient integration
into EHR systems, aimed at enhancing workflow and supporting healthcare management. In the legal
domain, Valerio et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] adapted a large language model to Italian legal texts, constructing a specialized
corpus from public records and refining the model with Low-Rank Adaptation (LoRA), resulting in
improved coherence and domain relevance across varying prompts and corpus sizes. In educational
applications, Siragusa et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] developed UniQA, a bilingual question-answering dataset focused on
university course information, which includes 1k documents and 14k QA pairs. They assessed it with a
Retrieval Augmented Generation model, making it suitable for both question-answering and translation
tasks in Italian and English. For accessibility, Colonna et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] introduced a model for generating
Italian Sign Language (LIS) gestures for digital avatars, to enhance interaction for the deaf community,
with potential applications in digital accessibility and education. Finally, Scozzaro et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] conducted
an interdisciplinary readability analysis of recent amendments to the Italian Constitution, incorporating
readability metrics and language model evaluations to assess legislative clarity, contributing to the
understanding of democratic document accessibility.
      </p>
      <p>
        Multiple studies presented in this workshop focus on evaluating language models across diverse
contexts, particularly on applications for Italian. The dissemination work presented by Seveso et
al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] introduced a benchmark based on the INVALSI educational assessments to evaluate LLMs’
proficiency in Italian, adapting the test format for automated scoring. Their findings highlight gaps in
LLMs’ performance relative to human standards and discuss educational implications. Scaiella et al.
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] evaluated a multimodal model, MiniCPM-V 2.6, on GQA-it, Italy’s first large-scale VQA dataset,
showing that fine-tuning improved its accuracy from 33.4% to 59.4%, underscoring the importance of
language-specific adaptation for VQA tasks. Papucci et al. [ 12] addressed label selection in text-to-text
classification, developing Value Zeroing, an attention-based method to optimize label representation
for IT5, an Italian pre-trained T5 model. Their approach resulted in performance gains on the topic
classification task. Lastly, Sartor et al. [ 13] examined coherence evaluation in small Italian language
models, assessing 15 Transformer-based LLMs. They demonstrated that coherence modeling techniques,
such as perplexity and semantic distance, show variable eficacy depending on text genre and data
perturbations, revealing intricate dependencies that afect model performance on coherence tasks.
      </p>
      <p>Di Quilio et al. [14] introduced a comprehensive framework for Aspect-Category Sentiment Analysis
(ACSA), combining data conversion, semi-automatic annotation, and prediction-based reporting. They
adapted an existing Aspect-Category-Opinion Sentiment (ACOS) tool to ACSA, developing a web
application for annotating and enhancing their novel beauty dataset through manual or semi-automatic
methods. Musacchio et al. [15] proposed LLaVA-NDiNO, a series of multimodal large language models
tailored for the Italian language. By training these models on Italian-translated datasets derived from
English vision-language resources, they address the gap in multimodal capabilities for non-English
languages. Their work contributes to open science by releasing the models, data, and code, enabling
further development in multimodal Italian LLMs.
models for visual question-answering in italian, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A.
Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence
(NL4AI 2024) co-located with 23th International Conference of the Italian Association for Artificial
Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[12] M. Papucci, A. Miaschi, F. Dell’Orletta, Fantastic labels and where to find them:
Attentionbased label selection for text-to-text classification, in: G. Bonetta, C. D. Hromei, L. Siciliani,
M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for Artificial
Intelligence (NL4AI 2024) co-located with 23th International Conference of the Italian Association
for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[13] M. Sartor, F. Dell’Orletta, G. Venturi, Coherence evaluation in italian language models, in:
G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop
on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International
Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[14] L. Di Quilio, F. Fioravanti, A comprehensive framework for aspect-category sentiment analysis, in:
G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop
on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International
Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[15] E. Musacchio, L. Siciliani, P. Basile, G. Semeraro, Llava-ndino: Empowering llms with multimodality
for the italian language, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings
of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024) co-located
with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA
2024), CEUR-WS.org, 2024.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.</given-names>
            <surname>Laraspata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cardilli</surname>
          </string-name>
          , G. Castellano, G. Vessio,
          <article-title>Enhancing human capital management through gpt-driven questionnaire generation</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>V.</given-names>
            <surname>Nardoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lippi</surname>
          </string-name>
          , G. Hyeraci,
          <string-name>
            <given-names>M.</given-names>
            <surname>Maccari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Tarazjani</surname>
          </string-name>
          , G. Virgili,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Marinai</surname>
          </string-name>
          ,
          <article-title>Towards automatically filling questionnaires from clinical records with large language models</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Gosmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. C.</given-names>
            <surname>Deborah A. Dahl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Attwater</surname>
          </string-name>
          ,
          <article-title>Ai multi-agent interoperability extension for managing multiparty conversations</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Brenna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Magnini</surname>
          </string-name>
          ,
          <article-title>Last utterance proactivity prediction in task-oriented dialogues</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Styll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Kusa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <article-title>Enhancing clinical data capture: Developing a natural language processing pipeline for converting free text admission notes to structured ehr data</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Valerio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Basile</surname>
          </string-name>
          , M. de Gemmis,
          <article-title>Adapting a large language model to the legal domain: A case study in italian</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>I. Siragusa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pirrone</surname>
          </string-name>
          ,
          <article-title>Uniqa: an italian and english question-answering data set based on educational documents</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E.</given-names>
            <surname>Colonna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Arezzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Roberto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Landi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Vitulano</surname>
          </string-name>
          , G. Vessio, G. Castellano,
          <article-title>Towards italian sign language generation for digital humans</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Scozzaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Delsanto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mastropaolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Mensa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Revelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Radicioni</surname>
          </string-name>
          ,
          <article-title>On the reform of the italian constitution: an interdisciplinary text readability analysis</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Seveso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Mercorio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mezzanzanica</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Potertì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Serino</surname>
          </string-name>
          ,
          <article-title>Disce aut deficere: Evaluating llms proficiency on the invalsi italian benchmark</article-title>
          , in: G. Bonetta,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Siciliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Stranisci</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI</source>
          <year>2024</year>
          )
          <article-title>co-located with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS</article-title>
          .org,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Scaiella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Margiotta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Hromei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Croce</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Basili</surname>
          </string-name>
          , Evaluating multimodal large language
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>