=Paper=
{{Paper
|id=Vol-3877/overview
|storemode=property
|title=None
|pdfUrl=https://ceur-ws.org/Vol-3877/overview.pdf
|volume=Vol-3877
}}
==None==
NL4AI 2024: Overview of the Eighth Workshop on Natural
Language for Artificial Intelligence (NL4AI 2024)
Giovanni Bonetta1 , Claudiu Daniel Hromei2 , Lucia Siciliani3 and Marco Antonio Stranisci4
1
Fondazione Bruno Kessler, Italy
2
Department of Enterprise Engineering, University of Rome Tor Vergata, Italy
3
University of Bari Aldo Moro, Italy
4
University of Turin, Italy
Abstract
The Natural Language for Artificial Intelligence (NL4AI) workshop serves as a platform to explore the area
situated at the intersection between Natural Language Processing (NLP) and Artificial Intelligence (AI), with a
special emphasis on recent activities carried out in both fields in Italy. The eighth edition of the workshop had
18 submissions, of which 16 were accepted. The submissions span a broad spectrum of topics, encompassing
foundational NLP research, applied NLP, and works that bridge the realms of NLP and AI. This edition exhibited
a strong international presence, featuring contributions from authors representing 6 countries. The submissions
also reflect a diversity of languages (e.g., English, Italian) and modalities (e.g., text, vision), underscoring the
workshop’s commitment to inclusivity and comprehensive exploration.
The Natural Language for Artificial Intelligence (NL4AI) workshop is an annual initiative aimed at
promoting a reflection and discussion about various interactions within the field of Artificial Intelligence
(AI). The workshop specifically emphasizes the importance of Natural Language Processing (NLP)
in AI research, highlighting its role in learning, knowledge representation, and cognitive modeling.
Recent AI achievements demonstrate positive impact on complex inference tasks and offer extensive
application possibilities in linguistic modeling, processing, and inferences. Nevertheless, Natural
Language Understanding remains a rich research topic, whose cross-fertilization extends to diverse
areas such as Cognitive Computing, Robotics, and Human-Computer Interaction. For AI, Natural
languages serve not only as the central focus for paradigms and applications but also as fundamental
elements, playing a crucial role in the automation, autonomy, and learnability of a broad spectrum of
intelligent phenomena — from Vision to Planning and Social Behavior. Reflecting on these diverse and
promising interactions constitutes a significant objective for ongoing AI studies, aligning seamlessly
with the core mission of AIxIA. Specifically, the NL4AI workshop is endorsed by the Special Interest
Group on NLP of the Italian Association for Artificial Intelligence (AIxIA) and by the Italian Association
of Computational Linguistics (AILC). This year’s edition attracted 18 submissions of high-quality papers
by 64 distinct authors from Italy (51), United Kingdom (3), United States (3), Pakistan (3), Austria (3),
Netherlands (1). After the peer-review process, 16 papers out of the initial 18 were accepted (Acceptance
rate: 88.89%), which we believe provides a good balance between the different topics related to the
workshop.
The contributions to the workshop cover a spectrum of topics, ranging from generative and conver-
sational AI to applications of LLMs and Multimodal models in the NLP domain. In what follows, we
provide a short overview of the accepted papers grouped by main topics.
In line with advancing the use of AI for questionnaire-related tasks, recent studies have leveraged
large language models (LLMs) for both generating and filling questionnaires across different domains.
Laraspata et al. [1] explored the use of GPT-3.5-Turbo and GPT-4-Turbo models to automate ques-
tionnaire generation for HR Management, releasing a novel dataset of HR survey questions. Their
research show that, while AI-generated questionnaires are still distinguishable from human-authored
ones, GPT-driven question generators are nontheless a viable solution. On the other hand, Nardoni et
NL4AI 2024: Eighth Workshop on Natural Language for Artificial Intelligence, November 26-27th, 2024, Bolzano, Italy
$ gbonetta@fbk.eu (G. Bonetta); hromei@ing.uniroma2.it (C. D. Hromei); lucia.siciliani@uniba.it (L. Siciliani);
marcoantonio.stranisci@unito.it (M. A. Stranisci)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
al. [2] investigated LLMs’ potential to automatically fill clinical questionnaires using patient records,
achieving promising results in extracting relevant medical information.
Expanding the scope of AI multi-agent systems, Gosmar et al. [3] introduce an extension to the Multi-
Agent Interoperability framework, improving the coordination of AI agents in multiparty conversations.
This work introduce roles like Floor Manager and Convener Agent, along with mechanisms for handling
interruptions and uninvited agents, which enhance agent collaboration and ensure efficient, structured
multiparty exchanges. Brenna et al. [4] focused on proactivity in task-oriented dialogues, proposing the
"last utterance proactivity prediction" task. Their research consists in instructing a model to detect when
participants provide proactive, unrequested information, in dialogue snippets. This approach opens
avenues for models capable of naturally generating proactive contributions, akin to human dialogue
behavior.
Several authors have advanced domain-specific applications of AI, addressing key areas such as
clinical data handling, legal text processing, educational tools, mental health support, and sign language
generation. For instance, Styll et al. [5] introduced an NLP pipeline to automate the extraction of clinical
data from free-text admission notes, using Named Entity Recognition (NER), for efficient integration
into EHR systems, aimed at enhancing workflow and supporting healthcare management. In the legal
domain, Valerio et al. [6] adapted a large language model to Italian legal texts, constructing a specialized
corpus from public records and refining the model with Low-Rank Adaptation (LoRA), resulting in
improved coherence and domain relevance across varying prompts and corpus sizes. In educational
applications, Siragusa et al. [7] developed UniQA, a bilingual question-answering dataset focused on
university course information, which includes 1k documents and 14k QA pairs. They assessed it with a
Retrieval Augmented Generation model, making it suitable for both question-answering and translation
tasks in Italian and English. For accessibility, Colonna et al. [8] introduced a model for generating
Italian Sign Language (LIS) gestures for digital avatars, to enhance interaction for the deaf community,
with potential applications in digital accessibility and education. Finally, Scozzaro et al. [9] conducted
an interdisciplinary readability analysis of recent amendments to the Italian Constitution, incorporating
readability metrics and language model evaluations to assess legislative clarity, contributing to the
understanding of democratic document accessibility.
Multiple studies presented in this workshop focus on evaluating language models across diverse
contexts, particularly on applications for Italian. The dissemination work presented by Seveso et
al. [10] introduced a benchmark based on the INVALSI educational assessments to evaluate LLMs’
proficiency in Italian, adapting the test format for automated scoring. Their findings highlight gaps in
LLMs’ performance relative to human standards and discuss educational implications. Scaiella et al.
[11] evaluated a multimodal model, MiniCPM-V 2.6, on GQA-it, Italy’s first large-scale VQA dataset,
showing that fine-tuning improved its accuracy from 33.4% to 59.4%, underscoring the importance of
language-specific adaptation for VQA tasks. Papucci et al. [12] addressed label selection in text-to-text
classification, developing Value Zeroing, an attention-based method to optimize label representation
for IT5, an Italian pre-trained T5 model. Their approach resulted in performance gains on the topic
classification task. Lastly, Sartor et al. [13] examined coherence evaluation in small Italian language
models, assessing 15 Transformer-based LLMs. They demonstrated that coherence modeling techniques,
such as perplexity and semantic distance, show variable efficacy depending on text genre and data
perturbations, revealing intricate dependencies that affect model performance on coherence tasks.
Di Quilio et al. [14] introduced a comprehensive framework for Aspect-Category Sentiment Analysis
(ACSA), combining data conversion, semi-automatic annotation, and prediction-based reporting. They
adapted an existing Aspect-Category-Opinion Sentiment (ACOS) tool to ACSA, developing a web
application for annotating and enhancing their novel beauty dataset through manual or semi-automatic
methods. Musacchio et al. [15] proposed LLaVA-NDiNO, a series of multimodal large language models
tailored for the Italian language. By training these models on Italian-translated datasets derived from
English vision-language resources, they address the gap in multimodal capabilities for non-English
languages. Their work contributes to open science by releasing the models, data, and code, enabling
further development in multimodal Italian LLMs.
References
[1] L. Laraspata, F. Cardilli, G. Castellano, G. Vessio, Enhancing human capital management through
gpt-driven questionnaire generation, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.),
Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024)
co-located with 23th International Conference of the Italian Association for Artificial Intelligence
(AI*IA 2024), CEUR-WS.org, 2024.
[2] V. Nardoni, M. Lippi, G. Hyeraci, M. Maccari, A. D. Tarazjani, G. Virgili, R. Gini, S. Marinai,
Towards automatically filling questionnaires from clinical records with large language models, in:
G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop
on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International
Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[3] D. Gosmar, E. C. Deborah A. Dahl, D. Attwater, Ai multi-agent interoperability extension for
managing multiparty conversations, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.),
Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024)
co-located with 23th International Conference of the Italian Association for Artificial Intelligence
(AI*IA 2024), CEUR-WS.org, 2024.
[4] S. Brenna, B. Magnini, Last utterance proactivity prediction in task-oriented dialogues, in:
G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop
on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International
Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[5] P. Styll, W. Kusa, A. Hanbury, Enhancing clinical data capture: Developing a natural language
processing pipeline for converting free text admission notes to structured ehr data, in: G. Bonetta,
C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural
Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International Conference of
the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[6] F. Valerio, P. Basile, M. de Gemmis, Adapting a large language model to the legal domain: A case
study in italian, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the
Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with
23th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024),
CEUR-WS.org, 2024.
[7] I. Siragusa, R. Pirrone, Uniqa: an italian and english question-answering data set based on
educational documents, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings
of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024) co-located
with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA
2024), CEUR-WS.org, 2024.
[8] E. Colonna, A. Arezzo, D. Roberto, D. Landi, F. Vitulano, G. Vessio, G. Castellano, Towards
italian sign language generation for digital humans, in: G. Bonetta, C. D. Hromei, L. Siciliani,
M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for Artificial
Intelligence (NL4AI 2024) co-located with 23th International Conference of the Italian Association
for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[9] C. J. Scozzaro, M. Delsanto, A. Mastropaolo, E. Mensa, L. Revelli, D. P. Radicioni, On the reform of
the italian constitution: an interdisciplinary text readability analysis, in: G. Bonetta, C. D. Hromei,
L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for
Artificial Intelligence (NL4AI 2024) co-located with 23th International Conference of the Italian
Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[10] A. Seveso, F. Mercorio, M. Mezzanzanica, D. Potertì, A. Serino, Disce aut deficere: Evaluating
llms proficiency on the invalsi italian benchmark, in: G. Bonetta, C. D. Hromei, L. Siciliani,
M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for Artificial
Intelligence (NL4AI 2024) co-located with 23th International Conference of the Italian Association
for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[11] A. Scaiella, D. Margiotta, C. D. Hromei, D. Croce, R. Basili, Evaluating multimodal large language
models for visual question-answering in italian, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A.
Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for Artificial Intelligence
(NL4AI 2024) co-located with 23th International Conference of the Italian Association for Artificial
Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[12] M. Papucci, A. Miaschi, F. Dell’Orletta, Fantastic labels and where to find them: Attention-
based label selection for text-to-text classification, in: G. Bonetta, C. D. Hromei, L. Siciliani,
M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop on Natural Language for Artificial
Intelligence (NL4AI 2024) co-located with 23th International Conference of the Italian Association
for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[13] M. Sartor, F. Dell’Orletta, G. Venturi, Coherence evaluation in italian language models, in:
G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop
on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International
Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[14] L. Di Quilio, F. Fioravanti, A comprehensive framework for aspect-category sentiment analysis, in:
G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings of the Eighth Workshop
on Natural Language for Artificial Intelligence (NL4AI 2024) co-located with 23th International
Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), CEUR-WS.org, 2024.
[15] E. Musacchio, L. Siciliani, P. Basile, G. Semeraro, Llava-ndino: Empowering llms with multimodality
for the italian language, in: G. Bonetta, C. D. Hromei, L. Siciliani, M. A. Stranisci (Eds.), Proceedings
of the Eighth Workshop on Natural Language for Artificial Intelligence (NL4AI 2024) co-located
with 23th International Conference of the Italian Association for Artificial Intelligence (AI*IA
2024), CEUR-WS.org, 2024.