<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>AIxPA</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Understanding Italian Administrative Texts: A Reader-Oriented Study for Readability Assessment and Text Simplification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martina Miliani</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco S. G. Senaldi</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gianluca E. Lebani</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandro Lenci</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Linguistics and Comparative Cultural Studies, Ca' Foscari University of Venice</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Philology</institution>
          ,
          <addr-line>Literature, and Linguistics</addr-line>
          ,
          <institution>University of Pisa</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Psychology, McGill University</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University for Foreigners of Siena</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>1</volume>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>The complexity of administrative texts can preclude citizens with language disparities from accessing relevant information. Recent deep-learning models of readability assessment and text simplification would greatly benefit from training materials that are annotated with the specific needs of the target readers. The aim of the present work is to investigate how diferently second language learners of Italian and elderly Italian native speakers read and comprehend administrative texts of diferent readability levels in digital format, as compared to a control group of Italian native speakers. To this end, we conducted a study where 86 participants from the three groups were asked to perform a comprehension task via smartphone. Participants read administrative texts in their original and simplified form, where simplification was performed on the basis of linguistic features that previous literature considered typical of the administrative domain. Although the applied simplification did not seem to afect text comprehension, we observed diferences across the three subject groups, especially in relation to participants' background.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Natural Language Processing</kwd>
        <kwd>Reading Comprehension</kwd>
        <kwd>Public Administration</kwd>
        <kwd>L2</kwd>
        <kwd>Elderly</kwd>
        <kwd>Automatic Readability Assessment</kwd>
        <kwd>Automatic Text Simplification</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Even though public institutions communicate more and more through the web and innovative
digital technologies and have been encouraged to use a plain language [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], Italian administrative
texts appear still far from being easily readable [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Writing easy-to-read text is a non trivial
task if we consider what text comprehension means. According to [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], text comprehension
is determined by the interplay of three factors: the reader and their background, the reading
context (e.g., the medium used), and the text itself. For this reason, not only should algorithms
for Automatic Readability Assessment (ARA) and Automatic Text Simplification (ATS) take
into consideration the linguistic features of a text that pertain to its domain and genre [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], but
should also be tuned to the specific needs of the target audience [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ].
      </p>
      <p>
        These issues become particularly compelling when it comes to the administrative language.
Its complexity can become a barrier to the accessibility of information related to citizens’ rights
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This is especially true for citizens with language disparity, namely those who do not have
an optimal level of language proficiency [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>In this paper, we present a study that involved 86 participants belonging to two groups
with language disparities, i.e., Italian second-language speakers and elderly Italian native
speakers, and to a control group, i.e., Italian native speakers. Participants were asked to
perform a comprehension task in a digital context (via smartphone) on original and simplified
administrative texts. This simplification was carried out by only considering those features that
previous literature considered typical of the linguistic complexity of the administrative domain.
The goals of the present study are manifold:
• Assessing if there is a diference between the three groups in the comprehension of
administrative texts with two diferent levels of readability;
• Exploring the efect of participants’ background (e.g., education and digital literacy) on
the comprehension of administrative texts;
• Detecting which linguistic features afect the comprehension performance across the
three groups, over and above those strictly related to the administrative language.</p>
      <p>Details on the experimental setting, including materials, selected participants, experimental
design, and extracted linguistic features are described in Section 3. Section 4 shows the results of
the three reading tasks and the linguistic feature analysis, whose implications are then discussed
in Section 51.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Classic readability formulae were designed in the early 20s to detect the complexity of a text in
relation to educational stages [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">8, 9, 10</xref>
        ]. Since these formulae took only raw linguistic features
into consideration, such as word and sentence length, they were not fully reliable [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ].
Advancements in Machine Learning (ML) led to the implementation of more complex models
that are informed by a wider and less superficial set of linguistic features [
        <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
        ].
      </p>
      <p>
        As for Italian, readability formulae started being implemented only in the late 80s, by [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]
and [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], authors of the Flesch-Vacca formula and the Gulpease Index, respectively. The first
ML-based index for Italian is Read-It [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]: The index measures the probability of a text to be
labelled as complex by an SVM trained on newspaper articles on the basis of linguistic features
ranging from the lexical to the syntactic level. Inspired by Coh-Metrix [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], [19] considered
also discourse-level features, e.g., cohesion, to design Coease, an index for texts related to the
educational domain. Finally, CTAP is a web-based readability tool available also for the Italian
1Anonymized and aggregated data, and code are available at https://github.com/Unipisa/ita_admin_user_study
language [20], which extracts 253 diferent linguistic features. Such features are not related to
any reference corpora, and it is up to the user to give an interpretation to the extracted values.
      </p>
      <p>
        Some models for Italian text readability were also designed for targeted reader groups, such
as Italian second language learners. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] implemented MALT-IT2, a tool that automatically
classifies texts by assigning one of the proficiency levels of the Common European Framework
of Reference for Languages (CEFR). This tool is based on a SVM, trained on raw, lexical,
morphosyntactic, syntactic, and discursive features.
      </p>
      <p>For what concerns the administrative language, [21] automatically analyzed several linguistics
features extracted from a parallel corpus composed of administrative texts and their simplified
versions. The author aimed at distinguishing between features that are expression of the intrinsic
complexity of the administrative language and those used in the so-called “bureaucratese”, a
term used to indicate the “artificial” and “obscure” style that sometimes characterizes the
administrative writing [22]. [23] extracted complexity features from about 100 institutional
texts for foreigners, showing the gap between the language used in these texts and those tailored
for Italian second language speakers. This gap was also confirmed by a comprehension test
carried out on specific target readers [24].</p>
      <p>A way to detect which features best predict the readability of a certain text for a certain
target is in fact to collect data from human participants. [25] built two models trained on several
linguistic features to predict pairwise scores for text comprehension and reading time collected
through online crowdsourcing. [26] showed that scrolling interactions are predictive of text
readability also for specific target users, such as English second language speakers.</p>
      <p>User studies were also conducted for the administrative domain. [27] collected judgments on
readability from public administration staf and extracted linguistics features from the analyzed
texts in French. [28] analyzed complexity features for administrative texts in German, and
evaluated their model through the correlation of such features with non-experts’ judgments on
readability.</p>
      <p>To the best of our knowledge, this is the first study on readability that focuses on the
administrative Italian language and addresses multiple subject groups, such as second-language
and elderly Italian readers in a digital context.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental settings</title>
      <p>In the comprehension task, three diferent groups of participants were involved: Italian
secondlanguage (L2) speakers, Italian first-language speakers who were older than 60 (elderly), and
Italian native speakers younger than 60 years old with a medium-high literacy level (control).
All participants had to perform the test via smartphone.</p>
      <sec id="sec-3-1">
        <title>3.1. Materials</title>
        <p>We collected four portions of texts extracted from documents of diferent nature, covering
various topics related to public administration, and published by oficial websites of Italian
city halls from all over the country (see Table 1). Texts with a similar distribution of specific
linguistic features2 were selected, such as the length of the whole text and the average length
of sentences (in tokens), the token/type ratio, and the percentage of tokens belonging to the
Base Vocabulary of the Italian language [30].</p>
        <p>
          A simplified version of each text was then created based on the features of the Italian
administrative language that were singled out by [21]. The presence of these features in the
administrative texts is claimed not to be justified based on the complexity of the public bodies
2Texts were analyzed by using the Python NLP library Stanza [29].
and the procedures they describe, nor on the performative nature of such language [31], but
to rather lead to “bureaucratese” [22]. The adopted annotation schema was firstly presented
by [32] and then used by [33] for the annotation of SIMPITIKI, where a single simplification
operation was performed on each sentence. We adapted this schema by annotating all the
simplification operations applied to each sentence and by indicating the motivation for the
performed operation, i.e., the detected linguistic feature to be simplified (see statistics in Table 2
and Table 3, and see Table 3 for an example of the simplification operation). The simplification
was validated through a test, which involved 43 Italian native speakers. For each
originalsimplified pair, the participants were asked which text was the simpler one and how similar
they were (Fig. 1), to assess if the information contained in the original text was preserved
in the simplification process. Four multiple-choice questions for each text were formulated
by analyzing their macro and micro informative structure. Drawing inspiration from [34], we
split each text into sentences (microstructure) and, at a higher and more abstract level, we
split the text according to the subject matter (macrostructure). Then, we checked that the
obtained micro and macro structures were preserved after the simplification process and we
selected the portion of texts on which to test the participants. This ensured that questions
covered each element of the macrostructure. The same questions were asked to participants
reading the original or the simplified version of each text. We choose to ask multiple-choice
questions, since they are usually adopted in comprehension tasks [26], have already been used
for an efective simplification of texts [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and are widely adopted for assessing the proficiency of
second language learners [35]. Item readability was then analyzed employing Read-It [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], which
provides a readability score at the sentence level, and items were then simplified accordingly.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Participants</title>
        <p>We recruited a total of 111 participants, 47 for the group L2, 29 for the elderly group and 35 for
the control group. For the control and elderly group, we only included the participants who
were born and were living in the Tuscany region, in order to limit the influence of regional
varieties of Italian on the comprehension of texts. People without a high school diploma were
excluded from the control group.3 For what concerns the L2 group, we eventually included
only non-native speakers of Italian with A2 and B1 language certificates (according to the
CEFR) and currently residing in Italy, as well as and non-native speakers of Italian without any
proficiency certificate who had lived in Italy for at least 5 years. We assumed that people who
had been living in Italy for at least 5 years had higher chances to be frequently exposed to public
administration language in everyday life. By filtering participants based on these criteria, we
were eventually left with 86 subjects: 26 for the group L2, 29 for elderly, and 31 for the control
group. L2 group participants were aged 18 to 55 and 69.2% of them were female. They were
born in Morocco (15.38%), Senegal (11.54%), Albania (11.54%), Georgia (7.69%), Indonesia (7.69%),
Nigeria (7.69%), Russia (7.69%) and other countries (30.77%). For what concerns the education
level, 61.54% of participants had at least a high school diploma. Elderly participants’ age ranged
from 60 to 82 and the 51.72% of them were female. In this case, only 55.17% of participants had
at least the high school diploma.</p>
        <p>We collected such information through a demographic questionnaire. We grouped the
questions into diferent topics, starting with those regarding all the participants [ 36]. The
questionnaire also included questions about digital literacy, familiarity with the administrative
domain, education and reading habits.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Test implementation and design</title>
        <p>We implemented the test on a multiple-step web page, using HTML, CSS and JavaScript. Choices
about the test layout, such as line spacing, font type, and dimension were made following the
Design Guidelines for Public Administration Web Sites and Services4 provided by AGID (Agency
for Digital Italy).</p>
        <p>We administered the test in a hybrid format, partly in person and partly remotely. Each
participant read two texts, one presented in its original version and the other in its simplified
version. The 8 texts (4 original and 4 simplified) that were obtained through the procedure
described in Section 3.1 were rotated across participants so that no participant saw the same text
in both conditions. In the four resulting lists, the order in which the original and simplified text
appeared was counterbalanced. Firstly, participants answered to the demographic questionnaire
and then completed the comprehension task for one text at a time.</p>
        <p>We showed each single question on a diferent step page, right below the related text. For
each multiple choice question, we provided a key, two distractors, and the “I don’t know” option,
to try to limit participants’ guessing.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Feature Extraction</title>
        <p>
          We extracted 13 linguistic features from each text to assess which ones mostly afected
participants’ reading speed and comprehension. Features were selected based on existing literature
on the readability of administrative language [
          <xref ref-type="bibr" rid="ref2">21, 2</xref>
          ], Italian Second Language Learning [
          <xref ref-type="bibr" rid="ref6">37, 6</xref>
          ],
3In 2001, the average year of scholar education per person was about 11,7 years [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. People without a high
school diploma, which in Italy is obtained after 13 years of scholar education, were thus considered having a low
literacy level.
        </p>
        <p>4https://docs.italia.it/italia/design/lg-design-servizi-web/it/versione-corrente/index.html
and language processing in elderly people [38]. Such features are related to diferent linguistic
levels:
• Raw. Average length of sentences in tokens;
• Lexical. Percentage of words belonging to the Fundamental Vocabulary5, average number
of multiword units and entities per sentence, average number of collateral technicisms6
per sentence;
• Psycholinguistic. Percentage of abstract nouns 7;
• Morphosyntactic. Percentage of deverbal nouns, participles verbs, and indicatives verbs;
• Syntactic. Average depth of the parsing tree, ratio between subordinate and total number
of clauses, average length of the prepositional chains;
• Discourse and Style. Average number of asides and parenthetical expressions per sentence,
average number of common nouns among adjacent sentences.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Preprocessing</title>
        <p>Participants’ performance in comprehension questions was measured in terms of error rate.
Sociolinguistic data from the demographic questionnaire were operationalized in three diferent
indices. We measured the use of smartphone by averaging for each participant the Likert values
for the questions “How many hours per day you spent on the phone last week?” and “Do
you use the smartphone to work or study (i.e. reading books and articles, writings, making
analysis, doing some research, etc.)”. The second question was motivated by the fact that people
used to quick interactions with their smartphone struggle in focusing on longer task [39]. This
averaging procedure resulted in the digital index. Familiarity with the administrative domain
was analyzed though the admin index, obtained by averaging Likert values for the questions
“How often have you paid taxes, filled forms or asked for financial support in the last month?”
and “How often did you read forms, notices, call for applications, regulations or similar in
the last month?”. Finally, we merged information about education and reading habits into the
readedu index. This index was obtained by averaging responses to the questions: “How many
books have you read in the last year?”, “How often have you read newspapers and magazines
(also online) in the last month?”, and “Which is your highest degree?”8. Such indices were then
centered and scaled.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>Analyses were run to assess if there was any diference between the three subject groups
(L2, elderly, control) in the comprehension of administrative texts and of their easier-to-read
versions, simplified according to the sole linguistic features that are specifically related to the
5A subset of the Italian Base vocabulary.</p>
      <p>6Collateral technicisms are terms related to sectorial or special languages used to give the text a high linguistic
registry but that lack specific communicative function.</p>
      <p>7Given the lack of annotated data for the administrative domain, nouns’ lemma were manually annotated by a
linguist as abstract or concrete. We then computed the percentage of abstract nouns for each portion of text.</p>
      <p>8For L2 and elderly, the index is only based on the answers to the first two questions for those participants who
did not precised their educational level.
administrative language. Furthermore we analyzed how familiarity with the administrative
domain, digital literacy, reading habits, and education afected comprehension, by using the
three indices described in Section 3.5.</p>
      <sec id="sec-4-1">
        <title>4.1. Error rates</title>
        <p>We were interested in detecting any significant diference among groups in relation to
participants’ error rate when answering the comprehension questions. A significant interaction
emerged between group, complexity, and the digital index ( = .012). While L2 speakers were
overall less accurate in answering comprehension questions, they specifically made more errors
on simpler texts when digital exposure was lower (see Fig. 2). We also observed a main efect of
the admin index on participants’ error rate ( = .040). L2 speakers’ error rate was higher for
both original and simplified texts when their familiarity with the administrative domain was
lower (see Fig. 3). By contrast, the performance of participants seemed not to be impacted by
reading habits and education level.
4.1.1. Focus on L2
In a subsequent step of our analysis, we zoomed in on L2 speakers only, to shed light on the
role of L2-specific demographic variables in text comprehension. In particular, we analyzed
the interaction between text complexity, the number of years spent in Italy, Italian proficiency
(i.e., the language certificate), the years employed in studying Italian as a second language, and
the use of Italian when communicating at home, at work, and with friends. In this case, texts
were not included in the random efects. Finally, for this analysis we considered the education
level separated from the information about reading habits.9 In line with the results obtained on
the three groups, by analyzing L2 participants we observed a significant trend concerning the
interaction between the digital index and text complexity on error rates ( = .085).10 Namely,
the error rate was higher for those participants with lower digital literacy when answering
questions on simplified texts (see Fig. 4).</p>
        <p>9This analysis involved 24 participants out of 26: two participants where excluded since they did not indicated
which was their education level.</p>
        <p>10We considered only participants as random efect here.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Feature analysis</title>
        <p>We conducted a preliminary analysis on the linguistic features we described in Section 3.4, to
detect which ones are more predictive of participants’ error percentage in the comprehension
task. We performed a Principal Component Analysis (PCA) on the linguistic features to reduce
the dimensionality of the data while preserving as much as possible of their original information.
The first two principal components cumulatively accounted for 71% of the variance of the
original variables. When inspecting the loadings matrix, PC1 seemed to be mostly influenced by
morphological features, i.e., the number of participles and indicative verbs, and by features that
afect the sentence length: the average number of multiwords units and entities per sentence,
the average length of the prepositional chains, and the average length of sentences in tokens. By
contrast, PC2 appeared to be influenced by the average number of common nouns in adjacent
sentences, which is related to text cohesion, and morphosyntactic features, i.e., the average
depth of the parsing tree per sentence and the number of deverbal nouns.</p>
        <p>When predicting participants’ error rates,11 we observed an efect of PC2 on each group, and
in particular an almost-significant efect on L2 participants (  = .060). As shown in Figure 5,
when PC2 is higher, the error rate increases for the three groups, especially for L2.</p>
        <p>A significant efect is observed in the interaction among group, complexity, and PC1 (  =
.030). Figure 6 shows that L2 participants’ error rates increases along with PC1 values for
questions regarding simplified texts. An higher error rate is registered also for control participants,
but such increment is not significant.</p>
        <p>It is paramount to underscore the preliminary nature of this exploratory analysis. Future
contributions will better clarify the role of specific linguistic features through finer-grained and
targeted analyses.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Interactive task</title>
        <p>The test originally included also an interactive task, where we asked participants to underline
the portions of text they perceived as more dificult. By doing so, we wanted to compare
response accuracy against a more subjective judgment of the text complexity. Participants
were free not to underline any portion of texts. However, we tried to encourage the readers’
participation by also providing a tutorial, to reduce the limitations posed by the low familiarity
of some participants with digital devices. Unfortunately, only a few participants underlined a
portion of text (10 L2, 6 elderly and 14 control participants) and thus, we did not carry out any
statistical analysis on this data. We do believe that this happened because most participants did
not perceive any part of the text as complex. However, when including data from participants
that were left out in the initial filtering, we noticed that on average, L2 speakers underlined as
many portions of text as control participants in the simpler condition, whereas they underlined
more passages on original texts. However, when inspecting the underlined text more closely,
we noticed that L2 speakers underlined fewer tokens, i.e., they tended to underline single words
rather than entire phrases (see Fig. 7 for an example). We do not discuss data related to the
elderly group, since only six people took part in the task.</p>
        <p>11We considered only the participants as random efect here.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>The higher error rate registered in L2 participants revealed a significant diference with respect to
elderly and control participants in text comprehension. However, in light of our current results,
we could not confidently conclude that text complexity afected participants’ comprehension
across subject groups. The fact that participants’ comprehension did not improve when dealing
with simplified texts could highlight the need for a simplification strategy that focuses more on
linguistic features specific to each target group.</p>
      <p>Furthermore, we saw that participants’ background afected text comprehension to some
extent. For example, digital literacy seemed to specifically afect L2 learners’ comprehension of
simplified texts. Namely, participants who used their smartphone less frequently, in particular
not for reading and writing, struggled more when answering the questions.</p>
      <p>When focusing only on the L2 group, we also found a marginal efect of digital literacy on
participants’ error rates, whereas we did not register any efect of proficiency as assessed by a
certificate or concerning years spent in Italy.</p>
      <p>Furthermore, familiarity with the administrative domain seems to play a role in subjects’
understanding. The lower the admin index, and therefore the exposure to administrative texts
and public administration, the less accurate were L2 participants’ in answering questions related
to simplified texts.</p>
      <p>The analysis of linguistic features showed that, regardless the text readability, each group
and L2 in particular struggles in understanding sentences with a low number of common nouns
among adjacent sentences and with a complex syntactic structure. In fact, by looking at the
loading matrix, PC2 grows with sentences with a deeper parsing tree and when the percentage
of deverbal nouns in the text decreases. Deverbal nouns, in fact, tend to condense information,
even though this produces further complexity related to their abstractness and high information
density (e.g., “la percezione dell’integrazione salariale” [the receipt of wage subsidies] instead of
“i lavoratori che ricevono l’integrazione salariale” [workers that receive wage subsidies]).</p>
      <p>For what concerns the analysis of the linguistic features expressed by PC1, we also observed
that L2 participants also struggle when reading simple texts with long sentences. Furthermore,
we could say that L2 participants find dificult understanding texts with compound tenses of
verbs, since their error rate increase with a high number of participle verbs, and a lower number
of indicative verbs. Moreover, L2 group’s comprehension is also afected by the lexicon: their
error rate increase with a higher number of multiwords and entities. According to what we
observed in the interactive task, only this lexical aspect of complexity is pointed out by L2
participants’, who seem to perceive lexical features as more complex than syntactic ones.</p>
      <p>PC1’s linguistic features do not seem to have an efect when participants deal with original
texts. In particular, the presence or absence of such features does not help participants to answer
questions correctly when dealing with original and - thus - more complex texts. On the contrary,
with simplified texts, L2 participants’ error rate is higher than for the other two groups: elderly
participants seem to benefit from PC1’s features, whereas for control, and even more so for L2,
texts may require further simplification based on such features.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>The goal of the present work was to investigate how diferently second language learners of
Italian and elderly Italian native speakers read and comprehend administrative texts in a digital
context. We designed a comprehension task to be completed online which allowed us to collect
the error rate in answering comprehension questions for each selected text. Furthermore, we
wanted to assess if other linguistic features afected the comprehension of participants other
than those strictly related to the administrative domain. Such features were used to simplify
the selected texts, and these simplified versions were also shown within the two tasks.</p>
      <p>We observed a diference in comprehension for L2 participants compared to the elderly and
control group, and found out that text complexity did not afect text comprehension across
the three groups. However, participants’ background had some efect on the comprehension
process, especially for what concerns speakers’ digital literacy and their familiarity with the
administrative domain. Finally, we detected the efect of specific linguistic features on participants’
comprehension for all the three groups.</p>
      <p>In future contributions, we would like to further investigate the registered efect of digital
literacy on L2 participants’ comprehension. For example, it would be interesting to analyze the
comprehension of texts when read on paper and on digital devices.</p>
      <p>
        We plan to increase our participant sample and the number of selected texts, in order to have
a dataset that is more representative of the various types of documents that can be found in the
administrative domain. Furthermore, we intend to include also other groups of target users,
like low-literacy people. Then, we aim to propose a simplification procedure for administrative
texts that takes into account a larger set of linguistic features focused on the need of specific
target groups. The obtained data might be used to build a neural model for the simplification of
administrative texts that takes into account the needs of such groups. For example, this would
be possible by leveraging Controllable simplification [
        <xref ref-type="bibr" rid="ref5">5, 40, 41</xref>
        ], which aims at constraining the
neural model output by using special tokens with information on sentence and word length,
Levenshtein distance between input and generated sentence, and so on.
193–202.
[19] S. Tonelli, K. M. Tran, E. Pianta, Making readability indices readable, in: Proceedings
of the First Workshop on Predicting and Improving Text Readability for target reader
populations, 2012, pp. 40–48.
[20] N. Okinina, J.-C. Frey, Z. Weiss, CTAP for Italian: Integrating components for the analysis
of Italian into a multilingual linguistic complexity analysis tool, in: Proceedings of the
12th Conference on Language Resources and Evaluation (LREC 2020), 2020, pp. 7123–7131.
[21] D. Brunato, A study on linguistic complexity from a computational linguistics perspective.
a corpus-based investigation of italian bureaucratic texts, Ph.D. thesis, University Of Siena,
2015.
[22] S. Lubello, Il linguaggio burocratico, Le bussole, Carocci, 2014.
[23] G. Lombardi, La leggibilità dei testi istituzionali italiani destinati agli stranieri, in: M. E.
      </p>
      <p>Favilla, S. Machetti (Eds.), Lingue in contatto e linguistica applicata: individui e società,
AItLA - Associazione Italiana di Linguistica Applicata, Bologna, 2021, pp. 199–214.
[24] G. Lombardi, Capire i documenti in L2: dall’analisi della comprensibilità di un corpus di
testi istituzionali per stranieri alla sperimentazione di approcci didattici e linguistici, Ph.D.
thesis, Università degli Studi di Genova, 2020.
[25] S. A. Crossley, S. Skalicky, M. Dascalu, Moving beyond classic readability formulas: New
methods and new models, Journal of Research in Reading 42 (2019) 541–561.
[26] S. Gooding, Y. Berzak, T. Mak, M. Shari,fi Predicting text readability from scrolling
interactions, arXiv preprint arXiv:2105.06354 (2021).
[27] T. François, L. Brouwers, H. Naets, C. Fairon, Amesure: a readability formula for
administrative texts (amesure: une plateforme de lisibilité pour les textes administratifs)[in french],
in: Proceedings of TALN 2014 (Volume 2: Short Papers), 2014, pp. 467–472.
[28] T. vor der Brück, S. Hartrumpf, H. Helbig, A readability checker with supervised learning
using deep indicators, Informatica 32 (2008) 429–435.
[29] P. Qi, Y. Zhang, Y. Zhang, J. Bolton, C. D. Manning, Stanza: A Python natural language
processing toolkit for many human languages, arXiv preprint arXiv:2003.07082 (2020).
[30] T. De Mauro, I. Chiari, Il nuovo vocabolario di base della lingua italiana, Internazionale,
28/11/2020. (2016). URL: https://www.internazionale.it/opinione/tullio-de-mauro/2016/12/
23/il-nuovo-vocabolario-di-base-della-lingua-italiana.
[31] A. Fioritto, Manuale di stile. Strumenti per semplificare il linguaggio delle amministrazioni
pubbliche, Il mulino, 1997.
[32] D. Brunato, F. Dell’Orletta, G. Venturi, S. Montemagni, Design and annotation of the first
italian corpus for text simplification, in: Proceedings of The 9th Linguistic Annotation
Workshop, 2015, pp. 31–41.
[33] S. Tonelli, A. P. Aprosio, F. Saltori, Simpitiki: a simplification corpus for italian., in:</p>
      <p>Proceedings of the Third Italian Conference on Computational Linguistics (CLiC-it), 2016.
[34] W. Kintsch, T. A. Van Dijk, Toward a model of text comprehension and production,</p>
      <p>Psychological review 85 (1978) 363–394.
[35] M. Barni, A. Villarini, La questione della lingua per gli immigrati stranieri: insegnare,
valutare e certificare l’italiano L2, volume 39, FrancoAngeli, 2001.
[36] D. A. Dillman, J. D. Smyth, L. M. Christian, Internet, phone, mail, and mixed-mode surveys:</p>
      <p>The tailored design method, John Wiley &amp; Sons, 2014.
[37] L. Forti, A. Milani, L. Piersanti, F. Santarelli, V. Santucci, S. Spina, Measuring text complexity
for italian as a second language learning purposes, in: Proceedings of the Fourteenth
Workshop on Innovative Use of NLP for Building Educational Applications, 2019, pp.
360–368.
[38] S. Norman, S. Kemper, D. Kynette, Adults’ Reading Comprehension: Efects of Syntactic</p>
      <p>Complexity and Working Memory, Journal of Gerontology 47 (1992) 258–265.
[39] L. E. Annisette, K. D. Lafreniere, Social media, texting, and personality: A test of the
shallowing hypothesis, Personality and Individual Diferences 115 (2017) 154–158.
[40] J. Mallinson, M. Lapata, Controllable sentence simplification: Employing syntactic and
lexical constraints, arXiv preprint arXiv:1910.04387 (2019).
[41] M. Maddela, F. Alva-Manchego, W. Xu, Controllable text simplification with explicit
paraphrasing, arXiv preprint arXiv:2010.11004 (2020).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Fortis</surname>
          </string-name>
          ,
          <article-title>Il dovere della chiarezza. quando farsi capire dal cittadino è prescritto da una norma</article-title>
          ,
          <source>Rivista Italiana di Comunicazione Pubblica</source>
          <volume>25</volume>
          (
          <year>2005</year>
          )
          <fpage>82</fpage>
          -
          <lpage>116</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Cortelazzo</surname>
          </string-name>
          ,
          <article-title>Il linguaggio amministrativo: principi e pratiche di modernizzazione, Studi superiori</article-title>
          , Carocci,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Vedovelli</surname>
          </string-name>
          , T. De Mauro,
          <article-title>Dante, il gendarme e la bolletta: la comunicazione pubblica in Italia e la nuova bolletta ENEL</article-title>
          , Laterza,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dell'Orletta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Venturi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Montemagni</surname>
          </string-name>
          ,
          <article-title>Genre-oriented readability assessment: A case study</article-title>
          ,
          <source>in: Proceedings of the Workshop on Speech and Language Processing Tools in Education</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>91</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Sagot</surname>
          </string-name>
          , E. de la Clergerie,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bordes</surname>
          </string-name>
          , Controllable sentence simplification, arXiv preprint arXiv:
          <year>1910</year>
          .
          <volume>02677</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Forti</surname>
          </string-name>
          , G. Grego,
          <string-name>
            <given-names>S.</given-names>
            <surname>Filippo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Santucci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Spina</surname>
          </string-name>
          , Malt-it2:
          <article-title>A new resource to measure text dificulty in light of cefr levels for italian l2 learning</article-title>
          ,
          <source>in: 12th Language Resources and Evaluation Conference</source>
          ,
          <source>The European Language Resources Association (ELRA)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>7206</fpage>
          -
          <lpage>7213</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>T. De Mauro</surname>
          </string-name>
          ,
          <article-title>L'educazione linguistica democratica</article-title>
          ,
          <source>Gius. Laterza &amp; Figli Spa</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Lively</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. L.</given-names>
            <surname>Pressey</surname>
          </string-name>
          ,
          <article-title>A method for measuring the vocabulary burden of textbooks</article-title>
          ,
          <source>Educational administration and supervision 9</source>
          (
          <year>1923</year>
          )
          <fpage>389</fpage>
          -
          <lpage>398</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Flesch</surname>
          </string-name>
          ,
          <article-title>A new readability yardstick</article-title>
          ,
          <source>Journal of Applied Psychology</source>
          <volume>32</volume>
          (
          <year>1948</year>
          )
          <fpage>221</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Kincaid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. P.</given-names>
            <surname>Fishburne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Rogers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. S.</given-names>
            <surname>Chissom</surname>
          </string-name>
          ,
          <article-title>Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel</article-title>
          ,
          <source>in: Institute for Simulation and Training</source>
          ,
          <year>1975</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>L.</given-names>
            <surname>Si</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Callan</surname>
          </string-name>
          ,
          <article-title>A statistical model for scientific readability</article-title>
          ,
          <source>in: Proceedings of the tenth international conference on Information and knowledge management</source>
          ,
          <year>2001</year>
          , pp.
          <fpage>574</fpage>
          -
          <lpage>576</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K.</given-names>
            <surname>Collins-Thompson</surname>
          </string-name>
          ,
          <article-title>Computational assessment of text readability: A survey of current and future research</article-title>
          , ITL-
          <source>International Journal of Applied Linguistics</source>
          <volume>165</volume>
          (
          <year>2014</year>
          )
          <fpage>97</fpage>
          -
          <lpage>135</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Schwarm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ostendorf</surname>
          </string-name>
          ,
          <article-title>Reading level assessment using support vector machines and statistical language models</article-title>
          ,
          <source>in: Proceedings of the 43rd annual meeting of the Association for Computational Linguistics (ACL'05)</source>
          ,
          <year>2005</year>
          , pp.
          <fpage>523</fpage>
          -
          <lpage>530</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vajjala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Meurers</surname>
          </string-name>
          ,
          <article-title>On improving the accuracy of readability classification using insights from second language acquisition</article-title>
          ,
          <source>in: Proceedings of the seventh workshop on building educational applications using NLP</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>163</fpage>
          -
          <lpage>173</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>V.</given-names>
            <surname>Franchina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vacca</surname>
          </string-name>
          ,
          <article-title>Taratura dell'indice di flesch su testo bilingue italianoinglese di unico autore, in: Atti dell'incontro di studio su: Leggibilità e Comprensione, Linguaggi, a</article-title>
          . III,
          <year>1986</year>
          , pp.
          <fpage>47</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>Lucisano</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. E. Piemontese,</surname>
          </string-name>
          <article-title>GULPEASE: una formula per la predizione della dificoltà dei testi in lingua italiana</article-title>
          ,
          <source>La Nuova Italia</source>
          (
          <year>1988</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dell'Orletta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Montemagni</surname>
          </string-name>
          , G. Venturi, Read-it:
          <article-title>Assessing readability of italian texts with a view to text simplification</article-title>
          ,
          <source>in: Proceedings of the second workshop on speech and language processing for assistive technologies</source>
          ,
          <year>2011</year>
          , pp.
          <fpage>73</fpage>
          -
          <lpage>83</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A. C.</given-names>
            <surname>Graesser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>McNamara</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. Louwerse</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Cai</surname>
          </string-name>
          ,
          <article-title>Coh-metrix: Analysis of text on cohesion and language</article-title>
          , Behavior research methods, instruments, &amp; computers
          <volume>36</volume>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>