=Paper=
{{Paper
|id=Vol-3878/93_main_long
|storemode=property
|title=Morphological vs. Lexical Antonyms in Italian: A Computational Study on Lexical Competition
|pdfUrl=https://ceur-ws.org/Vol-3878/93_main_long.pdf
|volume=Vol-3878
|authors=Martina Saccomando,Andrea Zaninello,Francesca Masini
|dblpUrl=https://dblp.org/rec/conf/clic-it/SaccomandoZM24
}}
==Morphological vs. Lexical Antonyms in Italian: A Computational Study on Lexical Competition==
<pdf width="1500px">https://ceur-ws.org/Vol-3878/93_main_long.pdf</pdf>
<pre>
                                Morphological vs. Lexical Antonyms in Italian:
                                A Computational Study on Lexical Competition
                                Martina Saccomando1,∗ , Andrea Zaninello2 and Francesca Masini1
                                1
                                    Alma Mater Studiorum - University of Bologna, Bologna (Italia)
                                2
                                    Fondazione Bruno Kessler, Trento; Free University of Bozen-Bolzano (Italy)


                                                 Abstract
                                                In this paper, we examine the competition between pairs of adjectives in Italian that are antonyms of the same term: one is a
                                                “morphological antonym” formed by negative prefixation, the other is a “lexical antonym” with no morphological relationship
                                                with the term in question. We consider pairs of adjectives that are reported as antonyms in lexicographic resources and
                                                extract the nouns that can be modified by both adjectives from a large corpus. We select a set of 8 nouns for each pair
                                                that present higher, lower, and comparable frequencies combined with each antonym respectively and then we perform
                                                two experiments with a LLM. Firstly, we perform experiments for masked-token prediction of the adjective, to study the
                                                correlation between prediction accuracy and the frequency of the noun-antonym pair. Secondly, we perform a polarity-flip
                                                experiment with a multilingual LLM, asking to change the adjective into its positive counterpart. Our results point to the
                                                conclusion that the lexical antonym seems to have a narrower lexical coverage and scope than the morphological antonym.

                                                 Keywords
                                                 Antonymy, Morphological antonyms, Lexical antonyms, Competition, Corpus analysis, Large language model, Token
                                                 prediction, Polarity flip, Italian


                                1. Introduction                                                                                felice                                                  infelice

                                Antonymy is the semantic relationship between terms
                                                                                                                                                                                       triste
                                with opposite meanings. In their canonical form, two
                                antonyms’ meanings can be represented as the poles of a
                                semantic continuum [1] where one term has a “positive”                                 Figure 1: Two possible antonyms, one morphological, one
                                semantic value, the other a “negative” one [2].                                        lexical, for the same word.
                                   In Italian, given a word (e.g., the adjective felice
                                ‘happy’), antonyms can either be realized via prefixa-
                                tion of that word (e.g., infelice ‘unhappy’) or through                                                          lexical antonyms; explicitly in the case of morphological
                                an independent lexeme (e.g., triste ‘sad’). In our work,                                                         antonyms, by adding a prefix with a negative, contra-
                                we refer to these types of antonyms as morphological                                                             dicting value. Considering their different morphological
                                antonym and lexical antonym, respectively. A word in                                                             structure, one possible hypothesis on their lexical com-
                                the lexicon may have both a morphological and a lexical                                                          petition is that the morphological antonym has a more
                                antonym, only one of them, or neither. In this paper, we                                                         restricted semantics, representing the negation of the se-
                                are interested in triplets of adjectives where a positive                                                        mantics of its adjectival base, while the lexical antonym
                                adjective (e.g., felice) presents two possible antonyms (or                                                      has a broader semantic coverage, as it is morphologically
                                “co-antonyms”), one formed morphologically by prefix-                                                            independent from its positive counterpart (see Section
                                ation (e.g., infelice), one morphologically independent                                                          3).
                                (e.g., triste) (Figure 1).                                                                                           To the best of our knowledge, despite the wealth of
                                   In this paper, we are interested in studying the factors                                                      literature on antonyms (Section 2), there is no empirical
                                that govern the selection of the morphological antonym                                                           in-depth study that investigates the competition between
                                vs. the lexical one. These two types of antonyms express                                                         morphological and lexical antonyms in single languages,
                                “negative” semantics with respect to the opposite, “pos-                                                         including Italian. Studies on antonyms do identify the
                                itive” term in different ways: implicitly in the case of                                                         two types of antonyms but generally do not address the
                                                                                                                                                 factors influencing the preference for one type over the
                                CLiC-it 2024: Tenth Italian Conference on Computational Linguistics, other intralinguistically.
                                Dec 04 — 06, 2024, Pisa, Italy                                                                                       This study investigates the competition between these
                                ∗
                                     Corresponding author.                                                                                       two types of antonyms by firstly studying their distri-
                                Envelope-Open martina.saccomando@studio.unibo.it (M. Saccomando);                                                bution in corpora (Section 5.1); secondly, by testing the
                                azaninello@fbk.eu (A. Zaninello); francesca.masini@unibo.it
                                (F. Masini)                                                                                                      ability of a native-Italian language model to predict them
                                                    © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License in a masked-token prediction task (Section 5.2); and, fi-
                                                    Attribution 4.0 International (CC BY 4.0).


CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings
nally, by performing a substitutability task within the        only one of the antonyms possesses, preventing their full
same context by switching the polarity of the context sen-     synonymy and interchangeability.
tence with a SOTA multilingual LLM, in order to study              Justeson and Katz [12] take a different approach. Using
when the adjective is switched to the positive un-prefixed     the Brown Corpus and Deese’s antonym dataset, they
adjective or to another, positive but morphologically un-      were among the first to study antonymy based on a cor-
related one (Section 5.3).                                     pus. They found that antonymic terms co-occur more
                                                               frequently than expected, confirming a syntagmatic rela-
                                                               tionship between them (in addition to the paradigmatic
2. Background                                                  one). This syntagmatic relationship was confirmed by
                                                               a more extensive work carried out in subsequent years:
Whereas the exploration of the competition between mor-
                                                               Jones [11] collected 56 antonym pairs, analyzing jour-
phological and lexical antonyms, addressed in this paper,
                                                               nalistic texts, to identify eight discursive functions of
has not gained much attention so far, the literature on
                                                               antonym co-occurrence.
antonymy in general is abundant, especially in relation
                                                                   A study that does address the competition between
to the English language.
                                                               forms is Aina et al. [13], who studied syntactically
   A term’s antonym is related to it according to three
                                                               negated adjectives and morphological antonyms (e.g.,
main characteristics: polarity, gradability and canon-
                                                               not happy vs unhappy, respectively). Using distributional
icity. The first two characteristics refer to the position-
                                                               semantics they found that a syntactically negated ad-
ing of the two antonymic terms w.r.t. the two poles
                                                               jective is more similar to the positive adjective than to
(polarity) of a graded (gradability) scale [3], along
                                                               its lexical antonym. Additionally, they show that the
which free positions can be occupied by other similar
                                                               morphological antonym is less similar to its lexical base
but differently graded terms. The scale is based on a spe-
                                                               compared to the similarity between a negated adjective
cific property that the two terms share. For example, in
                                                               and its non-negated counterpart.
the pair long-short, the two antonyms share the property
                                                                   Last but not least, a very recent typological study [14]
of “length”, defining the start and end of an axis whose
                                                               examines 37 antonym pairs across 55 languages, focusing
poles are defined by two terms, with long representing
                                                               on antonym formation. When a derived form is attested,
the “unmarked” base term of the opposition (this is why
                                                               it typically applies to the member of the pair with lower
we ask for “how long” rather than “how short” something
                                                               valence or lesser magnitude. Antonyms related to core
would last [4]).
                                                               property concepts (dimension, age, value, color) tend to
   However, there are cases where antonymic pairs
                                                               be expressed through distinct lexical forms (resulting in
are formed with potentially competiting antonyms,
                                                               lexical antonyms), while those related to peripheral prop-
like friendly-unfrendly vs. friendly-hostile: friendly-
                                                               erty concepts (physical property, human property, speed)
unfriendly is placed on a scale that defines a greater or
                                                               are generally encoded using derived forms (namely mor-
lesser degree (gradability) of a property, while friendly-
                                                               phological antonyms). Although the study is insightful
hostile are certainly in opposition but belong to two scales
                                                               and inspiring in a number of ways, the specific question
of incompatible properties. In terms of their gradability,
                                                               of which reasons underlie the preference for lexical or
therefore, it seems that the morphological antonym is
                                                               morphological forms in a single language remains unan-
“more gradable” than the lexical one.
                                                               swered.
   Canonicity, according to Paradis and Willners [5],
defines two semantically related and conventionalized
terms as a pair in the language. It is a gradual property      3. Morphological vs. Lexical
and can be possessed to a greater or lesser extent. A
high degree of canonicity translates into a high degree           Antonyms
of semantic-lexical embedding in memory and leads to
                                                               A morphological antonym (e.g., inattivo ‘inactive’
conventionalization in usage.
                                                               from attivo ‘active’) is immediately recognizable as a
   Psycholinguistic studies also suggest that canonical
                                                               negative term due to the presence of the negative prefix
antonyms derive from the speakers’ experience with the
                                                               un- ‘in-/un-’, while the nature of the opposition is less
language: the two terms are inseparable, one elicits the
                                                               immediate with a lexical antonym (e.g., passivo ‘pas-
other [6, 7, 8, 9, 10]. When a term has two structurally
                                                               sive’), because one has to identify the property shared by
different but semantically similar antonyms (Figure 1),
                                                               the two opposing terms.
canonicity is influenced by factors such as learned pref-
                                                                  Moreover, these two ways of forming an antonym
erence for specific pairings, the speaker’s familiarity due
                                                               from the same term create an asymmetric system: while
to exposure, and different nuances of meaning [11]. Out
                                                               one of the two terms in lexical opposition (e.g., attivo
of context, the antonyms may appear equivalent, but
                                                               ‘active’) has its morphological antonym (e.g., inattivo ‘in-
within context, a specific meaning may be activated that
active’), the other term (e.g., passivo ‘passive’) does not     a wider range of ways of not being friendly (such as being
(e.g., *apassivo, *impassivo, ‘*unpassive’). This imbalance     aloof) whereas hostile is fairly specific” (Murphy 2003:
is due to the greater emphasis that language places on          202). So, the morphological antonym would be more
everything that requires more precise specifications [15].      polysemous, while the lexical counterpart would have
The situation is not always perfectly equivalent interlin-      a narrower scope.
guistically: for example, a form like *invero to indicate          Given these two competing hypotheses, we aim to
the opposite of vero ‘true’ is not attested in Italian, while   empirically verify:
it is possible in English (untrue).
    However, there are cases in both Italian and English              • whether the lexical antonym is more frequent
where the two competitors have different nuances: for                   than the morphological antonym;
example, infelice-triste / unhappy-sad cover different con-           • whether the morphological antonym is actually
texts in that triste and sad convey a stronger emotional                less polysemous than the lexical antonym.
meaning, while infelice and unhappy encompass certainly
                                                                In order to do this, we design a set of experiments. We
strong but less intense feelings.[16]
                                                                first select antonym pairs; we calculate their frequency
    Regarding distribution in usage, lexical antonyms
                                                                of co-occurrence with nouns to have a defined context;
are predominant for more basic meanings, supporting the
                                                                then, we perform two tasks: (i) masked-token prediction
Principle of Least Effort theorized by Zipf [17], which sug-
                                                                and (ii) polarity flip.
gests that we expect the most used concepts to be coded
with short and simple words: basic terms therefore tend
to have structurally simple antonyms even when more             4. Dataset Construction
complex morphological antonyms would be possible. For
example, for a pair like alto-basso ‘tall-short’, there is no   4.1. Antonym Pair Selection
morphological counterpart that can be associated with ei-
ther term: neither *inalto ‘*untall’ nor *inbasso ‘*unshort’    For our study, we decided to focus on adjectives, as this
exist. These are canonical antonyms referring to basic          class is the most suitable for investigating antonymy,
language concepts: in cases like this, but not exclusively,     given that it includes content words that normally ex-
preference for simpler and more immediate words blocks          press qualities. Moreover, adjectives are semantically
the potential formation of a morphological antonym.             simpler compared to other word classes as they usually
    According to Murphy [4], culturally salient concepts        describe a single property that may be or may not be
necessitate clear and concise linguistic expressions. For       present to a greater or lesser degree ( Jones et al. [1]).
this reason, lexical antonyms (e.g., passivo ‘passive’) are        Starting from 1535 adjectives of the fundamental Ital-
the most frequent choice because they require less cogni-       ian lexicon extracted from the Italian dictionary Zingarelli
tive effort to understand. Although it is possible to create    2024, we filtered 303 items marked as ‘contr.’.
new opposite terms through derivational and morpholog-             We then selected, for this pilot study, 5 adjectives with
ical processes with speaker’s creativity, this option is less   the following properties: they needed to be adjectives
commonly employed in this context, as it is perceived as        with both a morphological and a lexical antonym; they
a deviation from the linguistic norm.                           needed to be maximally interchangeable in different con-
    The competition between the two terms of the                texts.1
antonym pair, i.e., the situation in which the usage con-          Finally, we created our initial dataset by pairing each
text of both terms is nearly the same and allowing for a        adjective with its corresponding morphological antonym
certain degree of substitutability, is still debated.           and a possible lexical antonym.
    According to one hypothesis, since the morphological           The morphological antonym was formed by using one
antonym is the “perfect” negation of a specific lexical         of the three possible prefixes productively used in Italian
base, it should occur in more restricted contexts (i.e., a      to create antonyms, namely: dis-, s-, in- [20]2 .
subset of the contexts of its positive counterpart) and            The lexical antonym was chosen randomly among all
should therefore have a narrower semantics (cf. [18,            possible options, taking into account synonymy with the
19, 15]). So, morphological antonyms would be less              morphological antonym and semantic neutrality. This
polysemous. On the other hand, the lexical antonym,             means that the lexical antonyms were selected to be ide-
not sharing identical lexical properties with the opposed       ally substitutable with the morphological ones in as many
term, should occur in broader contexts and thus be more         contexts as possible, and roughly possessing the same
polysemous.                                                     1
                                                                  The admittedly limited size of the dataset is due to the exploratory
    However, Murphy [4], examining the English triple             nature of our study.
friendly-unfriendly-hostile, notes that “The two antonyms       2
                                                                  The prefix in- is the most productive and the most widespread. It
are hardly equivalent, though, since unfriendly describes         often adapts phonetically to the bases it attaches to, forming the
                                                                  allomorphs im-, ir-, and il- via assimilation [20].
number of senses according to the dictionary (cf. Table 2). 4.3. Lexical Context Definition and
The synonymy between the two types of antonyms was                     Example-Sentence Extraction
further confirmed using Il grande dizionario dei Sinonimi
e dei Contrari, Zingarelli 2013 [21].                          Subsequently, for each antonym pair, eight nouns with
   Summing up, the antonym pairs examined are:                 different co-occurrence frequencies were selected. Specif-
                                                               ically, we considered both nouns that typically occur with
      • infelice ‘unhappy’ - triste ‘sad’                      one of the antonyms (e.g., matrimonio infelice), falling
      • impreciso ‘imprecise’ - approssimativo ‘approximative’ within the domain of collocations, and (more generic)
      • scorretto ‘incorrect’- sbagliato ‘wrong’               nouns whose co-occurrence frequencies with the lexical
      • imprudente ‘imprudent’ - avventato ‘reckless’          and the morphological antonym are very similar (e.g.,
      • insufficiente ‘insufficient’ - scarso ‘scant’          donna infelice and donna triste) (cf. Table 1).
resulting in 5 triplets (base, morph_ant, lex_ant).                          Noun + Adjective                                Frequency
                                                                             matrimonio infelice ‘unhappy marriage’             886
4.2. Corpus-based Analysis                                                   matrimonio triste ‘sad marriage’                    24
                                                                             donna infelice ‘unhappy woman’                     325
We analyzed the occurrences of the selected adjectives                       donna triste ‘sad marriage’                        316
with nouns in the itTenTen20 corpus, a large web corpus
                                                                       Table 1
of written Italian, searched through the SketchEngine
                                                                       Differences between high and low frequency name+adjective
platform https://www.sketchengine.eu/.
                                                                       co-occurrences
   The analysis of the occurrences highlighted that the
two types of antonyms display partially different collo-
cational preferences (see Appendix A, Table 3).                 The latter case is especially interesting for our current
   Overall, we can split the antonymic adjective-noun        purposes, as it represents possible ground for “compe-
couples in two groups according to whether the co-           tition”, namely a situation where the context of use is
occurrence is:                                               nearly the same and allows for a certain substitutability
                                                             between the two terms of the antonym pair.4
     • (i) polarization towards one of the two ad-              After defining the noun list, for each noun we ran-
       jectives: in these cases, we can speak of fairly domly selected 10 sentences containing the noun fol-
       stable distributional preferences, falling within lowed by the morphological antonym and ten sentences
       the realm of collocations or idiomaticity (e.g., mat- containing the same noun followed by the corresponding
       rimonio preferably selecting infelice rather than lexical antonym from itTenTen20. This was done for all
       triste);                                              eight nouns and for all five antonym pairs, resulting in a
     • (ii) similar with the two adjectives, indicating 800-sentence dataset.
       potential substitutability of the two antonyms in
       the same contexts (e.g., donna selecting both infe-
       lice and triste with similar relative frequencies).
                                                             5. Experiments
    Both groups are relevant to explore the context of use In Section 3 we outlined two possible hypotheses re-
of the two types of antonyms, although, for our current    garding the competition between the two terms of the
purposes, we specifically targeted the nouns in group      antonym pair and we selected the following as a working
(ii), namely nouns that occur with both adjectives, sug-   hypothesis: the morphological antonym, being formed
gesting a certain degree of competition between the two    by a negative prefix applied to a specific lexical base,
antonyms: see sentence 1, where infelice ‘unhappy’ can     would have more restricted usage contexts (possibly a
be replaced by the lexical antonym triste ‘sad’.           subset of the contexts of the lexical base), and therefore
                                                           be less polysemous than the lexical one; on the other
    1. Un ritratto preciso ma discontinuo che ci restitu- hand, the lexical antonym, not sharing morphological
       isce l’immagine di una donna infelice, umiliata, structure with the opposing term, would semantically
       affranta, ma non distrutta, non arresa alla sorte 3 cover some or all of its meanings along with other inde-
                                                           pendent meanings, resulting in broader usage contexts
                                                           and greater polysemy.
                                                             To verify this hypothesis, we performed 2 sets of exper-
                                                           iments: (i) masked-token prediction, to estimate the
3
‘A precise but fragmented portrait that gives us the image of an un-
                                                                       4
happy, humiliated, distraught woman, but not destroyed, not resigned       For a detailed view of the selected adjectives and nouns, as well as
to fate’.                                                                  their co-occurrence frequencies, see Appendix A.
probability of occurrence of one antonym or the other ac-       We used bert-base-italian-xxl-cased language model5
cording to a native Italian language model; (ii) polarity to perform a token prediction task by masking the
flip, to transform the collected sentences from a negative antonym present in each sentence. The model was asked
meaning to a positive meaning.                               to predict the probability of occurrence between the two
                                                             possible antonyms; then, we took the alternative with the
5.1. Word Senses and Lexical Variety                         highest probability according to the model as the model
                                                             prediction.
Our analysis started with the identification of adjectives
and their possible antonyms, which, as mentioned (Sec-
                                                             5.3. Polarity-flip
tion 4.1), have been chosen on the basis of their possible
substitutability within the same context. For this reason, In this task, we asked a SOTA LLM, GPT-4o, to transform
first of all, the various dictionary definitions of antonyms the sentences extracted from the dataset, both those con-
were taken into consideration. We counted how many taining the morphological antonym and those containing
senses are associated to each antonym in the Zingarelli the lexical antonym, into positive sentences.
dictionary [22], taking the number of senses reported as        We used the same prompt for all antonym pairs,
a first proxy of semantic broadness.                         parametrising the antonyms and sentences presented,
   As a second proxy, the semantic coverage of each by asking the model to flip the sentence from a negative
antonym was taken into account. We conducted an anal- sense to a positive one, always by changing the adjective
ysis of the lexical variety of each group’s context in the accompanying the target noun.6
selected sentences, by calculating the token/type ratio         We then fetched the new adjective generated by the
for each group. We report the results in Table 4. As can model, and calculated when the new, positive adjective
be seen, no relevant differences were found according to coincided with the lexical base, and when not.
these features.                                                 The rationale behind this is that the senses of an
                                                             antonym can be separated through the various positive
5.2. Masked-token Prediction                                 terms with which it can be changed. We expected that
                                                             sentences containing the morphological antonym would
According to our hypothesis, in this task we expect that be turned into positive using their lexical base more of-
the predicted antonym will have a higher prediction ac- ten than their lexical counterparts, indicating a narrower
curacy in the sentences with the highest occurrence of semantics.
the adjective with the selected words (represented by
high relative frequency). In contexts with similar fre-
quencies, we expect that accuracy should be similar for 6. Analysis and Results
both antonyms, showing a genuine competition between
the two, as the language model should not have a specific As regards the number of meanings listed in the dictio-
preference.                                                  nary for the two terms of the antonym pair (Table 2),
   We previously said that the words that form the these are almost the same, indicating that the recognized
antonym pairs can be considered synonymous. In fact, senses of each antonym alone are not decisive to de-
full synonyms are rather rare (Murphy [4], among oth- termine the selection between one or the other. As for
ers), also because languages tend to avoid synonymy by lexical variability (see Appendix B, Table 4), token/type
differentiation in terms of meaning or distribution [23]. ratio also fails to reveal a significant difference between
Therefore, the two terms of the pair are better regarded as morphological and lexical antonyms; in only three cases
near-synonyms, meaning that one term can cover almost does the token/type ratio of lexical antonyms exceed
all the meanings of the other but not all of them.           that of morphological antonyms. This seems to indicate
   To evaluate the factors that lead to the choice of one that factors other than contextual variability underlie the
antonym over the other, we decided to observe how a na- preference for one or the other.
tive Italian language model pre-trained for masked-token        Let us now consider Table 3, Appendix A. In addition to
prediction model behaves in terms of the probability of      the co-occurrences of nouns with the two antonyms, the
occurrence of an antonym in a given context. In this accuracy for the two tasks for each noun-adjective pair is
respect, see Niwa et al.[24], who used BERT to predict also provided (further divided into noun-morphological
antonyms in specific contexts: experiments on Japanese antonym and noun-lexical antonym). The distribution
slogans showed a top-1 accuracy of 29.3% and a top-10 of nouns in this table follows a specific order: nouns at
accuracy of 53.8%, with human evaluations confirming the extremes exhibit occurrence frequencies polarized
that over 85% of predicted antonyms were appropriate,       5
                                                                dbmdz/bert-base-italian-xxl-uncased
demonstrating the method’s effectiveness in capturing       6
                                                                For detailed information on the prompt used in this study, please
contextually relevant antonyms.                                 refer to the Appendix B.
                                                                                   accuracy      accuracy
                                                                                                                 polarity    polarity
                                   senses       senses      rel.freq   rel.freq   mask token    mask token
                                                                                                                   flip        flip
                                                                                  prediction    prediction
  Couple                         Exp.morf      Exp.lex     Exp.morf    Exp.lex     Exp.morf       Exp.lex       Exp.morf      Exp.lex
  infelice-triste                    4           4           0.48       0.52         0.525         0.687          0.125        0.137
  impreciso-approssimativo           2           2           0.57       0.43         0.256         0.833          0.081        0.222
  scorretto-sbagliato                3           3           0.48       0.52         0.025         0.986          0.076        0.144
  imprudente-avventato               2           1           0.43       0.57         0.437         0.637          0.012        0.075
  insufficiente-scarso               3           3           0.40       0.60         0.743         0.662          0.012        0.037


Table 2
Final experiment results. Number of senses, average of relative frequency and average of the accuracy of the two tasks: mask
token prediction and polarity flip.


towards either the lexical antonym or the morpholog-                range of senses (see, e.g., Murphy [4]).
ical antonym. Some nouns form with the adjective a                     We believe that these results, that contradict our ini-
fairly stable collocation, while other nouns form freer ex-         tial hypothesis, open up new avenues for future research
pressions. For the purposes of this study, particularly in          in this area, despite the limitations of the present study,
relation to the analysis of competition, the central nouns          which has an exploratory nature and a narrow empiri-
with similar frequencies are of greater importance.                 cal coverage. Indeed, only 5 adjectives were analyzed,
   Upon examining the occurrences and accuracy, we                  exclusively belonging to core vocabulary. Another short-
observe that the values are comparable.                             coming is that, unlike for English, there are no in-depth
   As for masked-token prediction, the consistent higher            studies on antonyms in Italian. However, we want to
values of the lexical antonym indicate higher predictabil-          stress the importance of conducting studies on languages
ity and/or higher degree of idiomaticity, which contra-             other than English to avoid the well-known Anglocentric
dicts our working hypothesis that lexical antonyms dis-             bias.
play a broader semantic coverage.                                      Hopefully, our results will be challenged by further
   Finally, as for the polarity flip, consistently with the         studies in the future, which might even overturn our con-
masked-token experiments, the sentences containing the              clusions entirely, if a larger data set were considered. Fur-
morphological antonym were turned into positive using               thermore, it would be interesting to investigate whether
the lexical base fewer times than their lexical counter-            the results obtained for Italian are also found for other
parts, suggesting that the latter may have a more re-               languages that present both lexical and morphological
stricted semantic spectrum, contrary to our initial hy-             antonyms, including languages with a different morpho-
pothesis.7                                                          logical system. With a view of deepening the analysis
                                                                    methodologically, it would be interesting to focus on
                                                                    additional linguistic factors that might drive the choice
7. Conclusion and Future Work                                       between lexical and morphological antonyms, such as
                                                                    semantic networks or word frequency, and to expand the
Our study investigated the differences and competition
                                                                    testing to the psycholinguistic dimension.
between two types of antonyms, morphological and lexi-
                                                                       What is sure is that the relationship between mor-
cal, focusing on a computational account of their context
                                                                    phological and lexical antonyms is more complex than
of use. While a lexical analysis did not prove decisive,
                                                                    previously thought and that the choice of one type of
experiments on masked-token prediction and polarity
                                                                    antonym over another depends on a variety of intercon-
flip, aimed at approximating their semantic coverage, in-
                                                                    nected factors that are still to be fully unveiled.
dicate that, unlike what is suggested in some studies on
antonymy, the lexical antonym seems to have a narrower
lexical coverage and scope, supporting the view that it References
is actually the morphological antonym, despite its closer
relationship with the lexical base, that displays a wider             [1] S. Jones, M. L. Murphy, C. Paradis, C. Willners,
                                                                          Antonyms in English: Construals, Constructions
7
  An anonymous reviewer observes that this result is even more re-        and Canonicity, Cambridge University Press, 2012.
  markable given the potential purely morphological (rather than      [2] D. A. Cruse, Lexical semantics, Cambridge Univer-
  semantic) bias due to the derivational relatedness of the morpho-       sity Press, 1986.
logical antonym, that could be predicted to favour its replacement
by the target positive adjective to some degree.
 [3] E. Sapir, Grading, a study in semantics., Philosophy    [20] C. Iacobini, Prefissazione, in: M. G. e F. Rainer (Ed.),
     of Science 11(2) (1944) 93–116.                              La formazione delle parole in italiano, Tübingen:
 [4] M. L. Murphy, Semantic relations and the lexicon,            Niemeyer, 2004, pp. 97–163.
     Cambridge University Press, 2003.                       [21] G. Pittàno, Il grande dizionario dei Sinonimi e dei
 [5] C. Paradis, C. Willners, Antonymy and negation -             Contrari, Zanichelli, 2013.
     the boundedness hypothesis, Journal of Pragmatics       [22] N. Zingarelli, Zingarelli 2024. Vocabolario della lin-
     38 (2006) 1051–1080.                                         gua italiana, Zanichelli, 2023.
 [6] D. S. Palermo, J. J. Jenkins, Word association norms:   [23] M. Aronoff, Competition and the lexicon, in: E. An-
     grade school through college, Minneapolis: Univer-           nibale, C. Iacobini, M. Voghera (Eds.), Livelli di anal-
     sity of Minnesota Press, 1964.                               isi e fenomeni di interfaccia, Roma: Bulzoni, 2016.
 [7] J. Deese, The structure of associations in language     [24] A. Niwa, K. Nishiguchi, N. Okazaki, Predicting
     and thought, Baltimore: Johns Hopkins University             antonyms in context using BERT, in: A. Belz,
     Press, 1965.                                                 A. Fan, E. Reiter, Y. Sripada (Eds.), Proceedings
 [8] W. G. Charles, G. A. Miller, Contexts of antonymous          of the 14th International Conference on Natural
     adjectives, Applied Psycholinguistics 10 (1989)              Language Generation, Association for Computa-
     357–375.                                                     tional Linguistics, Aberdeen, Scotland, UK, 2021, pp.
 [9] D. J. Hermann, G. Conti, D. Peters, P. H. Robbins,           48–54. URL: https://aclanthology.org/2021.inlg-1.6.
     R. J. S. Chaffin, Comprehension of antonymy and              doi:10.18653/v1/2021.inlg- 1.6 .
     the generality of categorization models, Journal
     of Experimental Psychology: Human Learning and
     Memory 5 (1979) 585–597.
[10] D. Gross, U. Fischer, G. A. Miller, The organization
     of adjectival meanings, Journal of Memory and
     Language 28 (1989) 92–106.
[11] S. Jones, Antonymy: a corpus-based approach,
     Routledge, 2002.
[12] J. S. Justeson, S. M. Katz, Co-occurrences of antony-
     mous adjectives and their contexts, Computational
     Linguistics 17 (1991) 1–20. URL: https://aclantholo
     gy.org/J91-1001.
[13] L. Aina, R. Bernardi, R. Fernández, Negated adjec-
     tives and antonyms in distributional semantics: not
     similar, in: Proceedings of the Fifth Italian Confer-
     ence on Computational Linguistics, IJCoL, 2019, pp.
     57–71. URL: http://journals.openedition.org/ijcol/4
     57. doi:https://doi.org/10.4000/ijcol.457 .
[14] M. Koptjevskaja-Tamm, M. Miestamo, C. Börstell,
     A cross-linguistic study of lexical and derived
     antonymy, Linguistics (2024). URL: https://doi.
     org/10.1515/ling-2023-0140. doi:doi:10.1515/li
     ng- 2023- 0140 .
[15] F. Vicario, Note sull’ordine degli elementi in coppie
     di verbi antonimi., Linguistica XLIII 43 (2003) 3–12.
[16] V. Muehleisen, Antonymy and Semantic range
     in English, Ph.D. thesis, Northwestern University,
     Evanston, 1997.
[17] G. K. Zipf, Human behavior and the principle of
     least effort, Cambridge Addison-Wesley, 1949.
[18] R. von Jhering, Der Zweck im Recht, Leipzig: Bre-
     itkopf and Hӓrtel, 1883.
[19] K. E. Zimmer, Affixal negation in english and other
     languages: an investigation of restricted productiv-
     ity, Word. Journal of the Linguistic Circle of New
     York 20 (1964).
APPENDIX A
                                                               accuracy     accuracy     accuracy     accuracy
        Noun                 infelice            triste
                                                                mtp (M)      mtp (L)      pf (M)        pf (L)
        matrimonio              886                24             0.9          0.5          0.2          0.4
        scelta                  821                39             0.8          0.5           0           0.1
        adolescenza             33                 16              1           0.2           0           0.2
        donna                   325               316             0.5          0.8          0.2          0.1
        uomo                    300               440             0.5          0.6          0.4          0.1
        situazione              120               339             0.4          0.9           0            0
        momento                 145              2.686            0.1           1            0           0.2
        pagina                  19                544              0            1           0.2           0
                                                               accuracy     accuracy     accuracy     accuracy
        Noun                impreciso      approssimativo
                                                                mtp (M)      mtp (L)      pf (M)        pf (L)
        affermazione            64                 11             0.2          0.8          0.1          0.1
        notizia                 223                58             0.4          0.8           0           0.1
        terminologia            23                 17             0.3          0.7          0.1          0.2
        ricezione               50                 18             0.7          0.3          0.2          0.1
        misurazione             31                 52             0.4          0.9          0.1          0.5
        traduzione              54                235             0.2          0.7           0            0
        conoscenza              33                226             0.1          0.9           0            0
        calcolo                 23               1.305            0.2           1           0.1          0.6
                                                               accuracy     accuracy     accuracy     accuracy
        Noun                 scorretto         sbagliato
                                                                mtp (M)      mtp (L)      pf (M)        pf (L)
        gioco                  1.124               72             0.1          0.7           0            0
        uso                    2.901              276              0            1            0           0.3
        alimentazione          2.926             1.228             0           0.9           0           0.1
        posizione              1.317              910             0.1           1           0.3          0.2
        abitudine               648              1.077            0.1           1           0.1           0
        informazione            720              1.903             0            1           0.2          0.3
        mossa                    90               741              0            1            0           0.1
        messaggio                88              1.214            0.1          0.9           0           0.1
                                                               accuracy     accuracy     accuracy     accuracy
        Noun               imprudente         avventato
                                                                mtp (M)      mtp (L)      pf (M)        pf (L)
        condotta                394               14              0.2          0.8           0           0.2
        comportamento           680               77              0.5          0.8           0            0
        parola                  49                46              0.3          0.5           0            0
        manovra                 24                35              0.5          0.6          0.1          0.4
        gesto                   54                195             0.7          0.4           0            0
        azione                  63                245             0.3          0.8           0            0
        scelta                  73                373             0.5          0.7           0            0
        decisione               24                478             0.5          0.5           0            0
                                                               accuracy     accuracy     accuracy     accuracy
        Noun               insufficiente        scarso
                                                                mtp (M)      mtp (L)      pf (M)        pf (L)
        apporto                 263               19              0.9          0.4           0           0.2
        quantità               1.097              162              1           0.4          0.1           0
        alimentazione           300               120              1           0.5           0            0
        produzione              208               158             0.9          0.4           0            0
        utilizzo                 16               27              0.9          0.9           0            0
        pulizia                  63               148             0.4          0.8           0            0
        partecipazione           13               37              0.5          0.9           0            0
        visibilità               14               274             0.4           1            0           0.1


Table 3
Co-occurrence frequencies of noun + morphological antonym and noun + lexical antonym.
Accuracy of the two task: mtp (Masked-Token Prediction) and pf (Polarity Flip) related to Morphological Antonyms (M) and
Lexical Antonyms (L).
                     morphological antonym             TTR          lexical antonym       TTR
                              infelice             0.4694864048            triste     0.4504979496
                            impreciso              0.4726656991      approssimativo   0.4814814815
                             scorretto             0.4496086106         sbagliato     0.4500775996
                           imprudente              0.476119403          avventato     0.4644572526
                          insufficiente            0.4582118562           scarso      0.4805725971


Table 4
Token Type Ratio of 5 antonym pair from sentences extracted from itTenTen20


APPENDIX B
system_message = '''In una frase l'aggettivo originale è stato sostituito da un token [MASK]. Tu devi riscrivere la
      frase facendo minimi cambiamenti e sostituire l'aggettivo mascherato con un altro aggettivo, in modo che la frase
       risulti volta al positivo.
Il tuo output deve essere SOLO un json nel seguente formato e con i seguenti campi:
    {"new_sentence": "<tua nuova frase>",
    "new_adj": "<l'aggettivo con cui hai sostituito [MASK] nella nuova frase>"}'''
user_message = f'''Frase originale: "{masked_sent}" (aggettivo originale: {agg})'''


{system_message = f'''In a sentence, the original adjective has been replaced by a [MASK] token.
You need to rewrite the sentence making minimal changes and replace the masked adjective with another adjective, so
      that the sentence is positively oriented.
Your output must be ONLY a json in the following format and with the following fields:\n''' + '''{"new\_sentence": "<
      your new sentence>", "new\_adj": "<the adjective with which you replaced [MASK] in the new sentence"}'''
user\_message = f'''Original sentence: "{masked\_sent}" (original adjective: {agg})'''}

</pre>