<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A distributional study of negated adjectives and antonyms</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Laura Aina</string-name>
          <email>laura.aina@upf.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Raffaella Bernardi University of Trento Trento</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Raquel Ferna ́ ndez University of Amsterdam Amsterdam</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Universitat Pompeu Fabra Barcelona</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. In this paper, we investigate the relation between negated adjectives and antonyms in English using Distributional Semantics methods. Results show that, on the basis of contexts of use, a negated adjective (e.g., not cold) is typically more similar to the adjective itself (cold) than to its antonym (hot); such effect is less strong for antonyms derived by affixation (e.g., happy - unhappy). Italiano. In questo lavoro, analizziamo la relazione fra aggettivi negati e antonimi in inglese utilizzando metodi di Semantica Distribuzionale. I risultati mostrano che, sulla base dei contesti di uso, la negazione di un aggettivo (ad es. “not cold”; it.: “non freddo”) e` tipicamente piu` simile all'aggettivo stesso (“cold”; it.: “freddo”) che al suo antonimo (“hot”; it.: “caldo”). Tale effetto e` meno accentuato per antonimi derivati tramite affissi (ad es. “happy”-“unhappy”; it.: “felice”“infelice”).</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Negation has long represented a challenge
for theoretical and computational linguists (see
Horn (1989) and Morante and Sporleder (2012)
for overviews): in spite of the relative simplicity
of logical negation (:p is true $ p is false),
complexity arises when negation interacts with
morphosyntax, semantics and pragmatics.</p>
      <p>In this work, we focus on the negation of
adjectives in English, expressed by the particle not
modifying an adjective, as in not cold. A na¨ıve</p>
      <p>
        Part of the work presented in this paper was carried
out while the first author was at the University of Amsterdam.
account of these expressions would be to equate
them to antonyms, and hence take them to
convey the opposite of the adjective (e.g., not cold =
hot). In fact, this simplifying assumption is
sometimes made in computational approaches which
model negation as a mapping from an adjective to
its antonym (e.g., The Pham et al., (2015), Rimell
et al., (2017)). However, a range of studies
support what is known as mitigation hypothesis
        <xref ref-type="bibr" rid="ref12 ref15 ref9">(Jespersen, 1965; Horn, 1972; Giora, 2006)</xref>
        ,
according to which a negated adjective conveys an
intermediate meaning between the adjective and its
antonym (e.g., not large medium-sized). The
meaning of the adjective is mitigated by negation,
while some emphasis on it still persists in
memory
        <xref ref-type="bibr" rid="ref8">(Giora et al., 2005)</xref>
        . This view is
compatible with pragmatic theories predicting that the use
of a more complex expression (not large) when a
simpler one is available (small) triggers the
implicature that a different meaning is intended (e.g.,
medium-sized)
        <xref ref-type="bibr" rid="ref10 ref13">(Grice, 1975; Horn, 1984)</xref>
        .
Computational models predicting similar mitigating
effects are those by Hermann et al., (2013) and
Socher et al., (2012; 2013).
      </p>
      <p>
        In this work, we investigate negated
adjectives from the perspective of Distributional
Semantics
        <xref ref-type="bibr" rid="ref19 ref31">(Lenci, 2008; Turney and Pantel, 2010)</xref>
        .
We study antonymic adjectives and their negations
in terms of their distribution across contexts of
use: to this end, we employ an existing dataset
of antonyms, whose annotation we further extend,
and the distributional representations of these and
their negated version, as derived with a standard
distributional model. This allows us to conduct
a data-driven study of negation and antonymy
that covers a large set of instances. We compare
pairs of antonyms with distinct lexical roots and
those derived by affixation, i.e., lexical and
morphological antonyms
        <xref ref-type="bibr" rid="ref16">(Joshi, 2012)</xref>
        (e.g., small
large and happy - unhappy respectively).
Moreover, we investigate the distinction between lexical
antonyms that are contrary or contradictory, that
is, those that have or do not have an available
intermediate value
        <xref ref-type="bibr" rid="ref19 ref7">(Fraenkel and Schul, 2008)</xref>
        : e.g.,
something not cold is not necessarily hot - it could
be lukewarm - but something not present is absent.
As for negations of morphological antonyms, we
compare instances of simple and double
negation, where the latter occurs if the antonym that is
negated is an affixal negation (e.g., not unhappy).
      </p>
      <p>Our analyses show that, when considering
distributional information, negated adjectives are
more similar to the adjective itself than to the
antonym (e.g., not cold is closer to cold than
to hot), regardless of the type of antonym or of
negation. However, we find that morphological
antonymy is closer to negation than lexical one is.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Motivation and data</title>
      <p>
        We are interested in how negation acts with respect
to pairs of adjectives connected by the lexical
relation of antonymy
        <xref ref-type="bibr" rid="ref23">(Murphy, 2003)</xref>
        , i.e., that are
associated with opposite properties within the same
domain (e.g., hot - cold). In particular, we want
to compare the negation of one of the antonymic
adjectives with itself and its antonym respectively
(e.g., not cold vs. cold and vs. hot). Our data of
interest are then triples obtained starting from an
antonymic pair and negating one of the two items
(for each pair we obtain two triples). For example:
(1)
(2)
h hot, cold, not fhotjcoldgi
hhappy, unhappy, not fhappyjunhappygi
As data, we make use of a subset of the
Lexical Negation Dictionary by Van Son et al. (2016).
This consists of antonym pairs in WordNet
        <xref ref-type="bibr" rid="ref6">(Fellbaum, 1998)</xref>
        annotated for different types of
lexical negation
        <xref ref-type="bibr" rid="ref16">(Joshi, 2012)</xref>
        . We consider
adjective pairs that are either lexical antonyms, i.e., with
distinct lexical roots (e.g., cold - hot), or
morphological antonyms, i.e., derived by affixal negation
(e.g., happy - unhappy).1 In our analyses, we
compare different subsets of the data: we explicate and
motivate the distinctions in the following.
Lexical vs. morphological antonyms These
two groups are usually taken to express the same
lexical relation - i.e., opposition - and to be
different only on morphological terms. However, such
adj.
      </p>
      <p>not adj.</p>
      <p>
        # triples
Lexical antonyms
– contrary
– contradictory
Morphological antonyms
– simple negations
– double negations
difference might affect their relation with negated
adjectives: indeed, affixal negations have a
morphological structure that resembles negated
adjectives (e.g., un-happy vs. not happy). For this
reason, we keep triples derived from lexical and
morphological antonyms distinct, and compare them
in our analyses: in particular, we are interested
in testing whether in a distributional space
negation tends to be more similar to morphological
antonymy than to lexical one. Besides this
comparison, we apply other distinctions to the triples
obtained with lexical and morphological antonyms
respectively, in order to investigate further effects.
Contrary vs. contradictory Lexical antonyms
have been classified as either contradictory or
contrary
        <xref ref-type="bibr" rid="ref5">(Clark, 1974)</xref>
        , depending on whether the
negation of one entails the truth of the other,
without the availability of a mid-value. Fraenkel
and Shul (2008) provided psycholinguistic results
showing that if an adjective is part of a
contradictory pair, its negation is interpreted as closer to the
antonym than if it is part of a contrary pair (e.g.,
not dead is interpreted as being closer to alive than
not small to large). We aim to investigate this
result in a distributional space, where we are able to
quantify similarities between lexical items.
      </p>
      <p>Since no data annotated with respect to this
distinction is available, the three authors
independently annotated the antonym pairs in the
dataset as either contrary, contradictory or
unclear, following the definition used by Fraenkel
and Shul (2008).2 Not surprisingly, the
interannotator agreement is only moderate (Fleiss’ k =
0:37): already Fraenkel and Shul (2008) noted
that even for what they considered contradictory
pairs it is possible to conceive a mid-value
interpretation (e.g., not dead half-dead; Paradis and
Willners (2006)). This suggests that the contrary
1In the dataset, the former are coded as regular antonyms
and the latter as direct affixal negations.</p>
      <p>2Annotation guidelines at https://lauraina.
github.io/data/notadj.pdf
vs. contradictory distinction involves a continuum
rather than a dichotomy. We leave this aspect to
be further clarified by future research and, for the
purpose of our analysis, only consider pairs
classified with full agreement.</p>
      <p>Simple vs. double negation In the case of
morphological antonyms, one of the two
adjectives is an affixal negation, and hence already
contains a negating prefix (such as un- in
unhappy): adding not thus gives rise to a double
negation (e.g., not unhappy). These expressions
have been widely studied in the literature due
to their difference with double negation in logic
(e.g., Bolinger (1972), Krifka (2007) and recently
Tessler and Franke (2018)). While in logic two
negations cancel each other out (::p p), in
natural language double negations are typically
employed to weaken the meaning of the adjective that
is negated twice (e.g., not unhappy 6= happy) . Our
goal is to test whether evidence for this effect is
found in a distributional space: in particular, if two
negations were to cancel each other out then the
negation of an affixal negation (e.g., not unhappy)
should be particularly close to the antonym (e.g.,
happy). We then test whether simple (e.g., not
happy) and double (e.g., not unhappy) negations
exhibit similar trends in relation to an antonym
pair (happy vs. unhappy).
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>Analyses</title>
      <sec id="sec-3-1">
        <title>Methods</title>
        <p>
          Previous studies about negation of adjectives
described its effect as a meaning shift from the
adjective towards the antonym, that can be measured in
terms of semantic similarity
          <xref ref-type="bibr" rid="ref19 ref7">(Fraenkel and Schul,
2008)</xref>
          . Distributional Semantics offers us a
datadriven method of quantifying this: we can
represent expressions as vectors summarizing their
large-scale patterns of usage and then interpret
their proximity relations in terms of similarity.
        </p>
        <p>
          To this aim, we build a distributional semantic
model with standard techniques, but whose
vocabulary includes, besides word units, also negated
adjectives. In practice, each occurrence of a
negated adjective (adjacent occurrence of not and
an adjective without intervening words; e.g., we
exclude cases like not very cold) is treated as a
single and independent token (e.g., not cold ;
not cold). With this pre-processing, we train a
word2vec CBOW model
          <xref ref-type="bibr" rid="ref20">(Mikolov et al., 2013)</xref>
          3
on the concatenation of UkWaC and
WackypediaEn corpora (2.7B tokens; Baroni et al., (2009)),
setting parameters as in the best performing model
by Baroni et al. (2014).4 We do not carry out
any hyperparameters search, nor we employ any
ad hoc techniques aimed at, for example,
amplifying the distances between antonyms in the
semantic space (such as that of Nguyen et al. (2016)
or The Pham et al. (2015)). Indeed, we are
interested in investigating characteristics of antonyms
and negated adjectives in a standard distributional
model, that is not fine-tuned to a particular task
and where no assumptions about the structure of
its space are incorporated. However, we assess the
quality of the induced model through a similarity
relatedness task, where we find that it achieves
satisfying performances.5
        </p>
        <p>
          For our analyses, we consider triples as
those described in Section 2. Given a triple
hai; aj ; not aii (e.g., cold, hot, not cold), we
define the following score:
(3) Shift := Sim(not ai; aj )
Sim(not ai; ai)
where i6=j, and Sim(not ai; aj ) and Sim(not ai; ai)
are the cosine similarities of the negated adjective
with the antonym and the adjective, respectively.
This measures how much closer a negated
adjective is to the antonym than to the adjective (i.e.,
how much closer not cold is to hot than to cold),
and hence how much negation shifts the
meaning of an adjective towards that of the antonym.
Due to the well-known tendency of antonyms to be
close in a distributional space
          <xref ref-type="bibr" rid="ref21">(Mohammad et al.,
2013)</xref>
          , the absolute value of Shift is not expected
to be high (a vector close to one is likely close to
the other too). However, we can test whether a
higher proximity is registered towards one of the
two adjectives.
        </p>
        <p>From the data introduced in Section 2, we only
consider triples where each of the three elements
occurs at least 100 times in the training corpus of
the distributional model. Table 1 shows the
number of triples considered for each class and the
average frequency of adjectives and negated
adjectives.6 The number of contradictory triples is
3Gensim implementation.</p>
        <p>4Vectors size: 400; window size: 5; minimum frequency:
20; sample: 0.005; negative samples: 1.</p>
        <p>
          5Spearman’s of 0.75 on the MEN dataset
          <xref ref-type="bibr" rid="ref4">(Bruni et al.,
2014)</xref>
          ; see results by Baroni et al. (2009) for a comparison.
        </p>
        <p>6Negated adjectives are overall less frequent than their
non-negated counterparts, as shown in Table 1.
small due to the choice of keeping only antonyms
for which we had full agreement in the annotation;
double negations triples are few due to the limited
frequency of these expressions in the corpus.7
Lexical vs. morphological antonyms The
average Shift scores of both classes are negative,
showing that a negated adjective is typically closer
to the adjective than to the antonym. Indeed,
as shown in Table 3, the nearest neighbor of a
negated adjective is often the related adjective. On
one hand, this could be seen as supporting the
idea that negated adjectives express an
intermediate meaning between that of the adjective and the
antonym (e.g., not small is close to normal-sized).
More in general, it shows that negated adjectives
have a profile of use that is more similar to that of
the adjective than to the antonym.</p>
        <p>The two classes of antonyms differ significantly
in the extent of this effect: negated adjectives are
closer to a morphological antonym than a
lexical one (e.g., not perfect vs. imperfect, not wide
vs. narrow). Such similarity in distribution can be
explained by the similarity in structure, and hence
possibly in meaning, of negated adjectives and
affixal negations. Yet, in spite of the higher
similarity in use, affixal negation still does not seem
equivalent to negation by not, due to the negative
average Shift value.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Contrary vs. contradictory antonyms In con</title>
        <p>trast to the results from the linguistic literature (see
Section 2), the behavior of contrary and
contradictory antonym pairs is not significantly
different in our analysis. When we look into a
distributional space, even for contradictory antonyms, the
negated adjectives tend to be more similar to the
adjective itself than to the antonym.</p>
        <p>This result points at the fact that distributional
similarity is capturing a different type of
similarity from that considered in the experiments of
Fraenkel and Shul (2008). We cannot thus directly
interpret our results as just a product of the
mitigating aspect of negation. Distributional
information may discriminate between the negation of
7Full list of triples at https://lauraina.
github.io/data/notadj.pdf
an adjective and the antonym, even when the two
seem intuitively equivalent (e.g., not dead is closer
to dead than to alive): indeed, the use of one or
the other may serve different functions (e.g.,
contradicting an expectation, politeness, etc.),
leading them to appear in different contexts.
Moreover, we find that, since continuous
representations are able to capture nuanced differences, the
alleged dichotomy between contrary and
contradictory antonyms may become a continuum in
distributional space: for example, one of the closest
adjectives to not dead is half-dead. This further
underscores the difficulty in distinguishing
between contrary and contradictory antonyms which
we had already encountered in the annotation.
Simple vs. double negations There is not a
significant difference between negated adjectives that
are instances of simple and double negations:
crucially, it is not the case that double negations are
very close to the antonym as a result of the two
negations canceling each other out (e.g., not
unhappy is closer to unhappy than to happy).</p>
        <p>As before, the result cannot be interpreted only
in terms of mitigation (though, e.g., not unhappy is
close to unimpressed, hence a mid-value between
happy and unhappy). In general, it suggests that
the contexts of use of double negations are more
similar to the ones of the adjective that is negated
than to those of its antonym. Indeed, double
negations typically appear in contexts where the use
of the “logically” equivalent alternative (i.e., the
antonym) is to be avoided for pragmatic reasons,
as possibly too strong or direct (e.g., not
unproblematic vs. problematic; Horn, (1984)).
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>We have investigated negated adjectives using the
tools of Distributional Semantics, which allows us
to quantify the similarities between expressions
on the basis of how they are used. Our
analyses show that, when considering contexts of
occurrence, negating an adjective does not make it
closer to the antonym than to the adjective itself.
This can be seen as a result of the various
functions of negation (e.g., mitigation, contradiction to
an expectation, politeness) that may lead to
different patterns of use for negated adjectives and
antonyms. Further research may shed light on
which type of contexts actually discriminate them,
for example through a corpus study, and which
other properties negated adjectives have in a
distriLexical antonyms
Contrary antonyms
Simple negations
butional space, such as their interaction with scalar
dimensions (e.g., not hot vs. freezing, cold,
lukewarm, hot etc.; Wilkinson and Tim (2016)).
Finally, while for the purpose of this study we opted
for a standard word2vec model, one could test for
the same effects with differently obtained
distributional vectors.</p>
      <p>Despite its current limitations in covering
truthrelated aspects of meaning, Distributional
Semantics was shown by Kruszewski et al. (2017) to be
apt to model at least some aspects of negation,
especially if graded in nature, such as
alternativehood. Our study provides supporting evidence
for this line of research and in addition points at
the utility of using Distributional Semantics to
uncover nuanced differences in use between a
negation and other expressions, even when logically
equivalent. Moreover, we regard our results to be
of general interest for the NLP community, since
effects of negation like the ones we studied and
how they are represented in a distributional space
can be critical for tasks like sentiment analysis
(e.g., what does it imply that a costumer is not
happy or not unhappy with a product?; Wiegand
et al, (2010)).</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This project has received funding from the
European Research Council (ERC) under the
European Union’s Horizon 2020 research and
innovation programme (grant agreement No 715154),
and by the Catalan government (SGR 2017 1575).
This paper reflects the authors’ view only, and the
EU is not responsible for any use that may be made
of the information it contains.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Marco</given-names>
            <surname>Baroni</surname>
          </string-name>
          , Silvia Bernardini, Adriano Ferraresi, and
          <string-name>
            <given-names>Eros</given-names>
            <surname>Zanchetta</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>The wacky wide web: a collection of very large linguistically processed web-crawled corpora</article-title>
          .
          <source>Language resources and evaluation</source>
          ,
          <volume>43</volume>
          (
          <issue>3</issue>
          ):
          <fpage>209</fpage>
          -
          <lpage>226</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Marco</given-names>
            <surname>Baroni</surname>
          </string-name>
          ,
          <source>Georgiana Dinu, and Germa´n Kruszewski</source>
          .
          <year>2014</year>
          .
          <article-title>Don't count, predict! a systematic comparison of context-counting vs. context-predicting semantic vectors</article-title>
          .
          <source>In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL)</source>
          , pages
          <fpage>238</fpage>
          -
          <lpage>247</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Dwight</given-names>
            <surname>Bolinger</surname>
          </string-name>
          .
          <year>1972</year>
          .
          <article-title>Degree words</article-title>
          . Walter de Gruyter.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Elia</given-names>
            <surname>Bruni</surname>
          </string-name>
          ,
          <string-name>
            <surname>Nam-Khanh Tran</surname>
            , and
            <given-names>Marco</given-names>
          </string-name>
          <string-name>
            <surname>Baroni</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Multimodal distributional semantics</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          ,
          <volume>49</volume>
          (
          <year>2014</year>
          ):
          <fpage>1</fpage>
          -
          <lpage>47</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Herbert H.</given-names>
            <surname>Clark</surname>
          </string-name>
          .
          <year>1974</year>
          .
          <article-title>Semantics and comprehension</article-title>
          . In Thomas A. Sebeok, editor,
          <source>Current trends in linguistics: Linguistics and adjacent arts and sciences,</source>
          volume
          <volume>12</volume>
          , pages
          <fpage>1291</fpage>
          -
          <lpage>1428</lpage>
          . Mouton.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Christiane</given-names>
            <surname>Fellbaum</surname>
          </string-name>
          .
          <year>1998</year>
          .
          <article-title>WordNet: An Electronic Lexical Database</article-title>
          . MIT press.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Tamar</given-names>
            <surname>Fraenkel</surname>
          </string-name>
          and
          <string-name>
            <given-names>Yaacov</given-names>
            <surname>Schul</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>The meaning of negated adjectives</article-title>
          .
          <source>Intercultural Pragmatics</source>
          ,
          <volume>5</volume>
          (
          <issue>4</issue>
          ):
          <fpage>517</fpage>
          -
          <lpage>540</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Rachel</given-names>
            <surname>Giora</surname>
          </string-name>
          , Noga Balaban, Ofer Fein, and
          <string-name>
            <given-names>Inbar</given-names>
            <surname>Alkabets</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Negation as positivity in disguise</article-title>
          .
          <source>In Albert N. Katz</source>
          and Herbert L. Colston, editors,
          <source>Figurative language comprehension: Social and cultural influences</source>
          , pages
          <fpage>233</fpage>
          -
          <lpage>258</lpage>
          . Lawrence Erlbaum Associates.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Rachel</given-names>
            <surname>Giora</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Anything negatives can do affirmatives can do just as well, except for some metaphors</article-title>
          .
          <source>Journal of Pragmatics</source>
          ,
          <volume>38</volume>
          (
          <issue>7</issue>
          ):
          <fpage>981</fpage>
          -
          <lpage>1014</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>H. Paul</given-names>
            <surname>Grice</surname>
          </string-name>
          .
          <year>1975</year>
          .
          <article-title>Logic and conversation</article-title>
          .
          <source>Syntax and Semantics</source>
          , pages
          <fpage>41</fpage>
          -
          <lpage>58</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Karl Moritz</surname>
            <given-names>Hermann</given-names>
          </string-name>
          , Edward Grefenstette, and
          <string-name>
            <given-names>Phil</given-names>
            <surname>Blunsom</surname>
          </string-name>
          .
          <year>2013</year>
          . “
          <article-title>Not not bad” is not “bad”: A distributional account of negation</article-title>
          .
          <source>In Proceedings of the 3rd Workshop on Continuous Vector Space Models and their Compositionality (CVSC)</source>
          , pages
          <fpage>74</fpage>
          -
          <lpage>82</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Laurence R.</given-names>
            <surname>Horn</surname>
          </string-name>
          .
          <year>1972</year>
          .
          <article-title>On the Semantic Properties of Logical Operators in English</article-title>
          . University of California, Los Angeles.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Laurence R.</given-names>
            <surname>Horn</surname>
          </string-name>
          .
          <year>1984</year>
          .
          <article-title>Toward a new taxonomy for pragmatic inference: Q-based and R-based implicature</article-title>
          . Meaning, form, and
          <article-title>use in context: Linguistic applications</article-title>
          , pages
          <fpage>11</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Laurence R.</given-names>
            <surname>Horn</surname>
          </string-name>
          .
          <year>1989</year>
          .
          <article-title>A natural history of negation</article-title>
          . University of Chicago Press.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Otto</given-names>
            <surname>Jespersen</surname>
          </string-name>
          .
          <year>1965</year>
          .
          <article-title>The philosophy of grammar</article-title>
          . University of Chicago Press.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Shrikant</given-names>
            <surname>Joshi</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Affixal negation: direct, indirect and their subtypes</article-title>
          .
          <source>Syntaxe et semantique</source>
          ,
          <volume>13</volume>
          (
          <issue>1</issue>
          ):
          <fpage>49</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Manfred</given-names>
            <surname>Krifka</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Negated antonyms: Creating and filling the gap</article-title>
          .
          <source>In Uli Sauerland and Penka Stateva</source>
          , editors,
          <source>Presupposition and implicature in compositional semantics</source>
          , pages
          <fpage>163</fpage>
          -
          <lpage>177</lpage>
          . Palgrave McMillan.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Germa´n Kruszewski</surname>
            , Denis Paperno, Raffaella Bernardi, and
            <given-names>Marco</given-names>
          </string-name>
          <string-name>
            <surname>Baroni</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>There is no logical negation here, but there are alternatives: Modeling conversational negation with distributional semantics</article-title>
          .
          <source>Computational Linguistics.</source>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Lenci</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Distributional semantics in linguistic and cognitive research</article-title>
          .
          <source>Italian journal of linguistics</source>
          ,
          <volume>20</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Kai Chen, Greg Corrado, and
          <string-name>
            <given-names>Jeffrey</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Efficient estimation of word representations in vector space</article-title>
          .
          <source>Proceedings of 2013 International Conference on Learning Representations (ILCR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Saif M Mohammad</surname>
            ,
            <given-names>Bonnie J Dorr</given-names>
          </string-name>
          , Graeme Hirst, and
          <string-name>
            <surname>Peter D Turney</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Computing lexical contrast</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>39</volume>
          (
          <issue>3</issue>
          ):
          <fpage>555</fpage>
          -
          <lpage>590</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Roser</given-names>
            <surname>Morante</surname>
          </string-name>
          and
          <string-name>
            <given-names>Caroline</given-names>
            <surname>Sporleder</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Modality and negation: An introduction to the special issue</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>38</volume>
          (
          <issue>2</issue>
          ):
          <fpage>223</fpage>
          -
          <lpage>260</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Lynne</given-names>
            <surname>Murphy</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Semantic relations and the lexicon: Antonymy, synonymy and other paradigms</article-title>
          . Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Kim</given-names>
            <surname>Anh</surname>
          </string-name>
          <string-name>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <source>Sabine Schulte im Walde, and Ngoc Thang Vu</source>
          .
          <year>2016</year>
          .
          <article-title>Integrating distributional lexical contrast into word embeddings for antonymsynonym distinction</article-title>
          .
          <source>In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL)</source>
          , pages
          <fpage>454</fpage>
          -
          <lpage>459</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Carita</given-names>
            <surname>Paradis</surname>
          </string-name>
          and
          <string-name>
            <given-names>Caroline</given-names>
            <surname>Willners</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Antonymy and negation-the boundedness hypothesis</article-title>
          .
          <source>Journal of pragmatics</source>
          ,
          <volume>38</volume>
          (
          <issue>7</issue>
          ):
          <fpage>1051</fpage>
          -
          <lpage>1080</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>Laura</given-names>
            <surname>Rimell</surname>
          </string-name>
          , Amandla Mabona, Luana Bulat, and
          <string-name>
            <given-names>Douwe</given-names>
            <surname>Kiela</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Learning to negate adjectives with bilinear models</article-title>
          .
          <source>In Proceedings of the 15th Annual Meeting of the European Chapter of the Association for Computational Linguistics (EACL)</source>
          , pages
          <fpage>71</fpage>
          -
          <lpage>78</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <given-names>Richard</given-names>
            <surname>Socher</surname>
          </string-name>
          , Brody Huval,
          <string-name>
            <given-names>Christopher D.</given-names>
            <surname>Manning</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Andrew Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Semantic compositionality through recursive matrix-vector spaces</article-title>
          .
          <source>In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)</source>
          , pages
          <fpage>1201</fpage>
          -
          <lpage>1211</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <given-names>Richard</given-names>
            <surname>Socher</surname>
          </string-name>
          , Alex Perelygin, Jean Y. Wu, Jason Chuang,
          <string-name>
            <surname>Christopher D Manning</surname>
            ,
            <given-names>Andrew Y.</given-names>
          </string-name>
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>Christopher</given-names>
          </string-name>
          <string-name>
            <surname>Potts</surname>
          </string-name>
          , et al.
          <year>2013</year>
          .
          <article-title>Recursive deep models for semantic compositionality over a sentiment treebank</article-title>
          .
          <source>In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP)</source>
          , pages
          <fpage>1631</fpage>
          -
          <lpage>1642</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <given-names>Michael</given-names>
            <surname>Henry Tessler</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Franke</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Not unreasonable: Carving vague dimensions with contraries and contradictions</article-title>
          .
          <source>In Proceedings of the 40th Annual Meeting of the Cognitive Science Society.</source>
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <given-names>Nghia</given-names>
            <surname>The</surname>
          </string-name>
          <string-name>
            <surname>Pham</surname>
          </string-name>
          , Angeliki Lazaridou, and
          <string-name>
            <given-names>Marco</given-names>
            <surname>Baroni</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>A multitask objective to inject lexical contrast into distributional semantics</article-title>
          .
          <source>In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and The 7th International Joint Conference of the Asian Federation of Natural Language Processing (ACL-IJCNLP)</source>
          , pages
          <fpage>21</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Peter D. Turney</surname>
            and
            <given-names>Patrick</given-names>
          </string-name>
          <string-name>
            <surname>Pantel</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>From frequency to meaning: Vector space models of semantics</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          ,
          <volume>37</volume>
          (
          <issue>1</issue>
          ):
          <fpage>141</fpage>
          -
          <lpage>188</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <surname>Chantal van Son</surname>
          </string-name>
          ,
          <source>Emiel van Miltenburg, and Roser Morante Vallejo</source>
          .
          <year>2016</year>
          .
          <article-title>Building a dictionary of affixal negations</article-title>
          .
          <source>In Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics.</source>
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <given-names>Michael</given-names>
            <surname>Wiegand</surname>
          </string-name>
          , Alexandra Balahur, Benjamin Roth, Dietrich Klakow, and Andre´s Montoyo.
          <year>2010</year>
          .
          <article-title>A survey on the role of negation in sentiment analysis</article-title>
          .
          <source>In Proceedings of the Workshop on Negation and Speculation in Natural Language Processing (NeSP-NLP)</source>
          , pages
          <fpage>60</fpage>
          -
          <lpage>68</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <given-names>Bryan</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          and
          <string-name>
            <given-names>Oates</given-names>
            <surname>Tim</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A gold standard for scalar adjectives</article-title>
          .
          <source>In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC).</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>