<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Exploring Semantic Orientation of Adverbs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Potemkin</string-name>
          <email>potemkin@philol.msu.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kedrova</string-name>
          <email>kedr@philol.msu.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Philological faculty, Moscow State University</institution>
          ,
          <addr-line>Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sentiment analysis often relies on a semantic orientation lexicon of positive and negative words. Determining the semantic orientation of words is necessary for correct estimation of the content of statements in the media, Internet, in the writings and speech. Qualitative adverbs expressing evaluation, intensity, direction of action are important as the modifiers of the main sentence predicate. In this paper we propose a method for extracting seed set of adverbs from a collection of pairs of antonym. A model based on the representation of a set of synonyms from the Russian lexicons as a graph, and determination the semantic orientation of the adverbs concerning three main dimensions of the semantic differential also demonstrated. The assessment of performance of the method in comparison with the dictionary data shows effectiveness of the method obtained.</p>
      </abstract>
      <kwd-group>
        <kwd />
        <kwd>Sentiment</kwd>
        <kwd>semantic differential</kwd>
        <kwd>antonyms</kwd>
        <kwd>seed set</kwd>
        <kwd>graph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Nowadays, the availability of resources for Natural Language Processing (NLP)
remains a hot topic, in particular for Russian especially due to the lack of
comprehensive semantic resources, despite efforts made to provide a freely-available Russian
WordNet [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Ability to establish relativity, similarity, or semantic distance between
words and concepts is the basis of computational linguistics. This paper deals with
measuring of distance within the syntactic category of adverbs. This set of words is
crucial for some applications because adverbs modify or clarify the meaning of other
words (verbs, nouns, adjectives). The adverbs are of particular interest to determine
the semantic orientation of syntagma containing a main word and its modifier
(adverb). Measuring the semantic distance or similarity between the English words most
often is based on WordNet [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and almost exclusively on taxonomic relationships
established in this database. So such approach is applicable only to the syntactic
categories of nouns and verbs.
      </p>
      <p>The aim of this paper is to extract a list of semantically oriented adverbs and
develop the measure of proximity based on dictionaries of synonyms. The article is
structured as follows. In Section 1 the problem of extracting the seed set of
semantically oriented adverbs from the lexicon of Russian antonyms is discussed. In Section
2 we describe the previously proposed measures of semantic distance between words,
as well as an elementary way to map synonyms onto a graph. In Section 3 the basic
characteristics of the subjective understanding of the meaning and the measures based
on the distance in a graph of synonyms are discussed. Finally, Section 4 presents
some results and conclusions. Additionally, we explore the use of visualization
techniques to gain insight into the results obtained.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Extracting the seed set of adverbs</title>
      <p>
        A number of approaches have been proposed for creating semantic orientation
lexicons in English, most of them are computationally expensive and rely on
significant manual annotation and large corpora. Particularly, the General Inquirer [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
created in the beginning of the last century is used as the gold standard for assessment
the quality of new-generated lexicons. For Russian language there is no open-source
and reliable lexicon with positively and negatively marked entries. We propose some
approaches to generate a broad coverage semantic orientation lexicon for Russian
adverbs which includes both individual words and multi-word adverbial expressions
using only dictionaries of antonyms and synonyms, requiring a small amount of
manual pruning and database processing.
      </p>
      <p>
        First of all we have analyzed a list of antonyms collected from published
dictionaries of antonyms [
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ]. This list contains 7300+ antonymous pairs (adjectives, nouns,
verbs, adverbs and prepositions as well). The semantically oriented words were
manually extracted from this list and arranged in 2 separate lists – positive (1859) and
negative (2229) words. This seed lexicon could be compared with the GI lexicon
which contains orientation labels for only about 3600 entries.
      </p>
      <p>
        Next step was to extend our seed lexicon to obtain a broad coverage of different
texts under consideration concerning sentiment analysis. Automatic approaches to
create (English) semantic orientation lexicon and, more generally, approaches for
word-level sentiment annotation can be grouped into two kinds: (1) those that rely on
manually created lexical resources—most of which use WordNet; and (2) those that
rely on text corpora [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. As a lexical source we use a structured list of Russian
synonyms collected from a number of published and Internet-available dictionaries such as
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and others (11 sources). List of synonyms contains ~600000 word-pairs including
~10000 pairs of adverbs. All synonyms {s(wi)} of each seed word wi receives the
same semantic orientation as wi. The number N of occurrences of a synonym s(wi) in
the extended set contributed by different seed-words wi, (i=1…N) indicates the
confidence of semantic orientation. After manual pruning we have got a list of positively
marked (5990, including 731 adverbs) and negatively marked (6853, including 592
adverbs) words. Since the most part of Russian adverbs could be derived as the short
form singular neutral or short form plural adjective (3135) the list of semantically
orientated adverbs could be expanded.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Measures of distance</title>
      <p>
        A number of distance or similarity measures exist for English based (completely or
partially) on WordNet. In particular, such measure is defined as the number of edges
of the path through the taxonomic relations (IS-A, Part-of, or WordNet’s hyponymy
relation). In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] the concept of bond length was extended for all relations in WordNet
by their clustering in the horizontal (synonyms) or vertical (hyponymy) direction and
assigning a penalty for changing the direction of the path motion. Overview of five
measures and evaluation of their effectiveness using the associations between the
words is given in [9]. Exclusive usage of hyponymy delimits the measure of distance
or similarity only to the syntactic categories of nouns and verbs, as hyponymy
relations in WordNet are established only for these grammatical categories. Therefore,
such measures could not be applied to adjectives and adverbs.
      </p>
      <p>The semantic distance between the words could be determined in the similar way
as the definition adopted in graph theory [10]. The simplest approach is just to gather
all the words from the Dictionary of synonyms and to link each member of a
synonymous group with its dominant word as indicated in the Dictionary. Let G(W,S) be the
undirected graph, with W the set of nodes being all the words from the Dictionary
with associated part-of-speech, S - the set of edges connecting each member of
synonymous group with its dominant word. Every group of synonymous words could be
connected to each other and form a clique in G graph. A path P is the sequence of
nodes connected by edges of G and geodesic is the shortest path between two nodes.
Geodesic distance, D(wi,wj) between two words wi and wj is the length (number of
edges) of the shortest path between wi and wj. If there is no path between wi and wj,
the distance between them is infinity. The minimal path-length defines a metric on the
set of synonyms. All axioms of the metric space are fulfilled in this case. Usually
synonymous groups comprises the words of the same grammatical category and entire
graph G is decomposed into disjoint sub-graphs or networks for nouns, verbs,
adjectives and adverbs. (Fig. 1). In each network exists a maximal connected component
that contains 70-90% of all nodes of the graph constructed from the Dictionary of
synonyms. Maximum component in the class of Russian adverbs contains about 8500
words. The words in this connected component could be analyzed using the metric
defined by the length of geodesics.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Semantic orientation of adverbs</title>
      <p>Classical work on the measurement of emotional or affective values in texts is the
theory of semantic differential by Charles Osgood. Word meaning in cognitive
psychology, is “a strictly psychological one: those cognitive states of human language
users which are necessary antecedent conditions for selective encoding of lexical
signs and necessary subsequent conditions in selective decoding of signs in
messages.” [11]. Semantic differential method was applied mainly to the adjectives
measured in such dimensions as active/passive, good/bad, positive/negative, beautiful/ugly,
etc. Each pair of bipolar adjectives is a factor or an axis in the method of semantic
differential. Application of factor analysis to extensive empirical material gave an
unexpected result: most of the variance in judgment could be explained by only three
major factors including the evaluative factor (e.g., positive/negative); the potency
factor (e.g., strong/weak); and the activity factor (e.g., active/passive). Among these
three factors, the evaluative factor has the strongest relative weight for determining
the semantic orientation.</p>
      <p>Turning to the selected Russian adverbs, we note that the vast majority of adverbs
is matched with the words of other parts of speech primarily with the adjectives
(cheerful – cheerfully // бодрый - бодро, brutal – brutally // жестокий - жестоко), so
that the semantic differential can be naturally extended to motivated adverbs, which
bear semantic meaning and, accordingly, deliver the information on their semantic
orientation. All three pairs of bipolar adverbs negatively/positively (плохо/хорошо);
weakly/strongly (слабо/сильно), passively/actively (пассивно/активно) are
contained in the maximal component of the sub-graph of synonymous adverbs Gadv. One
can assume that the distance to positively (хорошо) is a measure of positive
assessment of an adverb. However, it is easy to show that this measure is in fact rather
controversial.</p>
      <p>A striking example of this is that the words positively (хорошо) and negatively
(плохо) are closely related through the path of synonyms. There is a sequence of only
5 words in English (negatively, hardly, tightly, thoroughly, comprehensively, soundly,
positively), and 6 words in Russian (плохо, дешево, легко, просто, совсем, очень,
здорово, хорошо) – see Fig. 1 – connecting opposites, each pair of words in this
sequence is certainly synonymous (at least in one of their meanings). Thus, we find that
d(positively, negatively) = 6; d(хорошо, плохо) = 7. Despite the fact that the adverb
positively (хорошо) and negatively (плохо) have opposite meanings, they are closely
related by synonymy path. Of course, this is not due to any error in the Dictionary of
synonyms. Partial explanation lies in the wide use of two Russian adverbs хорошо
(625 ipm), плохо (187 ipm) [12]. Due to the fact that both words are members of the
maximum connected component of Gadv sub-graph, we can consider not only the
shortest distance from any adverb to "positively", but the shortest distance to its
antonym, "negatively". This idea is concretized [13] in the definition of EVA function,
which allows to measure the relative distance from the word of two opposites,
"positively" and "negatively":</p>
      <p>EVA (w) = (d (w, neg)-d (w, pos)) / d (neg, pos).</p>
      <p>
        Under the assumption that there is no word "worse than negatively" or "better than
positively" the values of EVA lie in the interval [
        <xref ref-type="bibr" rid="ref1">-1,1</xref>
        ], for example, the word
"honestly" is evaluated by function EVA (honestly) gives a value of 1 as follows EVA
(honestly) = (d (honestly, neg) - d (honestly, pos)) / d (pos, neg) = (8-2) / 6 = 1,. The
measures for other Osgood’s dimensions is defined similarly. For the potency factor
the function: POT (w) = (d (w, weakly)-d (w, strongly)) / d (strongly, weakly) is
defined; for the activity factor the function: ACT (w) = (d (w, passively)-d (w, actively))
/ d (actively, passively) is defined. This fact allows to define measures for any two
words belonging to the maximal connected component of the adverbs subgraph.
      </p>
      <p>An assumption on the boundary position of words negatively/positively is not
entirely justified. Intuitively, perfectly (превосходно) is better than positively,
disgustingly (отвратительно) is worse than negatively. Bearing this in mind and using the
geometry of a triangle with vertices {w, pos, neg}, we redefine the function of EVA,
namely:</p>
      <p>EVA1(w)=(d(w, neg)-d(w, pos))*(d(w, neg)+d(w, pos))/d2(neg, pos).</p>
      <p>
        The values of EVA1 sometimes are beyond the interval [
        <xref ref-type="bibr" rid="ref1">-1,1</xref>
        ]. Similarly, we can
redesign POT(w) and ACT(w).
      </p>
      <p>For English adjectives (and motivated adverbs) there exists the source for assessing
the measure constructed above in comparison with the independently obtained
answers to the "General Inquirer” [11], which contains a set of words to assess three
Osgood’s factors. Word lists were obtained from the Stanford political dictionary,
where each of the 3000 most frequent common words were assessed by three or more
experts concerning each Osgood’s factor. Thus 765 positive and 873 negative words
for the assessment factor were obtained, 1474 strong and 647 weak word for the
potency factor and 1568 active and 732 passive words for activity factor. Comparison of
results obtained with the General Inquirer gave the values of 70 - 80% of matches,
depending on what words were considered as neutral in terms of EVA function.</p>
      <p>In the absence of available data for content analysis we used the Russian
dictionaries of antonyms as an independent source. Antonymous pair is a pair of words (or
rather, the specific meanings of words), one opposed to the other on semantic
grounds, such as hot - cold fast - slow, present - absent. We suggest that adverbs
belonging to pair of antonyms lie on the "opposite sides" of the entire set of adverbs.
Methods of multidimensional scaling deliver a mapping of multidimensional space
with the defined distance between individual points d(wi, wj) onto a space of smaller
dimension, namely the plane (Fig. 2). Figure 2 a, b shows that the pairs of antonyms
lie near the diameters of the set of adverbs. For a more profound study of the structure
of the space of adverbs we have constructed chains of synonyms connecting
antonyms pairs within the sub-graph Gadv.</p>
      <p>Fig. 2. Two chains of synonyms, joining antonymous pairs of adverbs.</p>
      <p>a) up – a consistent path; b) down – an inconsistent result.</p>
      <p>Chain in Fig. 2a is a consistent result, i.e. the chain of synonyms passes on the
periphery of the set of adverbs and the distances between the synonyms do not exceed
the distance between the antonyms. Unfortunately, the situation is not always as
favorable. In Fig. 2b pair of antonyms is close to the diameter, but the chain of
synonyms is not at the periphery of the set, but lays in the central part of the set, alternates
its direction, and the distances between synonyms is often greater than the distance
between antonyms. Probably it is necessary to determine more accurate distance
between the words and to choose correctly the axes of the adverb space using the
principal components method. These new axes should not coincide Osgood’s dimensions.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Discussion and conclusions</title>
      <p>In this paper we define a measure of the distance between adverbs using synonyms
graph. It seems obvious that the choice of similarity measure, or distance largely
depends on the type of the problem. The choice of distance measure on the grounds of
synonyms is connected with the goal of determining the semantic orientation of
adverbs. In contrast to Osgood’s semantic differential associated with the reaction of
people on the stimulus - words presented, or the possible emotional impact of words,
this model is based solely on the lexical material and is intended to represent
relatively objective meanings which are fixed in Dictionaries. Further studies will
determine the semantic orientation of sentences or the whole text on the basis of the
orientation of its constituent words. Our method allows to evaluate other classes of words
such as nouns, adjectives and verbs, but this extension will require a significant
increase of calculations and special methods for processing large data sets, since an
algorithm for computing shortest paths requires O(n3) operations, where n is the
number of words in graph G(W,S).
9. Budanitsky, A., Hirst, G. Semantic distance in WordNet: An experimental,
applicationoriented evaluation of five measures. In: Workshop on WordNet and Other Lexical
Resources. Second meeting of the NAACL, Pittsburgh, (2001)
10. Potemkin, S.B. Semantic Distance over Lexical Database and WordNet In: Proceedings of
10 International Conference «Cognitive Modeling in Linguistics », CML, Montenegro
Bechichi (2008).
11. Osgood, C.E., Succi, G.J., Tannenbaum P.H., The Measurement of Meaning.. University
of Illinois Press, Urbana IL (1957).
12. Sharov, S. Frequency Dictionary of Russian</p>
      <p>http://www.artint.ru/projects/frqlist.asp (2003)
13. Kamps, J., Marx, M., Robert, J., Mokken, M. Using WordNet to Measure Semantic
Orientations of Adjectives In: Proceedings of the 4th International Conference on Language
Resources and Evaluation (LREC'04, Vol. IV ) pp. 1115-1118. (2004)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Azarova</surname>
            ,
            <given-names>I.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sinopalnikova</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Javorsky</surname>
            ,
            <given-names>M.V.</given-names>
          </string-name>
          <article-title>Principles of construction of WordNetthesaurus RussNet (in Russian) In: Computer linguistics and intellectual technologies</article-title>
          .
          <source>- Proceedings of the International conference Dialogue'2004</source>
          pp.
          <fpage>542</fpage>
          −
          <lpage>547</lpage>
          .
          <string-name>
            <surname>Мoscow</surname>
          </string-name>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Fellbaum</surname>
          </string-name>
          , C. (ed.),
          <source>WordNet: An Electronic</source>
          Lexical Database // Language, Speech, and Communication Series. The MIT Press, Cambridge MA (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Stone</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          <article-title>Thematic text analysis: new agendas for analyzing text content</article-title>
          . In: C. Roberts (ed.),
          <article-title>Text Analysis for the Social Sciences</article-title>
          . Lawrence Erlbaum Associates, Mahwah NJ (
          <year>1997</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Vvedenskaia</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          <article-title>Dictionary of Russian Antonyms (in Russian) OOO «AST Publishing House »</article-title>
          , Moscow (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lvov</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          <article-title>Dictionary of Russian Antonyms (in Russian)</article-title>
          “Press-Kniga”
          <article-title>(</article-title>
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S</given-names>
          </string-name>
          et.
          <source>al Generating High-Coverage Semantic Orientation Lexicons From Overtly Marked Words and a Thesaurus In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)</source>
          ,
          <source>Singapore</source>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Alexandrova</surname>
            ,
            <given-names>Z.E.</given-names>
          </string-name>
          <article-title>Dictionary of Russian Synonyms (in Russian) “Russkii Iazyk-</article-title>
          <string-name>
            <surname>Media</surname>
          </string-name>
          , Moscow (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Hirst</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>St-Onge</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <article-title>Lexical Chains Representations of Context for the Detection and Correction of Malapropisms"</article-title>
          . In Fellbaum, C. (ed.)
          <source>WordNet. An Electronic Lexical Database</source>
          , The MIT Press, (
          <year>1998</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>