<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Adapting Sentiment Lexicons using Contextual Semantics for Sentiment Analysis of Twitter</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hassan Saif</string-name>
          <email>h.saif@open.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yulan He</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miriam Fernandez</string-name>
          <email>m.fernandez@open.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Harith Alani</string-name>
          <email>h.alani@open.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Knowledge Media Institute, The Open University</institution>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Engineering and Applied Science, Aston University</institution>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sentiment lexicons for sentiment analysis offer a simple, yet effective way to obtain the prior sentiment information of opinionated words in texts. However, words' sentiment orientations and strengths often change throughout various contexts in which the words appear. In this paper, we propose a lexicon adaptation approach that uses the contextual semantics of words to capture their contexts in tweet messages and update their prior sentiment orientations and/or strengths accordingly. We evaluate our approach on one state-of-the-art sentiment lexicon using three different Twitter datasets. Results show that the sentiment lexicons adapted by our approach outperform the original lexicon in accuracy and F-measure in two datasets, but give similar accuracy and slightly lower F-measure in one dataset.</p>
      </abstract>
      <kwd-group>
        <kwd>Sentiment Analysis</kwd>
        <kwd>Semantics</kwd>
        <kwd>Lexicon Adaptation</kwd>
        <kwd>Twitter</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Sentiment analysis on Twitter has been attracting much attention recently due to the
rapid growth in Twitter’s popularity as a platform for people to express their opinions
and attitudes towards a great variety of topics. Most existing approaches to Twitter
sentiment analysis can be categorised into machine learning [
        <xref ref-type="bibr" rid="ref11 ref13 ref7">7, 11, 13</xref>
        ] and
lexiconbased approaches [
        <xref ref-type="bibr" rid="ref15 ref2 ref6 ref8">2, 8, 15, 6</xref>
        ].
      </p>
      <p>
        Lexicon-based approaches use lexicons of words weighted with their sentiment
orientations to determine the overall sentiment in texts. These approaches have shown
to be more applicable to Twitter data than machine learning approaches, since they do
not require training from labelled data and therefore, they offer a domain-independent
sentiment detection [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Nonetheless, lexicon-based approaches are limited by the
sentiment lexicon used [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Firstly, because sentiment lexicons are composed by a
generally static set of words that do not cover the wide variety of new terms that
constantly emerge in the social web. Secondly, because words in the lexicons have fixed
prior sentiment orientations, i.e. each term has always the same associated sentiment
orientation independently of the context in which the term is used.
      </p>
      <p>
        To overcome the above limitations, several lexicon bootstrapping and adaptation
methods have been previously proposed. However, these methods are either supervised
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], i.e., they require training from human-coded corpora, or based on studying the
statistical, syntactical or linguistic relations between words in general textual corpora
(e.g., The Web) [
        <xref ref-type="bibr" rid="ref17 ref19">17, 19</xref>
        ] or in static lexical knowledge sources (e.g., WordNet) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
ignoring, therefore, the specific textual context in which the words appear. In many
cases, however, the sentiment of a word is implicitly associated with the semantics of its
context [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>In this paper we propose an unsupervised approach for adapting sentiment lexicons
based on the contextual semantics of their words in a tweet corpus. In particular, our
approach studies the co-occurrences between words to capture their contexts in tweets
and update their prior sentiment orientations and/or sentiment strengths in a given lexicon
accordingly.</p>
      <p>
        As a case study we apply our approach on Thelwall-Lexicon [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], which, to our
knowledge, is the state-of-the-art sentiment lexicon for social data. We evaluate the
adapted lexicons by performing a lexicon-based polarity sentiment detection (positive vs.
negative) on three Twitter datasets. Our results show that the adapted lexicons produce
a significant improvement in the sentiment detection accuracy and F-measure in two
datasets but gives a slightly lower F-measure in one dataset.
      </p>
      <p>In the rest of this paper, related work is discussed in Section 2, and our approach is
presented in Sections 3. Experiments and results are presented in Sections 4. Discussion
and future work are covered in Section 5. Finally, we conclude our work in Section 6.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Exiting approaches to bootstrapping and adapting sentiment lexicons can be categorised
into dictionary and corpus-based approaches. The dictionary-based approach [
        <xref ref-type="bibr" rid="ref14 ref5">5, 14</xref>
        ] starts
with a small set of general opinionated words (e.g., good, bad) and lexical knowledge base
(e.g., WordNet). After that, the approach expands this set by searching the knowledge
base for words that have lexical or linguistic relations to the opinionated words in the
initial set (e.g., synonyms, glosses, etc).
      </p>
      <p>
        Alternatively, the corpus-based approach measures the sentiment orientation of
words automatically based on their association to other strongly opinionated words in a
given corpus [
        <xref ref-type="bibr" rid="ref14 ref17 ref19">17, 14, 19</xref>
        ]. For example, Turney and Littman [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] used Pointwise Mutual
Information (PMI) to measure the statistical correlation between a given word and a
balanced set of 14 positive and negative paradigm words (e.g., good, nice, nasty, poor).
Although this work does not require large lexical input knowledge, its identification
speed is very limited [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] because it uses web search engines in order to retrieve the
relative co-occurrences of words.
      </p>
      <p>
        Following the aforementioned approaches, several lexicons such as MPQA [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
and SentiWordNet [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] have been induced and successfully used for sentiment analysis
on conventional text (e.g., movie review data). However, on Twitter these lexicons are
not as compatible due to their limited coverage of Twitter-specific expressions, such as
abbreviations and colloquial words (e.g, “looov”, “luv”, “gr8”) that are often found
in tweets.
      </p>
      <p>
        Quite few sentiment lexicons have been recently built to work specifically with social
media data, such as Thelwall-Lexicon [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and Nielsen-Lexicon [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. These lexicons have
proven to work effectively on Twitter data. Nevertheless, such lexicons are similar to
other traditional ones, in the sense that they all offer fixed and context-insensitive
wordsentiment orientations and strengths. Although a training algorithm has been proposed
to update the sentiment of terms in Thelwall-Lexicon[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], it requires to be trained from
human-coded corpora, which is labour-intensive to obtain.
      </p>
      <p>Aiming at addressing the above limitations we have designed our lexicon-adaptation
approach in away that allows to (i) work in unsupervised fashion, avoiding the need for
labelled data, and (ii) exploit the contextual semantics of words. This allows capturing
their contextual information in tweets and update their prior sentiment orientation and
strength in a given sentiment lexicon accordingly.
3</p>
    </sec>
    <sec id="sec-3">
      <title>A Contextual Semantic Approach to Lexicon Adaptation</title>
      <p>
        The main principle behind our approach is that the
sentiment of a term is not static, as found in general-purpose Tweets Extract
wsehnitcimhethnet lteexrmicoinssu,sbeudt, ria.eth.,eirt ddeeppeennddss oonn tihtse ccoonntteexxttuianl SeLenxtiimcoennt CSoenntteimxteunatl
semantics.3 Therefore, our approach functions in two main
steps as shown in Figure 1. First, given a tweet collection Rule-based Lexicon Adaptation
and a sentiment lexicon, the approach builds a contextual
semantic representation for each unique term in the tweet Adapted Lexicon
collection and subsequently uses it to derive the term’s
contextual sentiment orientation and strength. The SentiCircle Fig. 1. The systematic
workrepresentation model is used to this end [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Secondly, flow of our proposed lexicon
rule-based algorithm is applied to amend the prior senti- adaptation approach.
ment of terms in the lexicon based on their corresponding
contextual sentiment. Both steps are further detailed in the following subsections.
3.1
      </p>
      <sec id="sec-3-1">
        <title>Capturing Contextual Semantics and Sentiment</title>
        <p>
          The first step in our pipeline is to capture the words contextual semantics and sentiment
in tweets. To this end, we use our previously proposed semantic representation model,
SentiCircle [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>
          Following the distributional hypothesis that words that co-occur in similar contexts
tend to have similar meaning [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], SentiCircle extracts the contextual semantics of
a word from its co-occurrence patterns with other words in a given tweet collection.
These patterns are then represented as a geometric circle, which is subsequently used
to compute the contextual sentiment of the word by applying simple trigonometric
identities on it. In particular, for each unique term m in a tweet collection, we build
a two-dimensional geometric circle, where the term m is situated in the centre of the
circle, and each point around it represents a context term ci (i.e., a term that occurs with
m in the same context). The position of ci, as illustrated in Figure 2, is defined jointly
by its Cartesian coordinates xi, yi as:
xi = ri cos(✓ i ⇤ ⇡ )
yi = ri sin(✓ i ⇤ ⇡ )
Where ✓ i is the polar angle of the context term ci and its value equals to the prior
sentiment of ci in a sentiment lexicon before adaptation, ri is the radius of ci and its
value represents the degree of correlation (tdoc) between ci and m, and can be computed
as:
ri = tdoc(m, ci) = f (ci, m) ⇥ log
        </p>
        <p>N
Nci
3 We define context as a textual corpus or a set of tweets.
where f (ci, m) is the number of times ci occurs with m in tweets, N is the total
number of terms, and Nci is the total number of terms that occur with ci. Note that
all terms’ radii in the SentiCircle are normalised. Also, all angles’ values are in radian.
tToheentcroigdoentohmeectornictepxrtoupaelrsteiemsaonftitchseoSfeantteiCrmircalse saelnlotiwmseunst Very Positive Y +1 Positive
toirmieennttaotifotnheantedrmse,nit.iem.,eanpt osstriteinvgetyh.vYal-uaexidsedneofitenseas pthoesistievne- yi ri Ci
ssetrnetni mgtehnotfanthdevtiecremv.eTrshae. TsmhealXle-ratxhise dxevfianleuset,htehesesntrtoimngeenrt -1 m θi xi +1 X
the sentiment.4 This, in turn, divides the circle into four
sentiment quadrants. Terms in the two upper quadrants have
a positive sentiment (sin ✓ &gt; 0), with upper left quadrant Very Negative -1 Negative
representing stronger positive sentiment since it has larger rθii==TPDriOorC_(SCei)ntiment (Ci)
angle values than those in the top right quadrant. Simi- Fig. 2. SentiCircle of a term
larly, terms in the two lower quadrants have negative sen- m. Neutral region is shaded
timent values (sin ✓ &lt; 0). Moreover, a small region called in blue.
the “Neutral Region” can be defined. This region, as shown
in Figure 2, is located very close to X-axis in the “Positive” and the “Negative” quadrants
only, where terms lie in this region have very weak sentiment (i.e, |✓ | t 0).
Calculating Contextual Sentiment In summary, the Senti-Circle of a term m is
composed by the set of (x, y) Cartesian coordinates of all the context terms of m. An effective
way to compute the overall sentiment of m is by calculating the geometric median of all
tChierlpcoein⌦ts, itnheit2sDSegnetioCmirectlrei.cFmoremdiaalnlyg,fiosrdaegfiinveedn saest: ogf=n paorignmts i(npg12 , Rp22, P...n,pn) in a
Sentii=1 k|pi g||2.</p>
        <p>We call the geometric median g the SentiMedian as its position in the SentiCircle
determines the final contextual-sentiment orientation and strength of m.</p>
        <p>
          Note that the boundaries of the neutral region can be computed by measuring the
density distribution of terms in the SentiCircle along the Y-axis. In this paper we use
similar boundaries to the ones used in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] since we use the same evaluation datasets.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Lexicon Adaptation</title>
        <p>
          The second step in our approach is to update the sentiment lexicon with the terms’
contextual sentiment information extracted in the previous step. As mentioned earlier, in
this work we use Thelwall-Lexicon [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] as a case study. Therefore, in this section we
first describe this lexicon and its properties, and then introduce our proposed adaptation
method.
        </p>
        <p>
          Thelwall-Lexicon consists of 2546 terms coupled with integer values between -5 (very
negative) and +5 (very positive). Based on the terms’ prior sentiment orientations and
strengths (SOS), we group them into three subsets of 1919 negative terms (SOS2 [-2,-5]),
398 positive terms (SOS2 [
          <xref ref-type="bibr" rid="ref2 ref5">2,5</xref>
          ]) and 229 neutral terms (SOS2{ -1,1}).
        </p>
        <p>The adaptation method uses a set of antecedent-consequent rules that decides how the
prior sentiment of the terms in Thelwall-Lexicon should be updated according to the
positions of their SentiMedians (i.e., their contextual sentiment). In particular, for a term
m, the method checks (i) its prior SOS value in Thelwall-Lexicon and (ii) the SentiCircle
4 This is because cos ✓ &lt; 0 for large angles.
quadrant in which the SentiMedian of m resides. The method subsequently chooses the
best-matching rule to update the term’s prior sentiment and/or strength.</p>
        <p>Table 1 shows the complete list of rules in the proposed method. As noted, these rules
are divided into updating rules, i.e., rules for updating the existing terms in
ThelwallLexicon, and expanding rules, i.e., rules for expanding the lexicon with new terms. The
updating rules are further divided into rules that deal with terms that have similar prior
and contextual sentiment orientations (i.e., both positive or negative), and rules that deal
with terms that have different prior and contextual sentiment orientations (i.e., negative
prior, positive contextual sentiment and vice versa).</p>
        <p>Although they look complicated, the notion behind the proposed rules is rather simple:
Check how strong the contextual sentiment is and how weak the prior sentiment is !
update the sentiment orientation and strength accordingly. The strength of the contextual
sentiment can be determined based on the sentiment quadrant of the SentiMedian of m,
i.e., the contextual sentiment is strong if the SentiMedian resides in the “Very Positive”
or “Very Negative” quadrants (See Figure 2). On the other hand, the prior sentiment of
m (i.e., priorm) in Thelwall-Lexicon is weak if |priorm| 6 3 and strong otherwise.</p>
        <p>Updating Rules (Similar Sentiment Orientations)
Id Antecedents Consequent
1 (|prior| 6 3) ^ (SentiMedian 2 / StrongQuadrant) |prior| = |prior| + 1
2 (|prior| 6 3) ^ (SentiMedian 2 StrongQuadrant) |prior| = |prior| + 2
3 (|prior| &gt; 3) ^ (SentiMedian 2 / StrongQuadrant) |prior| = |prior| + 1
4 (|prior| &gt; 3) ^ (SentiMedian 2 StrongQuadrant) |prior| = |prior| + 1</p>
        <p>Updating Rules (Different Sentiment Orientations)
5 (|prior| 6 3) ^ (SentiMedian 2 / StrongQuadrant) |prior| = 1
6 (|prior| 6 3) ^ (SentiMedian 2 StrongQuadrant) prior = prior
7 (|prior| &gt; 3) ^ (SentiMedian 2 / StrongQuadrant) |prior| = |prior| 1
8 (|prior| &gt; 3) ^ (SentiMedian 2 StrongQuadrant) prior = prior
9 (|prior| &gt; 3) ^ (SentiMedian 2 N eutralRegion) |prior| = |prior| 1
10 (|prior| 6 3) ^ (SentiMedian 2 N eutralRegion) |prior| = 1</p>
        <p>Expanding Rules
11 SentiMedian 2 N eutralRegion (|contextual| = 1) ^ AddT erm
12 SentiMedian 2 / StrongQuadrant (|contextual| = 3) ^ AddT erm
13 SentiMedian 2 StrongQuadrant (|contextual| = 5) ^ AddT erm
Table 1. Adaptation rules for Thelwall-Lexicon, where prior: prior sentiment value,
StrongQuadrant: very negative/positive quadrant in the SentiCircle, Add: add the term to
Thelwall-Lexicon.</p>
        <p>For example, the word “revolution” in Thelwall-Lexicon has a weak negative
sentiment (prior=-2) while it has a neutral contextual sentiment since its SentiMedian
resides in the neutral region (SentiM edian 2 N eutralRegion). Therefore, rule number
10 is applied and the term’s prior sentiment in Thelwall lexicon will be updated to neutral
(|prior| = 1). In another example, the words “Obama” and “Independence” are not
covered by the Thelwall-Lexicon, and therefore, they have no prior sentiment. However,
their SentiMedians reside in the “Positive” quadrant in their SentiCircles, and therefore
rule number 12 is applied and both terms will be assigned with a positive sentiment
strength of 3 and added to the lexicon consequently.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Evaluation Results</title>
      <p>
        We evaluate our approach on Thelwall-Lexicon using three adaptation settings: (i) the
update setting where we update the prior sentiment of existing terms in the lexicon, (ii)
The expand setting where we expand Thelwall-Lexicon with new opinionated terms, and
(iii) the update+expand setting where we try both aforementioned settings together. To
this end, we use three Twitter datasets OMD, HCR and STS-Gold. Numbers of positive
and negative tweets within these datasets are summarised in Table 2, and detailed in the
references added in the table. To evaluate the adapted lexicons under the above settings,
we perform binary polarity classification on the three datasets. To this end, we use the
sentiment detection method proposed with Thelwall-Lexicon [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. According to this
method a tweet is considered as positive if its aggregated positive sentiment strength is
1.5 times higher than the aggregated negative one, and negative vice versa.
      </p>
      <p>Applying our adaptation approach to Thelwall-Lexicon results in dramatic changes
in it. Table 3 shows the percentage of words in the three datasets that were found in
Thelwall-Lexicon with their sentiment changed after adaptation. One can notice that on
average 9.61% of the words in our datasets were found in the lexicon. However, updating
the lexicon with the contextual sentiment of words resulted in 33.82% of these words
flipping their sentiment orientation and 62.94% changing their sentiment strength while
keeping their prior sentiment orientation. Only 3.24% of the words in Thelwall-Lexicon
remained untouched. Moreover, 21.37% of words previously unseen in the lexicon were
assigned with contextual sentiment by our approach and added to Thelwall-Lexicon
subsequently.</p>
      <p>Table 4 shows the average results of binary sentiment classification performed on
our datasets using (i) the original Thelwall-Lexicon (Original), (ii) Thelwall-Lexicon
induced under the update setting (Updated), and (iii) Thelwall-Lexicon induced under
the update+expand setting.5 The table reports the results in accuracy and three sets of
precision (P), recall (R), and F-measure (F1), one for positive sentiment detection, one
for negative, and one for the average of the two.</p>
      <p>From these results in Table 4, we notice that the best classification performance in
accuracy and F1 is obtained on the STS-Gold dataset regardless the lexicon being used.
We also observe that the negative sentiment detection performance is always higher than
the positive detection performance for all datasets and lexicons.</p>
      <p>As for different lexicons, we notice that on OMD and STS-Gold the adapted lexicons
outperform the original lexicon in both accuracy and F-measure. For example, on OMD
the adapted lexicon shows an average improvement of 2.46% and 4.51% in accuracy and
F1 respectively over the original lexicon. On STS-Gold the performance improvement is
5 Note that in this work we do not report the results obtained under the expand setting since no
improvement was observed comparing to the other two settings.
We demonstrated the value of using contextual semantics of words for adapting
sentiment lexicons from tweets. Specifically, we used Thelwall-Lexicon as a case study and
evaluated its adaptation to three datasets of different sizes. Although the potential is
palpable, our results were not conclusive, where a performance drop was observed in the
HCR dataset using our adapted lexicons. Our initial observations suggest that the quality
of our approach might be dependent on the sentiment class distribution in the dataset.
Therefore, a deeper investigation in this direction is required.</p>
      <p>We used the SentiCircle approach to extract the contextual semantics of words from
tweets. In future work we will try other contextual semantic approaches and study how
the semantic extraction quality affects the adaptation performance.</p>
      <p>Our adaptation rules in this paper are specific to Thelwall-Lexicon. These rules,
however, can be generalized to other lexicons, which constitutes another future direction
of this work.</p>
      <p>All words which have contextual sentiment were used for adaptation. Nevertheless,
the results conveyed that the prior sentiments in the lexicon might need to be unchanged
for words of specific syntactical or linguistic properties in tweets. Part of our future work
is to detect and filter those words that are more likely to have stable sentiment regardless
the contexts in which they appear.
6</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>In this paper we proposed an unsupervised approach for sentiment lexicon
adaptation from Twitter data. Our approach extracts the contextual semantics of words and
uses them to update the words’ prior sentiment orientations and/or strength in a given
sentiment lexicon. The evaluation was done on Thelwall-Lexicon using three Twitter
datasets. Results showed that lexicons adapted by our approach improved the sentiment
classification performance in both accuracy and F1 in two out of three datasets.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgment</title>
      <p>This work was supported by the EU-FP7 project SENSE4US (grant no. 611242).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baccianella</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Esuli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sebastiani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining</article-title>
          .
          <source>In: Seventh conference on International Language Resources and Evaluation</source>
          , Malta. Retrieved May. Valletta,
          <string-name>
            <surname>Malta</surname>
          </string-name>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bollen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zeng</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Twitter mood predicts the stock market</article-title>
          .
          <source>Journal of Computational Science</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cambria</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>An introduction to concept-level sentiment analysis</article-title>
          .
          <source>In: Advances in Soft Computing and Its Applications</source>
          , pp.
          <fpage>478</fpage>
          -
          <lpage>483</lpage>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Diakopoulos</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shamma</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Characterizing debate performance via aggregated twitter sentiment</article-title>
          .
          <source>In: Proc. 28th Int</source>
          .
          <article-title>Conf. on Human factors in computing systems</article-title>
          .
          <source>ACM</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Esuli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sebastiani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Determining term subjectivity and term orientation for opinion mining</article-title>
          .
          <source>In: EACL</source>
          . vol.
          <volume>6</volume>
          , p.
          <year>2006</year>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Liu, H.:
          <article-title>Unsupervised sentiment analysis with emotional signals</article-title>
          .
          <source>In: Proceedings of the 22nd World Wide Web conf</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Kouloumpis</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , Wilson,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Moore</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          :
          <article-title>Twitter sentiment analysis: The good the bad and the omg!</article-title>
          <source>In: Proceedings of the ICWSM. Barcelona</source>
          ,
          <string-name>
            <surname>Spain</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Nielsen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <article-title>A˚.: A new anew: Evaluation of a word list for sentiment analysis in microblogs</article-title>
          .
          <source>arXiv preprint arXiv:1103.2903</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Saif</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alani</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Evaluation datasets for twitter sentiment analysis a survey and a new dataset, the sts-gold</article-title>
          .
          <source>In: Proceedings, 1st ESSEM Workshop</source>
          . Turin,
          <string-name>
            <surname>Italy</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Saif</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alani</surname>
          </string-name>
          , H.:
          <article-title>Senticircles for contextual and conceptual semantic sentiment analysis of twitter</article-title>
          .
          <source>In: Proc. 11th Extended Semantic Web Conf. (ESWC)</source>
          . Crete,
          <string-name>
            <surname>Greece</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Saif</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alani</surname>
          </string-name>
          , H.:
          <article-title>Semantic sentiment analysis of twitter</article-title>
          .
          <source>In: Proc. 11th Int. Semantic Web Conf. (ISWC)</source>
          . Boston, MA (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Speriosu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sudan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Upadhyay</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baldridge</surname>
          </string-name>
          , J.:
          <article-title>Twitter polarity classification with label propagation over lexical links and the follower graph</article-title>
          .
          <source>In: Proceedings of the EMNLP First workshop on Unsupervised Learning in NLP. Edinburgh</source>
          ,
          <string-name>
            <surname>Scotland</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Suttles</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ide</surname>
          </string-name>
          , N.:
          <article-title>Distant supervision for emotion classification with discrete binary values</article-title>
          .
          <source>In: Computational Linguistics and Intelligent Text Processing</source>
          , pp.
          <fpage>121</fpage>
          -
          <lpage>136</lpage>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Takamura</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Inui</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Okumura</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Extracting semantic orientations of words using spin model</article-title>
          .
          <source>In: Proc. 43rd Annual Meeting on Association for Computational Linguistics</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Thelwall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buckley</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paltoglou</surname>
          </string-name>
          , G.:
          <article-title>Sentiment strength detection for the social web</article-title>
          .
          <source>J. American Society for Information Science and Technology</source>
          <volume>63</volume>
          (
          <issue>1</issue>
          ),
          <fpage>163</fpage>
          -
          <lpage>173</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Thelwall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buckley</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paltoglou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cai</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kappas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Sentiment strength detection in short informal text</article-title>
          .
          <source>J. American Society for Info. Science and Technology</source>
          <volume>61</volume>
          (
          <issue>12</issue>
          ) (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Littman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Measuring praise and criticism: Inference of semantic orientation from association</article-title>
          .
          <source>ACM Transactions on Information Systems</source>
          <volume>21</volume>
          ,
          <fpage>315</fpage>
          -
          <lpage>346</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pantel</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , et al.:
          <article-title>From frequency to meaning: Vector space models of semantics</article-title>
          .
          <source>Journal of artificial intelligence research 37(1)</source>
          ,
          <fpage>141</fpage>
          -
          <lpage>188</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Velikovich</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blair-Goldensohn</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hannan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>The viability of web-derived polarity lexicons</article-title>
          .
          <source>In: Human Language Technologies: ACL</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. Wilson,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Wiebe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Hoffmann</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <article-title>Recognizing contextual polarity in phrase-level sentiment analysis</article-title>
          .
          <source>In: Proc. Empirical Methods in NLP Conf. (EMNLP)</source>
          . Vancouver, Canada (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peng</surname>
          </string-name>
          , Q., Cheng, Y.:
          <article-title>Identifying the semantic orientation of terms using s-hal for sentiment analysis</article-title>
          .
          <source>Knowledge-Based Systems 35</source>
          ,
          <fpage>279</fpage>
          -
          <lpage>289</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>