<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Automatic Emotional Text Annotation Using Facial Expression Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alex Mircoli Supervisors: Alessandro Cucchiarelli</string-name>
          <email>a.mircoli@pm.univpm.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claudia Diamantini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Domenico Potena</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Engineering Università Politecnica delle Marche</institution>
          ,
          <addr-line>Ancona</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <fpage>188</fpage>
      <lpage>196</lpage>
      <abstract>
        <p>Sentiment analysis calls for large amounts of annotated data, which are usually in short supply and require great efforts in terms of manual annotation. Furthermore, the analysis is often limited to text polarity and writer's emotions are ignored, even if they provide valuable information about writer's feelings that might support a large number of applications, such as open innovation processes. Our research is hence aimed at developing a methodology for the automatic annotation of texts with regard to their emotional aspects, exploiting the correlation between speech and facial expressions in videos. In the present work we describe the main ideas of our proposal, presenting a four-phase methodology and discussing the main issues related to the selection of input frames and the processing of emotions resulting from facial expressions analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>Sentiment Analysis</kwd>
        <kwd>Automated text annotation</kwd>
        <kwd>Facial expression analysis</kwd>
        <kwd>Emotion detection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In recent years, the rapid growth of social networks, personal blogs and review
sites has made available an enormous amount of user-generated content. Such data
often contain people’s opinions and emotions about a certain topic; that information is
considered authentic, as in the above contexts people usually feel free to express their
thoughts. Therefore, the analysis of this user-generated content provides valuable
information on how a certain topic or product is perceived by users, allowing firms to
address typical marketing problems as, for instance, the evaluation of customer
satisfaction or the measurement of the appreciation of a new marketing campaign.
Moreover, the analysis of customers' opinions about a certain product helps business owners
to find out possible issues and may suggest new interesting features, thus representing
a valid tool for open innovation.</p>
      <p>
        For this reason, in the last years many researchers (e.g., [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]) focused on
techniques for the automatic analysis of writer’s opinion, generally referred to as
sentiment analysis [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In its most common meaning, the term refers to the analysis of the
text polarity, that is the evaluation of positiveness or negativeness of the author’s
view towards a particular entity. Due to the intrinsic complexity of the human
language, this task offers several challenges, some of which, namely the automatic
detection of the scope of negation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and the disambiguation of polysemous words [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
have been addressed in our previous work.
      </p>
      <p>
        A more general definition of sentiment analysis also involves the computational
treatment of the emotional aspects of text [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In this paper we adopt this broader
definition and, in particular, we focus on the detection of feelings and emotions in
texts.
      </p>
      <p>
        Sentiment analysis is generally performed using two different approaches, that
respectively rely on annotated lexical resources (lexicon-based techniques) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and deep
neural networks (learning-based techniques) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Learning-based techniques usually
reach high accuracy but they need considerable amounts of annotated training data.
Moreover, they are very domain-specific, so the creation of a new annotated dataset is
required whenever the model needs to be retrained for a different domain. On the
other hand, the lexicon-based approach requires the availability of large corpora. At
the moment, few open access corpora exist (e.g., SentiWordNet [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]) and they are only
available for English. Furthermore, emotions are considered by few corpora (e.g.,
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]) and typically for small sets of words. In conclusion, both approaches rely on the
existence of large amounts of annotated data, which require great efforts in terms of
manual annotation. Therefore, there is a need for techniques to automatically (and
objectively) annotate emotions in text, in order to dynamically create language- and
domain-specific corpora for sentiment analysis.
      </p>
      <p>To this purpose, our research aims at developing a novel methodology for the
automatic creation of emotionally annotated corpora through the analysis of speakers’
facial expressions in subtitled videos. These corpora will enable the analysis of the
emotional aspects of user-generated content, in order to provide valuable insights
about, for instance, consumer perception or voters’ opinion. We start from the
following research questions (RQ):
• RQ1: What is the state of the art in sentiment analysis?
• RQ2: How can we automatically annotate a corpus w.r.t emotions
expressed in text?
• RQ3: How can the resulting emotional annotation be evaluated?
• RQ4: How can we exploit our system to enhance traditional Business
Intelligence applications?</p>
      <p>
        Due to the popularity of video-sharing platforms, such as YouTube, a multitude of
subtitled videos has been made publicly available: as a consequence, an automatic
annotation technique allows for the analysis of large amounts of text data. The choice
of analyzing facial expressions is driven by the consideration that they are
unambiguous non-verbal clues that can be exploited to precisely estimate the speaker’s
emotional state. Furthermore, in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] is empirically verified that there is a strong
correlation between facial expressions and speech, so that spontaneous speakers are expected
to exhibit emotions consistently with their speech.
      </p>
      <p>
        In this work, we model human emotions according to Ekman’s theory [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], that
states the existence of six archetypal emotions (i.e., anger, disgust, fear, happiness,
sadness, surprise). Text annotation by means of the analysis of these basic facial
expressions shows several advantages: (i) every emotion can be viewed as a
combination of the six basic emotions and hence annotating with respect to basic emotions
allows for the representation of every human emotion; (ii) the technique is
languageand domain-independent, so emotional annotations can be carried out for every
language and scope of application, and finally (iii) facial expressions are demonstrated to
be universal [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], that is unrelated to speaker’s personal characteristics (e.g., sex,
ethnicity).
      </p>
      <p>The rest of the paper is structured as follows: next Section is devoted to present some
relevant related works on sentiment analysis and automatic annotation of corpora. The
methodology for the automatic text annotation is proposed in Section 3, along with
the description of the current status of the research. Finally, Section 4 draws
conclusions and discusses future directions of research.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        In recent literature, several different approaches have been proposed for the
analysis of writer’s opinions. Socher et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] present a recursive neural tensor network
that reaches a state-of-the-art classification accuracy when trained on annotated parse
trees. The main disadvantage of the technique is that it requires a huge amount of
training data: more than 600,000 human annotations were needed to train the original
model and training the network on a different domain would require similar efforts.
Go et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] propose a solution based on distant supervision, in which training data
consists of tweets with emoticons, that reaches an accuracy similar to [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] without
requiring manual annotations. However, both approaches only analyze the text
polarity, without considering the emotional content of text.
      </p>
      <p>
        The analysis of writer’s emotions is performed in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], where authors consider
emotion-word hashtags as labels for emotions in tweets. Nevertheless, their approach
differs from ours since they are more focused on the analysis of personality traits and
they do not consider para-verbal indicators. To the best of our knowledge, the analysis
of para- and non-verbal communication, such as facial expressions, for the purpose of
automatic emotional text annotation has received no attention in recent literature:
although Busso et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] propose to analyze facial expressions and speech tone in
order to detect emotions, they do not exploit this information to build annotated
corpora.
      </p>
      <p>In this Section we describe the methodology for the automatic emotional text
annotation based on the analysis of speakers’ facial expressions in videos. As depicted in
Fig.1, the proposed methodology consists of four steps: first, a data source is selected,
with particular attention to some issues that could preclude the feasibility and/or the
accuracy of emotion detection. Afterwards, in the second step frames are extracted
and analyzed using a face recognition software, in order to filter out scenes with zero
or multiple faces. The facial expressions of people appearing in the remaining frames
are then analyzed (step three) and the resulting emotion vectors are assigned to the
corresponding subtitle chunk. Finally, in step four the emotion associated to each
subtitle chunk is computed starting from the emotion vectors.</p>
      <p>An example of this procedure is depicted in Fig.2, where frames belonging to the
subtitle chunk “I am anguished and tormented” (Fig. 2(a)) are processed by a facial
expression analyzer, whose output is a set of emotion vectors, that are plotted in
Fig.2(b). In the line chart, the sadness expression (blue line, circular marker)
dominates the central part of the graph (frames 3-7), while disgust expressions (green line,
x-shaped marker) appear on the final frames.</p>
      <p>A more detailed discussion of each methodology step is presented in the following
subsections, along with the description of the current state of the research.</p>
      <p>The first step involves the selection of the data source. This is a crucial phase, as
the quality of the final annotated corpus strongly depends on the selection of a
suitable set of input videos. Even if emotions are expressed through universally shared
patterns, there are several factors that can impact on speaker’s spontaneity and
expressiveness and that must be considered in selecting the video categories to be
analyzed. A list of issues that we faced in our preliminary video scouting activity, and
can potentially impact on the quality of text annotation, includes:
• lack of expressiveness: in some video typologies, such as news reports or product
reviews, speakers are required to maintain an expressionless face, in order to give
objectivity to their speech. As a consequence, the analysis of their facial
expressions can be misleading, as emotion-bearing words could be annotated as
neutral.
• interpretation: in case of movies or theatrical monologues, some actors are
required to play characters having a specific personality. In such circumstances,
facial expressions are altered by acting: a criminal, for instance, might have a scary
face even when talking about happy things.
• reported speech: when people report what another person has said, facial
expressions reflect their personal feelings and hence detected emotions can be in contrast
with the original meaning of the sentences.
• external factors: external factors impacting on speaker’s mood can affect the
correlation between speech and facial expressions. For instance, an eyewitness
interviewed immediately after a plane crash would probably show fear expressions,
regardless of the specific words he is pronouncing.
• subtitles quality: video and subtitles might not be in synchronization, thus facial
expressions could not correspond to subtitle text, or subtitles may not be accurate,
as they have been automatically generated.</p>
      <p>The above-listed issues are noise factors that cannot be totally avoided but the
selection of a proper data source can effectively impact on their presence in analyzed
videos. In this preliminary phase we limited to explore several different data sources
and manually select those that we considered more suitable to the purpose of our
analysis.</p>
      <p>Nevertheless, some of these problems can be automatically detected in videos: for
example, interviews and news reports can often be identified through the analysis of
the video title, while in some video sharing sites (e.g., YouTube) there are dedicated
APIs to find out if subtitles have been automatically generated.
3.2</p>
      <sec id="sec-2-1">
        <title>Video Pre-processing</title>
        <p>After selecting a proper data source, each video is subject to a preprocessing phase,
whose output is the set of frames  to be analyzed. From a computational perspective,
a desirable property for  is that  ≪ ||, where  is the entire set of video frames,
since the analysis of facial expressions is a computationally-intensive task, that can
take up to five seconds for each frame on state-of-the-art facial analysis tools.
Considering that typical frame rates are around 30 fps, the analysis of every frame of a
5minute video may require up to 150 minutes to be performed, making infeasible
large-scale annotations. Anyway, many frames may be discarded without significant
loss of information: for instance, since the purpose of the methodology is the
annotation of subtitles on the basis of the concomitant speaker’s expressions, frames not
related to any subtitle chunk may also be discarded. Apart from this preliminary
operation, video pre-processing consists of sampling and filtering.</p>
        <p>Sampling. The choice of the sampling rate (), that is the number of fps to be
extracted, has implications on both speed and accuracy. A high value for the parameter
 implies a higher number of frames to be analyzed, with a consequent increase in
the execution time of the facial expressions analysis. Moreover, a speaker is expected
to exhibit almost identical expressions in a block of consecutive frames, then the
analysis of the entire block is redundant.</p>
        <p>On the other hand, by choosing a small value for  (e.g.,  &lt; 1) there is the risk to
extract many irrelevant frames, such as those where there are transitions from an
expression to another. Furthermore, facial expressions are somewhat dependent on the
concomitant phonatory movements. For instance, the pronunciation of the vowel [ɑ]
requires speakers to widely open their mouth, that could be interpreted as a surprise
expression. As a consequence, the analysis of a too small amount of frames is
errorprone, especially in presence of speakers with a great articulation of open vowels.
We performed some preliminary experiments in order to find a value for  that
would balance speed and accuracy. We found that 2 ≤  ≤ 4 offers a classification
accuracy comparable to the analysis of every frame, while reducing of approximately
one order of magnitude the whole execution time.</p>
        <p>
          Filtering. Some sampled frames should be discarded as they do not contain useful
information:
• frames with zero or multiple faces, as well as frames containing speakers not
facing the camera, cannot be analyzed due to the lack (or the excess, with the
consequent problem of identifying speaker’s face) of correctly recognizable faces.
• in case of two temporally close subtitle chunks (e.g., when they both belong to the
same sentence) ! and !!!, the speaker’s facial expressions in the first frames of
!!! could be related to !, because emotions may remain on speaker’s face up to
22.5 seconds after the end of the related sentence [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. Therefore, if the ending time
! !  of ! is close or coincident to the starting time !!!! of !!!, the first  frames
related to !!! may be excluded from , where the parameter  has to be chosen
empirically.
        </p>
        <p>The filtering step can be automated through a face analysis tool: in particular, in our
system we use Microsoft Face API1, that provides information about the number and
the pose (i.e., facing/not facing camera) of faces in an image.
3.3</p>
      </sec>
      <sec id="sec-2-2">
        <title>Facial Expression Analysis</title>
        <p>As a result of the video pre-processing phase, each ! is associated with a set of
frames ! ⊆ , where ! = { !!! ∙!! ,  !!! ∙!! !!, … ,  !!! ∙!! }; the symbol · denotes
the  · operator (alternatively, the (·) operator may be used). In this step,
for each frame ! ∈ ! we analyze the speaker’s facial expression with respect to
Ekman’s theory of six basic emotions (i.e., anger, disgust, fear, happiness, sadness,
surprise) and we obtain the emotion vector ! , defined as
1 https://www.microsoft.com/cognitive-services/en-us/emotion-api
! = [! !! ! !! ! !! ! !! ! !! ! !! ]!, where  is the emotion matrix of the video and
! ! represents the value of the  emotion in !. At the end of the analysis, each
subtitle chunk ! is associated with the emotion matrix !! =  !!! ∙!! : !!! ∙!! , where !:!
denotes the ( −  + 1) columns of  starting from the -th column.</p>
        <p>
          At the moment, we perform the facial expression analysis through the free version
of Microsoft Emotion API. The facial expression analysis is performed with respect to
eight classes: in addition to Ekman’s basic expressions, the software evaluates the
contempt and the neutral expressions. The value of the latter is calculated as the 1’s
complement of the sum of the other vector components, each of which has a value in
[
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ].
        </p>
        <p>
          In early experiments we found that the analysis of facial expressions of speaking
people is a challenging task, since the degree of mouth opening (that is considered a key
feature by the facial expression analyzer) of a speaker strongly depends on the
articulation of the speech. Although we limited our experiments to Microsoft Emotion API,
it is plausible that other tools may have the same issue, since many facial expression
analysis techniques in literature relies on this feature (e.g., [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]). We hypothesized
that mouths could be removed from images without compromising the analysis, as the
software may still rely on the position of eyes, forehead and eyebrows as emotion
markers. We tested our hypothesis on a small test set and we noticed a 20% increase
in classification accuracy when mouths were manually hidden before performing the
facial expression analysis.
3.4
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Data Post-processing</title>
        <p>The outputs of the previous step are the emotion matrix  and a set of annotations in
the form ! → !!. This level of annotation provides information about the
distribution of emotions in each frame, while we are more interested in a text-level
annotation. The final form of the annotated text strictly depends on the kind of application it
is intended for: training data for machine learning are usually in the form ! → !,
where the class ! ∈  corresponds to a basic emotion, while annotated corpora for
lexicon-based sentiment analysis may also be in the form ! → ê!, where ê! is a vector
containing an aggregate value for each emotion.</p>
        <p>In general, the first step of the post-processing phase is the definition of the
aggregation function :
that applies an aggregation operator (e.g., max, avg) to every row of !, outputting a
vector ê! having a single aggregate value for each emotion.</p>
        <p>To bind each subtitle chunk to a class label !, the transform function  is applied:
:  ! →   ê!
:  ê! → !
in order to evaluate the dominant emotion and assign the related class label to !. The
choice of  is non-trivial, as in some situations there is no correspondence between
the maximum value of ê! and the actual dominant emotion. For instance, in case of
noisy scenes, the facial expression analyzer would probably assign lower values to
emotions, as it is not able to detect enough emotion markers in speaker’s face. In this
scenario, the function  should discard the highest value in ê!, whenever it is related
to the neutral emotion and there is another emotion with a sufficiently high value.</p>
        <p>At the moment, in our implementation we set  = (·) and  = max  (·) but
other solutions are under investigation: for example, we are interested in performing
experiments using  = (·). In preliminary experiments we found that the
annotation accuracy of the system depends on the quality of input videos (e.g., expressive
speakers, synchronized subtitles). For some videos we reached a remarkable 66%
classification accuracy in 8-class emotional annotation. A major advantage of our
technique emerged during the analysis of our results: when a sentence is
automatically annotated starting from the sentiment of each word, it is not always
possible to correctly capture its overall sentiment, which instead emerges from
speaker’s expression. For instance, the phrase “throw myself in the Arno” is correctly
annotated as “sadness”, although in the phrase there are not words related to sadness.
In our preliminary analysis, we also noticed that some words frequently co-occur in
sentences related to a specific emotion, suggesting that there is a correlation between
these n-grams and the emotion. For this reason, we plan to perform a large-scale
analysis to further investigate this phenomenon.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Future Directions</title>
      <p>Our work is organized in terms of a three years Ph.D. research project. The first year
was dedicated to the definition of the methodology, in addition to an early
experimentation in order to validate the research idea. In the next two years, we plan to move
along two main directions: firstly, the study of the best  and  functions for data
post-processing, in particular to deal with noisy scenes, where emotion vectors show
wide variations even among frames containing similar facial expressions. An
appropriate choice of  and  will reduce misclassifications, thus allowing for a more
accurate detection of people’s emotions in texts.</p>
      <p>Secondly, the extension of experiments to larger datasets, in order to better evaluate
the classification accuracy and perform statistical analysis to find correlations
between specific couples &lt;n-gram, emotion&gt;, with the purpose of building an
emotionally annotated corpus for sentiment analysis.</p>
      <p>Moreover, we intend to further investigate the effectiveness of hiding mouths and
possibly automate this procedure.</p>
      <p>Finally, since the quality of the annotation strongly depends on the accuracy of the
facial expression analysis, we are interested in evaluating different face analysis tools,
in order to compare their performance and choose the best solution.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Chien</surname>
            ,
            <given-names>C. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>You-De</surname>
          </string-name>
          , T.:
          <article-title>Quality evaluation of product reviews using an information quality framework</article-title>
          .
          <source>Decision Support Systems</source>
          <year>2011</year>
          , Vol.
          <volume>50</volume>
          , pp.
          <fpage>755</fpage>
          -
          <lpage>68</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Lai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Recurrent Convolutional Neural Networks for Text Classification</article-title>
          . AAAI Vol.
          <volume>33</volume>
          , pp.
          <fpage>2267</fpage>
          -
          <lpage>2273</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts</article-title>
          .
          <source>In: Proceedings of the ACL</source>
          <year>2004</year>
          , pp.
          <fpage>271</fpage>
          -
          <lpage>278</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Diamantini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mircoli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potena</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>A negation handling technique for sentiment analysis</article-title>
          .
          <source>In: Proceedings of the Collaboration Technologies and Systems (CTS)</source>
          <year>2016</year>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Diamantini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mircoli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potena</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Storti</surname>
          </string-name>
          , E.:
          <article-title>Semantic disambiguation in a social information discovery system</article-title>
          .
          <source>In: Proceedings of the Collaboration Technologies and Systems (CTS)</source>
          <year>2015</year>
          , pp.
          <fpage>326</fpage>
          -
          <lpage>333</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Medhat</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hassan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Korashy</surname>
          </string-name>
          , H.:
          <article-title>Sentiment analysis algorithms and applications: a survey</article-title>
          .
          <source>Ain Shams Engineering Journal</source>
          , Volume
          <volume>5</volume>
          , Issue 4, pp.
          <fpage>1093</fpage>
          -
          <lpage>1113</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Qiu</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>DASA: dissatisfaction-oriented advertising based on sentiment analysis</article-title>
          .
          <source>Expert Systems with Applications</source>
          <year>2010</year>
          , Vol
          <volume>37</volume>
          , pp.
          <fpage>6182</fpage>
          -
          <lpage>91</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaithyanathan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Thumbs up?: sentiment classification using machine learning techniques</article-title>
          .
          <source>In: Proc. of Empirical methods in natural language processing</source>
          , Volume
          <volume>10</volume>
          , pp.
          <fpage>79</fpage>
          -
          <lpage>86</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Baccianella</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Esuli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sebastiani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>SentiWordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining</article-title>
          .
          <source>In: Proc. of LREC. Genoa</source>
          , vol.
          <volume>10</volume>
          , pp.
          <fpage>2200</fpage>
          -
          <lpage>2204</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Crowdsourcing a Word-Emotion Association Lexicon</article-title>
          .
          <source>Computational Intelligence</source>
          ,
          <volume>29</volume>
          (
          <issue>3</issue>
          ), pp.
          <fpage>436</fpage>
          -
          <lpage>465</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Livingstone</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>W.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wanderley</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Common cues to emotion in the dynamic facial expressions of speech and song</article-title>
          .
          <source>The Quarterly Journal of Experimental Psychology</source>
          ,
          <volume>68</volume>
          :5, pp.
          <fpage>952</fpage>
          -
          <lpage>970</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Ekman</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>An argument for basic emotions</article-title>
          .
          <source>Cognition &amp; emotion, 6(3-4)</source>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>200</lpage>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ekman</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sorenson</surname>
            ,
            <given-names>R.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friesen</surname>
            ,
            <given-names>W.V.</given-names>
          </string-name>
          :
          <article-title>Pan-cultural elements in facial displays of emotions</article-title>
          .
          <source>Science</source>
          ,
          <volume>164</volume>
          , pp.
          <fpage>86</fpage>
          -
          <lpage>88</lpage>
          (
          <year>1969</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Tamilselvi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , ParveenTaj.,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Sentiment Analysis of Micro blogs using Opinion Mining Classification Algorithm</article-title>
          .
          <source>International Journal of Science and Research (IJSR) 2.10</source>
          , pp.
          <fpage>196</fpage>
          -
          <lpage>202</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Gokulakrishnan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Priyanthan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ragavan</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prasath</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perera</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Opinion mining and sentiment analysis on a twitter data stream</article-title>
          .
          <source>In: Proc. of advances in ICT for emerging regions</source>
          , pp.
          <fpage>182</fpage>
          -
          <lpage>188</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perelygin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>J.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chuang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potts</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Recursive deep models for semantic compositionality over a sentiment treebank</article-title>
          .
          <source>In: Proceedings of the conference on empirical methods in natural language processing (EMNLP)</source>
          , Vol.
          <volume>1631</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Go</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhayani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Twitter sentiment classification using distant supervision</article-title>
          .
          <source>CS224N Project Report</source>
          , Stanford (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiritchenko</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Using hashtags to capture fine emotion categories from tweets</article-title>
          .
          <source>Computational Intelligence</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Busso</surname>
          </string-name>
          ,
          <string-name>
            <surname>Carlos</surname>
          </string-name>
          , et al:
          <article-title>Analysis of emotion recognition using facial expressions, speech and multimodal information</article-title>
          .
          <source>In: Proceedings of the 6th international conference on Multimodal interfaces</source>
          .
          <source>ACM</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Tian</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kanade</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohn</surname>
          </string-name>
          , J.:
          <article-title>Recognizing Lower Face Action Units for Facial Expression Analysis</article-title>
          .
          <source>In: Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG'00)</source>
          , pp.
          <fpage>484</fpage>
          -
          <lpage>490</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>