<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Pragma-Semantic Analysis of the Emotion/Sentiment Relation in Debates</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Valerio Basile</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elena Cabrio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Serena Villata</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claude Frasson</string-name>
          <email>frasson@iro.umontreal.ca</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabien Gandon</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Université Côte d'Azur</institution>
          ,
          <addr-line>CNRS, Inria, I3S</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Montreal</institution>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In recent years, emotion recognition tools have become increasingly popular. They aim to detect the emotions of human actors performing different intelligent tasks by means of headsets and facial emotion detection tools. In addition to this kind of technology, when participants interact with each other through textual exchanges, sentiment analysis techniques from the natural language processing research area are exploited to detect the polarity of the exchanged messages. Investigating how these two connected components interact and can support each other towards better emotion and sentiment detection is a relevant but unexplored research challenge. In this paper, we start from a dataset of debate interactions annotated with the emotions of the involved participants, captured by means of EEG headsets and a facial emotion recognition tool, and with the argumentative structures of the debates, and we compare this information to the polarity of the proposed textual arguments, retrieved through a sentiment analysis algorithm. A pragma-semantic analysis of the obtained results is provided, along with a discussion of potential future work.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Analyzing and detecting the emotions felt by people engaged in a debate is
becoming more and more important in Artificial Intelligence. Reasoning techniques
such as argumentation theory [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], based on rational postulates and critical
thinking, have started to be connected with personal and emotional information, as
for instance in [
        <xref ref-type="bibr" rid="ref2 ref4 ref6">6, 2, 4</xref>
        ]. The idea is that, to obtain an overall view of a debate,
several components have to be considered at the same time, i.e.,
argumentation, emotions, and sentiment, which influence one another. In
this paper, we start from a dataset of textual arguments annotated with the
emotions felt by the participants of an experimental session, captured through an Emotiv
EPOC EEG headset and the real-time, frame-by-frame facial expression analysis
software FaceReader [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], and we apply sentiment analysis techniques to analyse
the natural language textual arguments proposed in the debates. More precisely,
we carry out a pragma-semantic analysis of the obtained mismatches, e.g.,
captured emotion happy and polarity of the argument negative, and we discuss
the preliminary results of this study. The aim of the analysis is to
challenge current sentiment analysis techniques against a gold standard of textual
arguments whose associated real emotion is annotated.
      </p>
      <p>To the best of our knowledge, this is the first time this kind of analysis has been proposed.
It is important to start considering the interplay of these three components,
i.e., argumentation, emotions and sentiment, which are indispensable to model
cognitive agents.</p>
    </sec>
    <sec id="sec-2">
      <title>The Pragma-Semantic Analysis</title>
      <p>In this section, we first describe the dataset of textual arguments we use for
our pragma-semantic analysis (Section 2.1), and second, we provide some
insights about sentiment analysis techniques (Section 2.2). Finally, we discuss
the results of our ongoing pragma-semantic analysis (Section 2.3).</p>
      <sec id="sec-2-1">
        <title>Dataset</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], we presented an open dataset to compare and analyze emotion detection
in an argumentation session. More precisely, the goal of our empirical
analysis was to study the link between the arguments people put forward when they
debate with each other and the emotions they feel during these debates. We
conducted an experiment aimed at verifying our hypotheses about the
correlation between positive/negative emotions and the positive/negative
relations among the arguments put forward in the debate. For more details
about the participants and the results of this study, we refer the reader to [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>The dataset of debates consists of 10 debates carried out by 4 participants at
a time (20 total participants), excluding the moderator. The dataset is composed
of three main layers: (i) the basic annotation of the arguments proposed in each
debate (i.e., the XML annotation of the debate flow downloaded from the debate
platform); (ii) the annotation of the relations of support and attack among the
arguments; and (iii) starting from the basic annotation of the arguments, the
annotation of each argument with the emotions felt by each participant involved
in the debate. Table 1 shows some statistics on the dataset. (The dataset of
annotated arguments is available at http://project.inria.fr/seempad/datasets/)</p>
        <p>An example from the debate on the topic "Religion does more harm than
good", where arguments are annotated with emotions (i.e., the third layer of the
annotation of the textual arguments we retrieved), is as follows:
&lt;argument id="30" debate_id="4" participant="4"
time-from="20:43" time-to="20:43"
emotion_p1="neutral" emotion_p2="neutral"
emotion_p3="neutral" emotion_p4="neutral"&gt;</p>
        <p>Indeed but there exist some advocates of the devil
like Bernard Levi who is decomposing arabic countries.
&lt;/argument&gt;
&lt;argument id="31" debate_id="4" participant="1"
time-from="20:43" time-to="20:43"
emotion_p1="angry" emotion_p2="neutral"
emotion_p3="angry" emotion_p4="disgusted"&gt;</p>
        <p>I don't totally agree with you Participant2: science
and religion don't explain each other, they tend to
explain the world but in two different ways.
&lt;/argument&gt;
&lt;argument id="32" debate_id="4" participant="3"
time-from="20:44" time-to="20:44"
emotion_p1="angry" emotion_p2="happy"
emotion_p3="surprised" emotion_p4="angry"&gt;</p>
        <p>Participant4: for recent wars ok but what about wars
happened 3 or 4 centuries ago?
&lt;/argument&gt;</p>
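        <p>As a minimal sketch of how an annotation of this form could be read, the following Python fragment parses two of the arguments shown above and looks up the emotion recorded for the author of each one. The enclosing debate element, the helper names load_arguments and proponent_emotion, and the reading of emotion_pN as the emotion of participant N are assumptions made for illustration; they are not taken from the released dataset tooling.</p>
        <preformat><![CDATA[
```python
import xml.etree.ElementTree as ET

# Two arguments copied from the excerpt above. The <debate> wrapper is an
# assumption added here so that the fragment parses as one XML document.
SAMPLE = """<debate>
<argument id="31" debate_id="4" participant="1"
          time-from="20:43" time-to="20:43"
          emotion_p1="angry" emotion_p2="neutral"
          emotion_p3="angry" emotion_p4="disgusted">
I don't totally agree with you Participant2: science
and religion don't explain each other, they tend to
explain the world but in two different ways.
</argument>
<argument id="32" debate_id="4" participant="3"
          time-from="20:44" time-to="20:44"
          emotion_p1="angry" emotion_p2="happy"
          emotion_p3="surprised" emotion_p4="angry">
Participant4: for recent wars ok but what about wars
happened 3 or 4 centuries ago?
</argument>
</debate>"""


def proponent_emotion(argument):
    """Return the emotion recorded for the participant who wrote the argument.

    Assumes that attribute emotion_pN holds the emotion of participant N.
    """
    author = argument.get("participant")
    return argument.get("emotion_p" + author)


def load_arguments(xml_text):
    """Map each argument id to (whitespace-normalized text, author's emotion)."""
    root = ET.fromstring(xml_text)
    return {
        arg.get("id"): (" ".join(arg.text.split()), proponent_emotion(arg))
        for arg in root.iter("argument")
    }


arguments = load_arguments(SAMPLE)
for arg_id, (text, emotion) in arguments.items():
    print(arg_id, emotion, text[:40])
```
]]></preformat>
        <p>Under these assumptions, the lookup yields angry for argument 31 (written by participant 1) and surprised for argument 32 (written by participant 3).</p>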
        <p>Figure 1 shows one of the visualizations of the debates that we used to
explore the dataset (debate number 9, "Fear government power over Internet"). The
participants are color-coded (gray always indicates the moderator). The numbers
inside the nodes are the identifiers of the arguments, in chronological
order. A single directed edge indicates a support relation, where the tail node
supports the head node, while a double-lined edge indicates an attack. The shape
of the argument nodes shows the associated sentiment (see Section 2.2).</p>
        <p>The goal of sentiment analysis (sometimes referred to as opinion mining) is
to detect the polarity, whether positive, neutral, or negative, of the attitude
expressed in a natural language utterance. A typical first step is to determine
whether a statement is objective or subjective; only in the latter case
does one proceed to identify its polarity. However, often only the second task is
performed, thus conflating objective statements with a neutral attitude.</p>
        <p>
          Recent years have seen an enormous increase in research on
sentiment analysis systems of various sorts that employ several natural language
processing techniques. Solutions range from simple lookups in polarity or
affection resources, i.e., databases where a polarity score is associated with terms, to
more sophisticated models built through supervised, unsupervised, and distant
learning involving various sets of features [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>
          Several approaches to polarity detection are found in the literature. The
simplest route is to detect the specific words that are known to express a positive,
negative or neutral feeling. For example, [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] use a lexicon projection strategy
yielding predictions that significantly correlate with polls. Deeper linguistic
analysis has been shown to improve the performance of sentiment analysis
systems [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. However, accurate processing can be hard on texts such as social media
messages and text chats, which are short, rich in abbreviations and often contain
syntactic or spelling mistakes.
        </p>
        <p>In order to extract the sentiment from the text of the debates, we submit
every argument separately to Alchemy API (http://www.alchemyapi.com/api/sentiment-analysis),
a free service provided by IBM via
an online HTTP REST Web service. For any given text, Alchemy API returns
a sentiment label, either neutral, negative or positive, and a sentiment score
ranging from -1 (totally negative sentiment) to 1 (totally positive sentiment).</p>
        <p>We correlate the sentiment of the arguments with the emotion of the
participants who wrote them. Moreover, we only consider the primary emotion.</p>
        <p>The confusion matrix in Table 2 shows how the sentiment compares to the
emotions detected by the FaceReader. It is clear that, while the FaceReader is
quite conservative in assigning non-neutral emotions, the output of Alchemy API
reveals highly polarized sentiment, with 212 arguments categorized as negative,
140 as positive, and a relative minority of 104 as neutral. The emotion happy
is observed in the dataset only twice as primary emotion, and in neither case is
the emotion detected for the proponent of the argument.</p>
        <p>In order to study the correlation between sentiment and emotions, we
manually map the emotions to polarities, following this simple
scheme:
- angry, disgusted, scared → negative
- happy, surprised → positive
- neutral → neutral
This mapping results in the confusion matrix shown in Table 3. The sentiment
extracted by Alchemy API matches the polarity of the emotion in 116 cases
(37 negative, 1 positive, 78 neutral, about 25% of all arguments). Among
the mismatches, the polarities are inverted in 30 cases: in 27 cases a
positive sentiment is associated with a negative emotion, and in 3 cases the opposite
happens.</p>
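        <p>The mapping and the resulting counts can be sketched in a few lines of Python. This is only an illustration of the bookkeeping, not the code used in the study: the function names polarity_confusion and summarize are hypothetical, and the pairs in the toy list are invented for the example rather than drawn from the dataset.</p>
        <preformat><![CDATA[
```python
from collections import Counter

# Emotion-to-polarity mapping, as given in the scheme above.
EMOTION_POLARITY = {
    "angry": "negative",
    "disgusted": "negative",
    "scared": "negative",
    "happy": "positive",
    "surprised": "positive",
    "neutral": "neutral",
}


def polarity_confusion(pairs):
    """Count (emotion polarity, sentiment label) combinations.

    `pairs` is an iterable of (emotion, sentiment_label) tuples, one per argument.
    """
    return Counter((EMOTION_POLARITY[emotion], label) for emotion, label in pairs)


def summarize(confusion):
    """Return (number of matches, number of fully inverted polarities)."""
    matches = sum(n for (emo, sent), n in confusion.items() if emo == sent)
    inverted = sum(
        n for (emo, sent), n in confusion.items()
        if {emo, sent} == {"positive", "negative"}
    )
    return matches, inverted


# Toy input, invented for illustration only (not taken from the dataset).
toy = [
    ("angry", "negative"),      # match
    ("angry", "positive"),      # inverted
    ("neutral", "neutral"),     # match
    ("surprised", "negative"),  # inverted
    ("neutral", "positive"),    # mismatch, but not inverted
]

confusion = polarity_confusion(toy)
matches, inverted = summarize(confusion)
print("matches:", matches, "inverted:", inverted)
```
]]></preformat>
        <p>Keeping the mapping in a single dictionary makes it easy to test alternative choices, for instance treating surprised as neutral rather than positive, and to see how the inverted cases change.</p>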
        <p>We believe that the cases where a full mismatch is observed are the most
interesting for exploring the relation between the sentiment found in the text and
the emotions felt by the participants in a debate. Therefore, we inspected them
one by one and report our findings in the next section.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Discussion</title>
        <p>In the three cases where the sentiment is negative, the emotion is "surprised".
The arguments seem to be genuinely of a negative nature ("Racial profiling is
a prototype which is unacceptable I think."), therefore the mismatch could be
the result of either a misclassification of the emotion by the FaceReader or the
naive mapping between sentiment polarity and emotions shown in Section 2.2,
where surprise is classified as a positive emotion.</p>
        <p>In eight cases, the polarity of the argument has been wrongly predicted. Given
the statistical nature of the majority of state-of-the-art language analysis tools,
including the software for sentiment analysis, a certain rate of errors is always
to be expected. Interestingly, here most of the errors are due to word choice,
in particular to the use of certain nouns that are typically associated with positive
sentiment. Examples of this phenomenon include, for instance: "Do we know
what would be a good way to make someone not a bully? i.e. to teach "respect"?".
The word "respect" in particular is associated with a positive polarity by the
system. While the sentiment score of the original message is 0.33, replacing the
word "respect" with a neutral word like "math" gives a totally neutral message,
according to Alchemy API. Another word that seems to confuse the automatic
classification of sentiment is "thanks", in phrases such as "thanks to ...".</p>
        <p>In nine cases, the argument is a reply (possibly an attack or a support) to an
argument proposed by another participant. Since we fed isolated messages to the
sentiment analysis component, it is natural that the analysis of such cases will
not be accurate, since the system is missing important contextual information.</p>
        <p>Two of the mismatching arguments are phrased in a quite convoluted way
that contributes to confusing the classifier. One such example reads: "Of
course, from university you can learn a lot of stuff, have better degree, but don't
think that such degree will be helpful to get a better job later." Note that the
pattern "of course X, but Y" is difficult for automatic language
analysis to interpret without resorting to some logical interpretation of the text structure.</p>
        <p>Finally, in three cases, the sentiment seems to be genuinely positive. In two
of them, the corresponding emotion is "scared", which is seldom observed across
the entire dataset. Since there is no element of fear in the text of these arguments,
we tend to attribute these mismatches to noise in the original data.</p>
        <p>The six remaining examples either combine several of the aforementioned phenomena
or have features too sparse to draw proper conclusions. It is worth noting
that one argument is of an ironic nature ("RFID ALL THE
PEOPLE!"), a case where the positive polarity of the literal meaning of the text is
correctly associated with a negative emotion.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusions</title>
      <p>In this paper, we have presented some preliminary results of a pragma-semantic
analysis over a dataset of textual arguments from human debates annotated with
their emotions, and on which we have then applied sentiment analysis techniques.
More precisely, we have studied the cases where a mismatch holds between the
sentiment captured from the textual arguments through sentiment analysis, and
the emotion(s) detected from the participants proposing such arguments in the
debate. Some patterns emerge from this analysis. However, our intuition is that
adding also the argumentative information into the loop, i.e., considering not
only the detected emotions but also the attack and support relations among
the arguments, would return useful information to enrich such patterns. This
is our main direction for future work, i.e., to study the interplay of
argumentation, sentiment analysis and emotions in debates, in order to detect patterns
of information from the argumentation and emotion components to improve the
performance of sentiment analysis techniques, and enrich their results.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Benlamine</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chaouachi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villata</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabrio</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frasson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gandon</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Emotions in argumentation: an empirical evaluation</article-title>
          .
          <source>In: Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence</source>
          ,
          <string-name>
            <surname>IJCAI</surname>
          </string-name>
          <year>2015</year>
          . pp.
          <volume>156</volume>
          –
          <issue>163</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Grosse</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          , Chesñevar,
          <string-name>
            <given-names>C.I.</given-names>
            ,
            <surname>Maguitman</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.G.</surname>
          </string-name>
          :
          <article-title>Integrating argumentation and sentiment analysis for mining opinions from twitter</article-title>
          .
          <source>AI Commun</source>
          .
          <volume>28</volume>
          (
          <issue>3</issue>
          ),
          <volume>387</volume>
          –
          <fpage>401</fpage>
          (
          <year>2015</year>
          ), http://dx.doi.org/10.3233/AIC-140627
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Sentiment Analysis</article-title>
          and
          <string-name>
            <given-names>Opinion</given-names>
            <surname>Mining</surname>
          </string-name>
          .
          <source>Synthesis Lectures on Human Language Technologies</source>
          , Morgan &amp; Claypool Publishers (
          <year>2012</year>
          ), http://dx.doi.org/10.2200/S00416ED1V01Y201204HLT016
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Medellin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reed</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanson</surname>
          </string-name>
          , V.L.:
          <article-title>Spoken interaction with broadcast debates</article-title>
          .
          <source>In: Computational Models of Argument - Proceedings of COMMA 2014</source>
          . pp.
          <volume>51</volume>
          –
          <issue>58</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>O</given-names>
            <surname>'Connor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Balasubramanyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Routledge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.R.</given-names>
            ,
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.A.</surname>
          </string-name>
          :
          <article-title>From tweets to polls: Linking text sentiment to public opinion time series</article-title>
          . In: Cohen,
          <string-name>
            <given-names>W.W.</given-names>
            ,
            <surname>Gosling</surname>
          </string-name>
          , S. (eds.) ICWSM. The AAAI Press (
          <year>2010</year>
          ), http://dblp.uni-trier.de/db/conf/icwsm/icwsm2010.html#OConnorBRS10
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pak</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paroubek</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Twitter as a corpus for sentiment analysis and opinion mining</article-title>
          . In: Calzolari,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Piperidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Rosner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Tapias</surname>
          </string-name>
          ,
          <string-name>
            <surname>D</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the International Conference on Language Resources and Evaluation</source>
          ,
          <string-name>
            <surname>LREC</surname>
          </string-name>
          <year>2010</year>
          ,
          <volume>17</volume>
          -
          <fpage>23</fpage>
          May
          <year>2010</year>
          , Valletta, Malta.
          <source>European Language Resources Association</source>
          (
          <year>2010</year>
          ), http://www.lrec-conf.org/proceedings/lrec2010/summaries/385.html
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Opinion mining and sentiment analysis</article-title>
          .
          <source>Found. Trends Inf. Retr</source>
          .
          <volume>2</volume>
          (
          <issue>1-2</issue>
          ),
          <volume>1</volume>
          –
          <fpage>135</fpage>
          (Jan
          <year>2008</year>
          ), http://dx.doi.org/10.1561/1500000011
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rahwan</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simari</surname>
            ,
            <given-names>G.R.</given-names>
          </string-name>
          :
          <source>Argumentation in Artificial Intelligence</source>
          . Springer Publishing Company, Incorporated, 1st edn. (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>