<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Debate Outcome Prediction using Automatic Persuasiveness Evaluation and Counterargument Relations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Daiki Shirafuji</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rafal Rzepka</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kenji Araki</string-name>
          <email>arakig@ist.hokudai.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Information Science and Technology, Hokkaido University</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>RIKEN Center for Advanced Intelligence Project</institution>
          ,
          <addr-line>AIP</addr-line>
        </aff>
      </contrib-group>
      <fpage>24</fpage>
      <lpage>29</lpage>
      <abstract>
        <p>Debates play an important educational role and proper argumentation has a power to change people's stance on a given topic. Existing NLP research on persuasiveness of argumentation calculates it for single arguments or ranks arguments by their level of conviction. Our work extends this research by considering counterarguments. They can weaken or strengthen persuasiveness of a given argument, hence we propose novel methods to calculate persuasiveness with opinions existing in opposite stances. We create a corpus from a site containing debates where users evaluate discussions and choose winning side of a debate, allowing the calculation of opinion change. We experimentally confirmed 60.12% accuracy in the proposed debate outcome prediction task proving that additional counterargument-related information is capable to improve baseline methods. Contact Author</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In recent years, the area of argumentation mining has
become popular [Green et al., 2014; Al-Khatib et al., 2016;
Stab et al., 2018]. This topic spreads from extracting
argument components (e.g. claims, premises) to predicting
persuasiveness of an argument [Van Eemeren et al., 2014;
Cabrio and Villata, 2018]. Recent works on persuasiveness
adrress student essays [Farra et al., 2015] and debate’s
argument [Persing and Ng, 2017].</p>
      <p>During a debate, various real life problems are discussed,
such as whether to support death penalty or to ban guns. One
of the purposes of a debate is to determine which side has won
and to provoke forming richer opinions on societal issues.
Human beings judge the winning side of a debate generally
by assessing argument persuasiveness and counterarguments.
However, researchers have concentrated only on
counterargument retrieval [Wachsmuth et al., 2018], predicting
the persuasiveness of argumentation [Persing and Ng, 2017]
or comparing persuasiveness of a pair of arguments
[Habernal and Gurevych, 2016] in debate processing. To the
best of authors’ knowledge, works on counterarguments did
not consider the persuasiveness of both argument and
counterargument, and research on persuasiveness has not yet
considered how persuasiveness is affected by a
counterargument, hence researchers have not dealt with several
arguments within a particular debate. Therefore, existing methods
are not capable of automatic debate winner selection. This
problem is crucial because the debate is a competition of
comparing persuasiveness between attacking and defending
sides. Moreover, existing methods for estimating
persuasiveness cannot address real life problems because these methods
do not consider whether an argument is rebutted or not.</p>
      <p>Researchers have not dealt with above-mentioned
problems, and this paper proposes and studies a task of
predicting debate outcome for evaluating persuasiveness of
several arguments. Predicting debate outcome means
automatically selecting the winner from Pro side (For1) and Con side
(Against) of a debate on a given topic with counterargument
relations. Usually one debate consists of several arguments,
and there is a need to compare not only pairs of arguments
within one side as in previous works, but all arguments
separated into two camps (For and Against) to be capable to
predict the winning side. Some researchers worked on predicting
debate outcome task before [Potash and Rumshisky, 2017],
but their research did not consider persuasiveness of each
argument. Moreover, they only focused on the final audience
poll, which can become the bias. Therefore, we propose a
method for debate outcome prediction considering arguments
persuasiveness and the audience bias to debate themes.</p>
      <p>For this task, we provide a new corpus of 321 debates with
third party evaluation, retrieved from idebate.org site
presenting debates on various topics where anyone can vote for more,
in their opinion, convincing argumentation.</p>
      <p>In short, main contributions of this article are:
(1) A corpus for judging the winner side;
(2) Appropriate task setting for debate outcome prediction;
(3) Methods for the proposed task.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>In an essay, a debate or a discussion, the most important
issue is whether the argumentation can change people’s
stance or not [Tan et al., 2016]. Therefore,
argumentation mining has become focused on persuasion of an
1We expressed the words we defined originally in italic letter.
argumentation [Carlile et al., 2018; Persing and Ng, 2017;
Durmus and Cardie, 2018]. Persuasion research concentrates
on debates and discussions, and makes effort to measure
the absolute value of persuasiveness [Wei et al., 2016]
and to compare persuasiveness of an argument pair
[Habernal and Gurevych, 2016; Hidey and McKeown, 2018]
or rank arguments persuasivenesses [Tan et al., 2016;
Cano-Basave and He, 2016].</p>
      <p>However, argumentation itself is not enough to determine
the persuasiveness because arguments are usually being
rebutted. In order to achieve better winning side prediction, we
need to consider counterarguments [Habernal et al., 2018].
Counterargument retrieval is the major task in argumentation
mining that identifies attack or support relations of arguments
[Cocarascu and Toni, 2017]. In those studies, it was usual to
use prior topic knowledge, but [Wachsmuth et al., 2018]
proposed methods independent of knowledge. We have to
consider counterarguments prediction in the debate outcome
prediction task.</p>
      <p>There are several studies for predicting outcomes of a
debate. However, most of them focused on only one or
two themes [Strapparava et al., 2010]. Those research
cannot be applied to general debate in terms of persuasiveness.
To the authors? best knowledge, Potash and Rumshisky
are the first to study general debate outcome prediction
[Potash and Rumshisky, 2017]. They achieved the best
accuracy (71%) using a Recurrent Neural Network with
Attention architecture. The dataset they used is debates only with
the final favorability of audience. It is visible which stance,
For or Against, is more supported with the final
favorability, but audience supports is prone to bias or influenced by
preconceptions for a given debate theme. Their research did
not consider the audience bias, which has a large effect on
debate results. It is preferable to compare the final audience
favorability with the audience favorability before a debate
because audience usually has an opinion which changes during
the actual debate. In addition, Potash and Rumshisky did not
consider persuasiveness of each argument, which plays a
significant role in debates. To tackle these problems, we decided
to deal with debate outcome prediction with argument
persuasiveness and audience favorability before and after a debate.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Corpus</title>
      <p>This section introduces two corpora which we use for our
proposed task. First one is used in [Persing and Ng, 2017] for
predicting persuasiveness level of an argument. Second one
is for debate outcome prediction task. Our proposed method
for this task is novel, hence no corpus with annotated
winner’s side (before/after debating) had existed before.
Therefore, we constructed a new corpus, automatically retrieved
from idebate.org. This corpus contains a set of debates with
the debate outcomes. Besides, we define “winner” in debate
as the side which persuaded more people than the other side.</p>
      <sec id="sec-3-1">
        <title>3.1 Persing and Ng’s corpus</title>
        <p>Persing and Ng’s corpus contains a subset of 165 debates
extracted from idebate.org. Each debate includes Motion which
expresses the debate theme, and has 7.3 arguments on
average (1,208 arguments in total). Argument is an opinion on
the Motion, and every argument belongs to a stance (For or
Against). Arguments are divided into Assertions and
Justifications. Assertion is the debater’s main opinion written about
the reason why this person agrees or disagrees with Motion.
Justification explains the Assertion in detail usually with
references and logical explanation. Table 1 shows an example of
argument divided into components. In their corpus, Persing
and Ng annotated arguments with Argument Persuasiveness
(AP).</p>
        <p>AP is the persuasiveness score of arguments on a 6 point
rating scale, where 6 indicates that the argument is very
persuasive and clear, while 1 means that it is an unclear
argument. For example, AP of the argument shown in Table 1 is
6.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Our Corpus</title>
        <p>We retrieved the debate data from idebate.org, acquiring 321
debate themes (For: 148 and Against: 173), almost twice
more than in the case of corpus of [Persing and Ng, 2017]
and not overlapped with their corpus. Each debate theme has
7.55 arguments on average. On idebate.org, the third party
evaluation is performed by the site visitors for almost all of
the debates, and includes opinion rating before and after
reading a debate and these results are open to the public. There
are five evaluation categories: Strongly For (SF), Mildly For
(MF), Don’t Know (DK), Mildly Against (MA) and Strongly
Against (SA). The evaluation from the visitors is shown as a
percentage, and the evaluation of a debate on the topic “This
House believes Tennessee is correct to protect teachers who
wish to explore the merits of creationism” is shown in Table
2 as a example.</p>
        <p>From the data we obtain the debate outcome with the
following equation.</p>
        <p>2</p>
        <p>SF + M F
(2</p>
        <p>SA + M A)
(1)
To estimate whether arguments in a debate have changed
people’s stance or not, we subtract ”before” from ”after” values
(Equation (1)), and if the result is larger than zero, the For
side is assigned as the debate’s winner. For example, if we
substitute Equation (1) with values from Table 2, For side
wins because the result will be larger than zero, even though
the percentage of Against side is bigger than For.</p>
        <p>We also show arguments used in this example in Table 3.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Proposed Methods</title>
      <p>Our method for the task of debate outcome prediction is
divided into the following three steps: AP estimation, argument
similarity and discourse parsing. In these steps, we treat
persuasiveness estimation of a debate as a main source for the
result prediction via SVM [Cortes and Vapnik, 1995].</p>
      <p>AP estimation We calculate AP in the range of 1 to 6
for arguments in our corpus using the Persing and Ng’s
corpus as train data and by employing Support Vector Machine
(SVM) with ten features, which produced better results than
the method of [Persing and Ng, 2017]. The ten features are
as follows: number of grammar errors, subjectivity
indicators, number of first plural pronouns, number of citations,
number of content lemmas only in Justification, Assertion
Motion
Assertion
Justification</p>
      <p>This house believes Quebec should secede from Canada
International Law Mandates Quebec be allowed Independence
International law recognizes Quebec’s right to self-determination and denying them self-determination is therefore a
violation of international law. International law recognizes the right of all peoples to self-determination. The
international community has decided that it is oppressive to individuals to live under a government that is systematically
incapable or unwilling to protect them and their interests.[1] The Quebecois have been systematically denied
adequate representation in the federal government of Canada. Quebecois legislation protection their basic rights to retain
their language and culture have been met with contempt[2] and legal action by the federal Canadian government and
courts.[3] This is but one example of the very clear denial of basic representation and self-governance that afflicts the
Quebecois in Canada. Therefore, Quebec has the legal right to self-determination and independence in international
law.
[1] “Reference re Secession of Quebec”, Supreme Court of Canada, 1998, 2 S.C.R. 217,
&lt;http://scc.lexum.org/en/1998/1998scr2-217/1998scr2-217.html &gt;
[2] “Maxime Bernier on Quebec law: ‘We don’t need Bill 101”’, The Canadian Press, 4 February 2011,
&lt;http://www.ctv.ca/CTVNews/Canada/20110204/bernier-law-110204/ &gt;
[3] Hudon, R., ,,Bill 101”, The Canadian Encyclopedia, &lt;http://www.thecanadianencyclopedia.com/index.cfm?PgNm=
TC&amp;Params=A1ARTA0000744 &gt;
length, number of content lemmas only in Assertion,
number of words in Justification, number of subject matches in
discourse relation (between two sentences in an argument),
and number of transitional phrases in Justification.</p>
      <p>To confirm the error rate of AP estimation, we performed
10-fold cross-validation with Persing and Ng’s corpus, and
the AP estimation results are evaluated with two scoring
metrics: E, which is the error rate, and ME, which measures
the mean distance between a prediction and the correct value
of AP. Persuasiveness calculated with data from Persing and
Ng’s corpus resulted with E=0.64 and ME=1.18 when only
AP estimation was used. This result may be capable for
predicting debate outcome because the mean distance between a
prediction and the correct value of AP is approximately 1 (the
range of ME is 0 to 5).</p>
      <p>Argument similarity For retrieving counterarguments
against an argument, we use cosine similarity over word2vec
[Mikolov et al., 2013] trained with Google News dataset
between the Justification in an argument without stop words and
arguments in the opposing side within the same debate. In
order to obtain the best threshold of cosine similarity, we tested
all of our proposed methods while changing the threshold for
the debate outcome prediction task using our corpus. We
changed the threshold within 0.325 and 0.775 in increments
of 0.025. As a result, 0.55 performed the best, therefore we
set the threshold to this value.</p>
      <p>Discourse parsing We assumed that counterarguments can
be partially discovered with the additional help of transitional
phrases such as ”but” or ”because” appearing in Justification.</p>
      <p>Therefore, we extract a sentences proceeding and
following those transitional phrases using PDTB-styled discourse
parser [Lin et al., 2014].
4.1</p>
      <sec id="sec-4-1">
        <title>Basic SVM-based Approach</title>
        <p>In this method, we utilize AP in the debate outcome
prediction with SVM (default parameters).</p>
        <p>The input to SVM is a vector of AP. This vector is derived
from For APs and Against APs. We firstly make one vector
with eight elements: sorted For APs, and sorted Against APs.
In the next step, two vectors are concatenated. Finally, we
input the vector, which length is 16, and the result is calculated.
If the number of For/Against arguments is smaller than eight,
the vector was padded with zeroes.</p>
        <p>Results are computed with 4-cross validation, and this
evaluation procedure is identical in other experiments described
below.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Similarity-based Approach</title>
        <p>In this approach, we only use AP estimation and argument
similarity calculation. At first, the method estimates AP for
all arguments in a debate, and when an argument is analyzed
and its AP is lower than the AP of other arguments in the
opposing side and argument similarity is over the threshold,
we multiply AP of the argument with a value, automatically
decided as follows. A certain value changes by d which is the
distance between AP of the input argument and remaining
arguments. To obtain the highest accuracy of debate outcome
task, we tested this approach with following value V [d] which
is calculated by Equation (2).</p>
        <p>V [d] = SV
i (d
1)
(2)
where SV is the standard value within 0.8 and 1.0 in
increments of 0.01, and i is the intervals: 0.0025, 0.005,
0.01, 0.0125, 0.015, 0.0175 and 0.02. The best results were
achieved when SV =1 and i=0.0025, so that V [1]=1.0 and
V [5]=0.96. For example, in the case where the argument’s
AP equals 2, the similarity with the other argument is over
the threshold and AP is 5, V [5-2]=0.98 is multiplied by the
input argument’s AP: 2.</p>
        <p>Motion</p>
        <p>This House believes Tennessee is correct to protect teachers who wish to explore the merits of creationism
Freedom of speech should apply to teachers as much as anyone else
Teaching creationism as well as evolution gives students freedom to choose
The bill does not exclude evolution just allows room for other theories
Teachers should not have freedom to teach whatever they wish as fact
Children should have the freedom not to be misled
Tennessee is not seeking to protect freedom of speech
As it is not science creationism should not even be covered by the Tennessee law</p>
        <p>In addition, we tested the case of i=0, which means V [d]
does not depend on d, for comparison. In this case, the
highest result was achieved when SV was equal 0.99.
In this approach, we add discourse parsing to the Similarity
Approach. In Similarity Approach, we get the cosine
similarity between an argument’s Justification and other argument’s
Justification, but in this approach, we extract sentences which
are in discourse relations with discourse parsing, and
calculate the similarity between an argument’s Justification and
other’s sentences which are in discourse relations. The best
case of V [d] is when SV =0.92 and i=0.0025. In addition, we
tested the case of i=0 for comparison. The best accuracy was
achieved when SV was equal 0.92.
4.4</p>
      </sec>
      <sec id="sec-4-3">
        <title>PDTB-styled Approach 2 (PDTB2)</title>
        <p>In this approach, in addition to discourse parsing, we
calculate similarity of the first sentence in the Justification of
the other side argument because we assume that the first
sentence of Justification may be mentioned in the
counterargument. The best prediction results for V [d] are achieved when
SV =0.95/0.96 and i=0.0025. Additionally, we performed an
experiment for the case of i=0 and SV =0.93 obtained the
most accurate estimation.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation Results</title>
      <p>Researchers have not dealt with the task of debate outcome
prediction yet, hence three prediction methods using statistics
are proposed as baselines. First one is the median baseline (a).
This method predicts the debate winner by comparing median
of AP in For and Against stances. Two remaining baselines
use average (b) and summation (c) instead of median.</p>
      <p>Table 4 shows the accuracy of each system described
earlier. Variables (SV and i) are selected to achieve the best
accuracy. The rows indicating i=0 mean that the value which
is multiplied with AP is not altered by the distance of AP.
The best result is 60.1% achieved by PDTB2 method when
SV =0.93 and i=0. All of our proposed methods are
improved by approximately 0.07. Moreover, PDTB1 showed
accuracy superior to the Similarity Approach and PDTB2
accuracy is better than in the case of PDTB1. Therefore, it can
be said that the discourse relations are beneficial for
comparison of persuasiveness of an argument and the
counterarguments. However, the accuracy of all methods is higher when
the value of i is 0 than in the case where i &lt; 0. Moreover, the
value of i which provides the best results are 0.0025, hence</p>
      <sec id="sec-5-1">
        <title>Method</title>
        <p>Median</p>
        <p>Average
Summation</p>
        <p>Basic
Similarity</p>
      </sec>
      <sec id="sec-5-2">
        <title>PDTB1</title>
      </sec>
      <sec id="sec-5-3">
        <title>PDTB2</title>
      </sec>
      <sec id="sec-5-4">
        <title>Parameters None None None</title>
        <p>None</p>
        <p>SV=0.99, i=0
SV =1, i=0.0025</p>
        <p>SV =0.92, i=0
SV=0.92, i=0.0025</p>
        <p>SV =0.93, i=0
SV =0.95/0.96, i=0.0025
it might be better to set i to a smaller value. This also
suggests that the prediction accuracy does not necessarily depend
on the distance between persuasiveness of an argument and
counterargument.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and Future Work</title>
      <p>In this paper, we proposed a new appropriate task of debate
outcome prediction and four methods with cosine
similarity and PDTB-styled discourse parser were introduced. The
highest accuracy (60.1%) was achieved by PDTB2 method
showing that similarity between arguments (on both For or
Against sides) when combined with discourse parsing-based
information, are capable to improve the accuracy. However,
the results also suggest that debate outcome (decision on
which side has won the debate) may be independent from
the distance between persuasiveness of an argument and its
counterargument.</p>
      <p>In our experiments, we did not consider all discourse
information from the discourse parser, therefore we plan to
perform series of experiments to investigate the categories in
PDTB (e.g. “CONTINGENCY” or “COMPARISON”) with
our methods in order to improve the accuracy of our proposed
task further.</p>
      <p>In the corpora we use, an argument is separated into
Motion, Assertion and Justification components, but it is
rather artificial division. Therefore, there is a need for
an and automatic way to discover other elements of a
debate, e.g. Claim, Premise, Anecdote and Assumption
[Ajjour et al., 2017; Stab and Gurevych, 2014].</p>
      <p>We did not consider values from Equation (1) when
calculating the change before and after reading all arguments of a
debate. This change may differ in size depending on a theme,
therefore, in the next step we plan to examine the role of this
value when predicting debate outcome.</p>
      <p>In addition, both corpora include a clear stance (For or
Against), but in general arguments are not categorized in such
manner, therefore, we have to add stance prediction algorithm
similar to the one proposed by [Chen et al., 2018] in order to
be able to predict outcomes of debates from other resources.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Ajjour et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>Yamen</given-names>
            <surname>Ajjour</surname>
          </string-name>
          ,
          <string-name>
            <surname>Wei-Fan</surname>
            <given-names>Chen</given-names>
          </string-name>
          , Johannes Kiesel, Henning Wachsmuth, and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <article-title>Unit segmentation of argumentative texts</article-title>
          .
          <source>In Proceedings of the 4th Workshop on Argument Mining</source>
          , pages
          <fpage>118</fpage>
          -
          <lpage>128</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [
          <string-name>
            <surname>Al-Khatib</surname>
          </string-name>
          et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Khalid</given-names>
            <surname>Al-Khatib</surname>
          </string-name>
          , Henning Wachsmuth, Matthias Hagen, Jonas Ko¨hler, and Benno Stein.
          <article-title>Cross-domain mining of argumentative text through distant supervision</article-title>
          .
          <source>In Proceedings of the 2016</source>
          conference
          <article-title>of the north american chapter of the association for computational linguistics: human language technologies</article-title>
          , pages
          <fpage>1395</fpage>
          -
          <lpage>1404</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>[Cabrio and Villata</source>
          , 2018]
          <string-name>
            <given-names>Elena</given-names>
            <surname>Cabrio</surname>
          </string-name>
          and
          <string-name>
            <given-names>Serena</given-names>
            <surname>Villata</surname>
          </string-name>
          .
          <article-title>Five years of argument mining: a data-driven analysis</article-title>
          .
          <source>In IJCAI</source>
          , pages
          <fpage>5427</fpage>
          -
          <lpage>5433</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>[Cano-Basave and He</source>
          , 2016]
          <article-title>Amparo Elizabeth CanoBasave</article-title>
          and
          <string-name>
            <given-names>Yulan</given-names>
            <surname>He</surname>
          </string-name>
          .
          <article-title>A study of the impact of persuasive argumentation in political debates</article-title>
          .
          <source>In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , pages
          <fpage>1405</fpage>
          -
          <lpage>1413</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [Carlile et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Winston</given-names>
            <surname>Carlile</surname>
          </string-name>
          , Nishant Gurrapadi, Zixuan Ke, and
          <string-name>
            <given-names>Vincent</given-names>
            <surname>Ng</surname>
          </string-name>
          .
          <article-title>Give me more feedback: Annotating argument persuasiveness and related attributes in student essays</article-title>
          .
          <source>In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>1</volume>
          , pages
          <fpage>621</fpage>
          -
          <lpage>631</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [Chen et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Di</given-names>
            <surname>Chen</surname>
          </string-name>
          , Jiachen Du, Lidong Bing, and
          <string-name>
            <given-names>Ruifeng</given-names>
            <surname>Xu</surname>
          </string-name>
          .
          <article-title>Hybrid neural attention for agreement/disagreement inference in online debates</article-title>
          .
          <source>In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>665</fpage>
          -
          <lpage>670</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[Cocarascu and Toni</source>
          , 2017]
          <string-name>
            <given-names>Oana</given-names>
            <surname>Cocarascu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Francesca</given-names>
            <surname>Toni</surname>
          </string-name>
          .
          <article-title>Identifying attack and support argumentative relations using deep learning</article-title>
          .
          <source>In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>1374</fpage>
          -
          <lpage>1379</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>[Cortes and Vapnik</source>
          , 1995]
          <string-name>
            <given-names>Corinna</given-names>
            <surname>Cortes</surname>
          </string-name>
          and
          <string-name>
            <given-names>Vladimir</given-names>
            <surname>Vapnik</surname>
          </string-name>
          .
          <article-title>Support-vector networks</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>20</volume>
          (
          <issue>3</issue>
          ):
          <fpage>273</fpage>
          -
          <lpage>297</lpage>
          ,
          <year>Sep 1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <source>[Durmus and Cardie</source>
          , 2018]
          <string-name>
            <given-names>Esin</given-names>
            <surname>Durmus</surname>
          </string-name>
          and
          <string-name>
            <given-names>Claire</given-names>
            <surname>Cardie</surname>
          </string-name>
          .
          <article-title>Exploring the role of prior beliefs for argument persuasion</article-title>
          .
          <source>In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          (
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>1</volume>
          , pages
          <fpage>1035</fpage>
          -
          <lpage>1045</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [Farra et al.,
          <year>2015</year>
          ]
          <string-name>
            <given-names>Noura</given-names>
            <surname>Farra</surname>
          </string-name>
          , Swapna Somasundaran, and
          <string-name>
            <given-names>Jill</given-names>
            <surname>Burstein</surname>
          </string-name>
          .
          <article-title>Scoring persuasive essays using opinions and their targets</article-title>
          .
          <source>In Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications</source>
          , pages
          <fpage>64</fpage>
          -
          <lpage>74</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>[Green</surname>
          </string-name>
          et al.,
          <year>2014</year>
          ]
          <string-name>
            <given-names>Nancy</given-names>
            <surname>Green</surname>
          </string-name>
          , Kevin Ashley, Diane Litman, Chris Reed, and
          <string-name>
            <given-names>Vern</given-names>
            <surname>Walker</surname>
          </string-name>
          .
          <source>Proceedings of the first workshop on argumentation mining</source>
          .
          <source>In Proceedings of the First Workshop on Argumentation Mining. Association for Computational Linguistics</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>[Habernal and Gurevych</source>
          , 2016]
          <string-name>
            <given-names>Ivan</given-names>
            <surname>Habernal</surname>
          </string-name>
          and
          <string-name>
            <given-names>Iryna</given-names>
            <surname>Gurevych</surname>
          </string-name>
          .
          <article-title>Which argument is more convincing? analyzing and predicting convincingness of web arguments using bidirectional lstm</article-title>
          .
          <source>In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>1</volume>
          , pages
          <fpage>1589</fpage>
          -
          <lpage>1599</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [Habernal et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Ivan</given-names>
            <surname>Habernal</surname>
          </string-name>
          , Henning Wachsmuth, Iryna Gurevych, and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <article-title>Before name-calling: Dynamics and triggers of ad hominem fallacies in web argumentation</article-title>
          .
          <source>arXiv preprint arXiv:1802.06613</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[Hidey and McKeown</source>
          , 2018]
          <article-title>Christopher Thomas Hidey and Kathleen McKeown</article-title>
          .
          <article-title>Persuasive influence detection: The role of argument sequencing</article-title>
          .
          <source>In Thirty-Second AAAI Conference on Artificial Intelligence</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>[Lin</surname>
          </string-name>
          et al.,
          <year>2014</year>
          ]
          <string-name>
            <given-names>Ziheng</given-names>
            <surname>Lin</surname>
          </string-name>
          , Hwee Tou Ng, and
          <string-name>
            <surname>Min-Yen Kan</surname>
          </string-name>
          .
          <article-title>A pdtb-styled end-to-end discourse parser</article-title>
          .
          <source>Natural Language Engineering</source>
          ,
          <volume>20</volume>
          (
          <issue>2</issue>
          ):
          <fpage>151</fpage>
          -
          <lpage>184</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Mikolov et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Ilya Sutskever, Kai Chen, Greg S Corrado, and
          <string-name>
            <given-names>Jeff</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          , pages
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <source>[Persing and Ng</source>
          , 2017]
          <string-name>
            <given-names>Isaac</given-names>
            <surname>Persing</surname>
          </string-name>
          and
          <string-name>
            <given-names>Vincent</given-names>
            <surname>Ng</surname>
          </string-name>
          .
          <article-title>Why can't you convince me? modeling weaknesses in unpersuasive arguments</article-title>
          .
          <source>In IJCAI</source>
          , pages
          <fpage>4082</fpage>
          -
          <lpage>4088</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <source>[Potash and Rumshisky</source>
          , 2017]
          <string-name>
            <given-names>Peter</given-names>
            <surname>Potash</surname>
          </string-name>
          and
          <string-name>
            <given-names>Anna</given-names>
            <surname>Rumshisky</surname>
          </string-name>
          .
          <article-title>Towards debate automation: a recurrent model for predicting debate winners</article-title>
          .
          <source>In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>2465</fpage>
          -
          <lpage>2475</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <source>[Stab and Gurevych</source>
          , 2014]
          <string-name>
            <given-names>Christian</given-names>
            <surname>Stab</surname>
          </string-name>
          and
          <string-name>
            <given-names>Iryna</given-names>
            <surname>Gurevych</surname>
          </string-name>
          .
          <article-title>Annotating argument components and relations in persuasive essays</article-title>
          .
          <source>In Proceedings of COLING</source>
          <year>2014</year>
          ,
          <source>the 25th International Conference on Computational Linguistics: Technical Papers</source>
          , pages
          <fpage>1501</fpage>
          -
          <lpage>1510</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [Stab et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Christian</given-names>
            <surname>Stab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Tristan</given-names>
            <surname>Miller</surname>
          </string-name>
          , Benjamin Schiller, Pranav Rai, and
          <string-name>
            <given-names>Iryna</given-names>
            <surname>Gurevych</surname>
          </string-name>
          .
          <article-title>Cross-topic argument mining from heterogeneous sources</article-title>
          .
          <source>In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>3664</fpage>
          -
          <lpage>3674</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [Strapparava et al.,
          <year>2010</year>
          ]
          <string-name>
            <given-names>Carlo</given-names>
            <surname>Strapparava</surname>
          </string-name>
          , Marco Guerini, and
          <string-name>
            <given-names>Oliviero</given-names>
            <surname>Stock</surname>
          </string-name>
          .
          <article-title>Predicting persuasiveness in political discourses</article-title>
          .
          <source>In LREC</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [Tan et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Chenhao</given-names>
            <surname>Tan</surname>
          </string-name>
          , Vlad Niculae,
          <article-title>Cristian Danescu-Niculescu-</article-title>
          <string-name>
            <surname>Mizil</surname>
            ,
            <given-names>and Lillian</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>Winning arguments: Interaction dynamics and persuasion strategies in good-faith online discussions</article-title>
          .
          <source>In Proceedings of the 25th international conference on world wide web</source>
          , pages
          <fpage>613</fpage>
          -
          <lpage>624</lpage>
          . International World Wide Web Conferences Steering Committee,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>[Van</surname>
          </string-name>
          Eemeren et al.,
          <year>2014</year>
          ]
          <string-name>
            <surname>Frans H Van Eemeren</surname>
          </string-name>
          ,
          <string-name>
            <surname>Bart Garssen</surname>
            , Erik CW Krabbe,
            <given-names>A Francisca</given-names>
          </string-name>
          <string-name>
            <surname>Snoeck</surname>
            <given-names>Henkemans</given-names>
          </string-name>
          ,
          <source>Bart Verheij, and Jean HM Wagemans. Handbook of argumentation theory</source>
          .
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [Wachsmuth et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Henning</given-names>
            <surname>Wachsmuth</surname>
          </string-name>
          , Shahbaz Syed, and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <article-title>Retrieval of the best counterargument without prior topic knowledge</article-title>
          .
          <source>In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>1</volume>
          , pages
          <fpage>241</fpage>
          -
          <lpage>251</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [Wei et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Zhongyu</given-names>
            <surname>Wei</surname>
          </string-name>
          , Yang Liu, and
          <string-name>
            <given-names>Yi</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <article-title>Is this post persuasive? ranking argumentative comments in online forum</article-title>
          .
          <source>In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>2</volume>
          :
          <string-name>
            <surname>Short</surname>
            <given-names>Papers)</given-names>
          </string-name>
          , volume
          <volume>2</volume>
          , pages
          <fpage>195</fpage>
          -
          <lpage>200</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>