<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Combining Classification-centered and Relation-based Argument Mining Methods</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrew Henning</string-name>
          <email>andrew.henning@kcl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anthony P. Young</string-name>
          <email>peter.young@kcl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elizabeth Sklar</string-name>
          <email>esklar@lincoln.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Miles</string-name>
          <email>simon.miles@kcl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elizabeth Black</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Engineering, King's College London</institution>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Informatics, King's College London</institution>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Lincoln Institute for Agri-Food Technology, University of Lincoln</institution>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <fpage>135</fpage>
      <lpage>139</lpage>
      <abstract>
        <p>Two key tasks in argument mining (AM) are classification of argument components and identification of relations between argument components. Approaches to solving the argument component classification problem typically take a supervised learning approach, however a lack of suitable datasets makes this a challenge for identification of argument component relations. We propose a pipeline with a recurrent, branched structure that combines supervised learning of argument component classifications with NLP approaches to identification of argument component relations, with the aim of improving both classification of argument components (i.e. premises and claims) and identification of support relationships between components.</p>
      </abstract>
      <kwd-group>
        <kwd>Argument mining</kwd>
        <kwd>Computational argumentation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Argument mining (AM) is a relatively new field intersecting computational
argumentation, natural language processing (NLP), and machine learning. While
the primary goal of AM is simple – to extract arguments from raw text and
identify the relationships between them – researchers currently deploy sophisticated
systems as pipelines with stages that tackle relevant sub-tasks, like boundary
detection, component classification, and relation prediction [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Recently, AM has
garnered increased interest, with applications in fields such as law and medicine,
education, and social media (e.g. [
        <xref ref-type="bibr" rid="ref1 ref10 ref11 ref2 ref6 ref9">1,2,6,9,10,11</xref>
        ]). We aim to take raw textual data
from Wikipedia articles, classify its argument components as claims, premises,
both, or neither, and predict the support relationships between them. To do
this we propose a novel AM pipeline that leverages both supervised learning
approaches to argument component classification and NLP techniques for
identification of support relations in a branched, recurrent structure.
      </p>
      <p>
        Classification-centered models typically use supervised machine learning
algorithms to classify argument components as claims or premises (e.g. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]). But
these models often struggle with ambiguity when determining to which class an
argument component belongs, largely because an argument component’s
classification is highly dependent on its relation to other statements. Relation-based
models aim to predict relationships between argumentative statements (e.g. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]),
but are constrained by the granularity of the input data, such as sentences
versus clauses. This suggests they may not be suciently sensitive to account for
scenarios where argument components span multiple sentences, or where a single
complex sentence may contain many di↵erent argument components. Further,
relation-based models do not cope well with stand-alone argument components
that cannot be easily related to others.
      </p>
      <p>We describe ongoing work to develop an AM pipeline (Figure 1) that
combines classification-centered models with relation-based models to address these
problems. We claim that using relation-based methods to adjust preliminary
classification likelihoods can improve argument component classifications made
by machine learning algorithms alone. We also claim that a recurrent method
of combining argument components can overcome the input constraint problem
experienced by relation-based models. We argue that the branched, recurrent
structure proposed in Figure 1 can better discriminate argument components
over classification-centered designs and is more sensitive to the range of what
can serve as input to current relation-based models.</p>
      <p>
        While other works input text into a linear pipeline (e.g. [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ]) and combine
classification and relation-based methods (e.g. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]), we believe our approach is
the first to propose a branched, recurrent structure that aims to leverage
benefits of both while reducing drawbacks. Further, we propose a four-stage process
that provides: an extension to current shallow text classification methods called
part-of-speech-tying for classifying argument components using both context and
content-based features (Stage C); a novel method for creating argument relation
templates (Stage B); a novel method to improve argument component
classification by enriching their likelihood measures with additional relation-based
information (Stage C); and a method for taking argument relation templates
and adjusting them, given initial classifications made in Stage C (Stage D).
      </p>
    </sec>
    <sec id="sec-2">
      <title>Pipeline Architecture</title>
      <p>This section details the expected behaviour of di↵erent stages in our pipeline.
Due to space considerations, we will briefly describe the pipeline’s input and
output, but provide more detail of the critical stages.</p>
      <p>An Input Document will come as raw textual data taken from a Wikipedia
article. We will use the IBM Watson Debater dataset1 for argument component
classification training and overall evaluation, as it also uses Wikipedia data.</p>
      <p>Stage A: Segmentation will perform clause tokenization, which is the
smallest individual textual unit that argument components could possibly be.
Later in Stage D, unused argument components that cannot be mapped using
the argument relation template will be combined into new, disparate statements
and returned to the end of this stage to be used as input back into Stages B and
C, allowing consideration of argument components that span multiple clauses.</p>
      <p>Stage B: Templating will take the segmented text from Stage A as input
and output argument component classifications and an argument relation
template, which is a graph whose nodes and edges correspond to the segmented text
and their support relations, respectively.</p>
      <p>
        The purpose of constructing an argument relation template is to extract
classification information from the structure of the text, since edges represent
support relationships between argument components, the argument relation
template helps us to classify these components. Like Cocarascu and Toni [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we aim
to identify relations using LDA and sentiment analysis. However, we will extend
this idea with additional NLP techniques. First, we will perform LDA and topic
modelling to group statements by topic, since we assume related statements
are contained in the same topic. Second, between each pair of statements in
each topic, we will compute the mutual shared information and cosine similarity
scores, which provide a measure of how similar the statements are, given their
constituent words and usage. We assume related statements share similar
sentiment values, so will calculate the similarity of sentiment scores for each pair
of statements within a topic. Finally, we will track the distance between two
statements in the text by how many clauses separate them. We will connect two
statements together by taking the values from each step and combine them into
single metric m. If the value of m is greater than some threshold T , a support
relation exists between them. Although our assumptions may not hold in every
case, our idea is that by combining each score into a single metric and setting an
optimal threshold, two statements will still be linked. The direction of this edge
will be determined by comparing the values calculated from topic modelling. The
statement with the higher score will be considered a claim or both statement, as
higher scores may suggest closer adherence to a given topic.
      </p>
      <p>After we create the argument relation template, we will determine the
classification to which each node belongs for use in Stage C. Each node’s incoming
and outgoing support edges will indicate that node’s classification and will be
1 Available at: https://www.research.ibm.com/haifa/dept/vst/debating data.shtml,
last accessed 19 August 2019.
determined through pre-established classification rules. For instance, a node N
with Sinc &gt; 0, where Sinc is the number of incoming support edges, is classified
as either a claim or both statement.</p>
      <p>Stage C: Classification will take the individual statements from Stage A
and classify each statement by its role in the text as a claim, premise, neither,
or both. We first apply a supervised learning approach, trained on the IBM
Debater dataset, and aim to improve the classifications from this with information
from the argument relation template from Stage B. We have developed our own
shallow-learning based models called part-of-speech-tying (POST) that account
for both content and context, from we will test and select the best classifier.
POST models produce a likelihood measure for each class and express them as
tuples: tS = CL, P L, N L, BL , where CL, P L, N L, and BL, represent claim,
premise, neither and both likelihoods, respectively. From the tuples, we will
determine statements that are ambiguous, which we define as statements whose
likelihoods are close to the same value. We will factor in the relation-based
classifications output from Stage B by adding to or subtracting from the likelihoods
of all statements based on the amount of support relations of that statement’s
node in the argument relation template. For example, nodes with more incoming
edges are likely to be claims, so for those nodes, we will increase CL by some
weighted amount w. This should improve upon the ambiguous classification of
previous AM models.</p>
      <p>Stage D: Adjustment will take the argument relation template created
in Stage B and the argument component classifications from Stage C as input.
The template guides the identification of which classified statements from Stage
C could be combined together and returned to Stage A as new statements for
re-classification, provides a heuristic to determine which statements to evaluate
first for connectivity, and determines when the stage proceeds to final output. We
will first count the number of each type of argument component in each topic.
Second, starting with the topic which contains the greatest cumulative number
of argument components, we will compare di↵erent sub-trees in the template
for structures that closely match the numbers described by the collection of
argument component counts. When a match or close match is identified, we will
label the nodes with the appropriate text segments by re-calculating the mutual
information and cosine similarity scores, perform sentiment analysis, and factor
in textual distance similar to the procedure described in Stage B; however, in
this stage only relevant pairs are calculated depending on their classification.
Combinations of statements with impossible connectivity (i.e. two claims or two
premises) will be excluded. If any of the classified statements cannot be fit into
the graph, pairs of those statements will be concatenated together to form a
new statement and returned to Stage A, where the process repeats. We hope to
be able to use classification information to identify the most appropriate text
to feed back into Stage B. The pipeline terminates once a “best fit” has been
found, or a recurrence threshold showing no further progress is reached. Best fit
occurs when the template matches the classification information. This recurrent
approach should improve upon the input sensitivity problem experienced by
relation-based models, which will be evaluated in future work.</p>
      <p>Finally, an Output Document and Graph will be generated in a new
mark-up document along with the final graph for visualisation.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion and Future Work</title>
      <p>We have proposed an argument mining pipeline with a branched, recurrent
structure that combines elements of both classification-centered and relation-based
models. We believe that this structure will address ambiguity found in
component classification and input sensitivity in relation-based models which are not
suciently fine-grained; this claim will be evaluated fully in future work.</p>
      <p>Additionally, we intend to expand our pipeline to include support and attack
relations and plan to test the pipeline outside of the Wikipedia domain. Due to
the growing body of research in attention mechanisms for text classification, we
also intend to evaluate methods using recurrent neural networks with attention
or gate recurrent unit mechanisms in Stage C. Finally, we intend to apply our
pipeline to reasoning problems, such as finding winning arguments in text.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bosc</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabrio</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villata</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          : Tweeties Squabbling:
          <article-title>Positive and Negative Results in Applying Argument Mining on Social Media</article-title>
          .
          <source>In: Comp. Models of Arg</source>
          . pp.
          <fpage>21</fpage>
          -
          <lpage>32</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Boschi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Young</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joglekar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cammarota</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sastry</surname>
          </string-name>
          , N.:
          <article-title>Having the Last Word: Understanding How to Sample Discussions Online</article-title>
          . ArXiv preprint arXiV:
          <year>1906</year>
          .
          <volume>04148</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Carstens</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toni</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Using Argumentation to improve classification in Natural Language problems</article-title>
          .
          <source>ACM Trans. on Internet</source>
          Technology pp.
          <fpage>30</fpage>
          -
          <lpage>41</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cocarascu</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toni</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Detecting Deceptive Reviews Using Argumentation</article-title>
          .
          <source>In: Proc. of the 1st Int. Works. on AI for Privacy and Security</source>
          . pp.
          <volume>9</volume>
          :
          <fpage>1</fpage>
          -
          <issue>9</issue>
          :
          <issue>8</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Galassi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lippi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torroni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Argumentative Link Prediction using Residual Networks and Multi-Objective Learning</article-title>
          .
          <source>In: Proc. of the 5th Works. on Arg. Mining</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . ACL (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Haddadan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabrio</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villata</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Yes, we can! Mining Arguments in 50 Years of US Presidential Campaign Debates</article-title>
          .
          <source>In: Proc. of the 57th Conf. of the Assc. for Comp. Ling</source>
          . pp.
          <fpage>4684</fpage>
          -
          <lpage>4690</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Lippi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torroni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Argument mining: A machine learning perspective</article-title>
          .
          <source>In: Int. Works. on Theory and App. of Formal Arg</source>
          . pp.
          <fpage>163</fpage>
          -
          <lpage>176</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lippi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torroni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Margot: A web server for argumentation mining</article-title>
          .
          <source>Exp. Sys. with App</source>
          . pp.
          <fpage>292</fpage>
          -
          <lpage>303</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Mayer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabrio</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lippi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torroni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villata</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          : Argument Mining on Clinical Trials p.
          <volume>12</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Moens</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          :
          <article-title>Argumentation Mining: Where Are We Now, Where Do We Want to Be and How Do We Get There? In: Post-Proc. of the 4th and 5th Works. of the Forum for Inf</article-title>
          .
          <source>Ret. Eval</source>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Stab</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gurevych</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Parsing argumentation structures in persuasive essays</article-title>
          .
          <source>Journ. of Comp</source>
          . Ling. (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>