<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ECNU at MC2 2018 Task 2: Mining Opinion Argumentation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jie Zhou</string-name>
          <email>jzhou@ica.stc.sh.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Qi Zhang</string-name>
          <email>qizhang@ica.stc.sh.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Qinmin Hu</string-name>
          <email>huqinmin@gmail.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Liang He</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Shanghai Key Laboratory of Multidimensional Information Processing East China Normal University</institution>
          ,
          <addr-line>500 Dongchuan Road, Shanghai, 200241</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes our participation in MC2 2018 task2: mining opinion argumentation. We build a tweet retrieval system, which is mainly composed by four parts: data preprocessing, retrieval, redundancy detection and reranking. Only the highly relevant and argumentive tweets are sent to the user based on the topics. In addition, three state-of-the-art information retrieval models as BB2 model, PL2 model and DFR model are utilized. The retrieval results are combined for nal delivery.</p>
      </abstract>
      <kwd-group>
        <kwd>opinion argumentation</kwd>
        <kwd>retrieval</kwd>
        <kwd>argumentative ranking</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        An argumentation is, broadly speaking, a claim supported by evidence [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In
corpus-based text analysis, argumentation mining is a new problem that
addresses the challenging task of automatically identifying the justi cations provided by
opinion holders for their judgment. Several approaches of argumentation mining
have been proposed so far in areas such as legal documents, on-line debates,
product reviews, newspaper articles and court cases, as well as in dialogical
domains [
        <xref ref-type="bibr" rid="ref10 ref6 ref8">8, 10, 6</xref>
        ].
      </p>
      <p>
        There are situations where the information we need to retrieve from a set of
documents is expressed in the form of arguments. Recent advances in
argumentation mining pave the way for a new type of ranking that addresses such situations
and can positively reduce the set of documents one needs to access in order to
obtain a satisfactory overview of a given topic. We build a proof-of-concept
argumentative ranking prototype. We found that the results it provides signi cantly
di er from and possibly improve those returned by an argumentation-agnostic
search engine. Argumentative ranking does indeed provide results that are quite
di erent from those that are obtained by a \traditional" search engine. In this
task, relevant information is expressed in the form of arguments [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Success of such argumentation ranking will require interdisciplinary
approaches based on the combination of di erent research issues. In fact, to better
understand a short text and be able to detect the argumentative structures within
a microblog, we could restore a \text contextualization" as a way to provide
more information on the corresponding text [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Providing such information
in order to detect argumentative tweets would highlight relevant ones. In other
words, tweets expressed in the form of arguments. Thus, argumentation mining
in this situation will tend to act in the same way of an Information Retrieval
(IR) system where potential argumentative tweets had to come rst. A similar
approach that addresses such a purpose is presented in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where the output
of the priority task will be a ranking of tweets according to their probability of
being a potential threat to the reputation of some entity.
      </p>
      <p>
        In this task, given a set of festivals name from most popular festivals on
FlickR English and French language, participants have to search for the most
argumentative tweets in a collection covering 18 months of news about festivals
in di erent languages [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The identi ed tweets have to be a summary of ranked
tweets according to their probability of being argumentative tweets. Such sets
of tweets could be treated easier by priority, by a festival organiser. For each
language ( English and French ), a monolingual scenario is expected : Given
a festival name from a topic le, participants have to to search for the set of
most argumentative tweets in the same query language within the microblog
collection.
      </p>
      <p>The reminder of the paper is organized as follows. Section 2 describes our
approach. In Section 3, experimental results are presented. Finally, the paper is
concluded in Section 4.</p>
      <p>Related Tweets</p>
      <p>Redundancy</p>
      <p>Detection
Topics
Results
Retrieval</p>
      <p>Data
Preprocessing</p>
      <p>Corpus</p>
      <p>Reranking
Argumentative</p>
      <p>Vocabulary
In this section, we demonstrate the architecture of our system, which is shown
in Figure 1. It shows that our system mainly consists of four parts, namely data
preprocessing, retrieval, redundancy detection and reranking. The details of each
part are demonstrated in the following sections.</p>
      <p>Data Preprocessing Before we start to run the system, we preprocess the
dataset. We rst solve the tweets as follow steps:
{ Converting the letter in tweet to lowercase letters.
{ Turning several spaces into one space.
{ Replacing http:// or https:// in tweet with `&lt;URL&gt;`.
{ Replacing @USERNAME in tweet with `&lt;USERNAME&gt;'.
{ Replacing number in tweet with `&lt;NUMBER&gt;'.
{ Replacing repeated character sequences of length 3 or greater with sequences
of length 3.</p>
      <p>
        { Removing punctuation in tweet
Then, we use NLTK for tokenization, stemming and splitting the sentences.
Retrieval With the daily tweet stream, we leverage the Terrier search engine [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]
for indexing and retrieval. Three state-of-the-art information retrieval(IR)
models, namely the BB2 model, the PL2 model and the DFR BM25 model [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], are
utilized for this task. Speci cally, with the three IR models, we can obtain three
scores for a tuple as (Topic, Tweet). Each IR model returns 3000 most related
tweets.
      </p>
      <p>By assuming that di erent retrieval models may compensate each other by
combination, we do a linear combination of the scores to obtain better
performance.</p>
      <p>Redundancy Detection Since the pushed tweets are expected to cover a
variety of arguments given by a user about a culture event, we delete identical tweets
through the similarity between two tweets. Speci cally, when a candidate
tweets speci c to a topic, we devise a redundancy detection strategy to determine
whether it is redundant or not. To calculate the similarity score between two
tweets, we rst obtain the corresponding words set as S(T1) and S(T2). Then,
the similarity score Score(T1; T2) is formulated as:</p>
      <p>Score(T1; T2) = jS(T1) \ S(T2)j
jS(T1) [ S(T2)j
(1)
where S(T1) \ S(T2) is the intersection of S(T1) and S(T2), S(T1) [ S(T2)
represents the union of S(T1) and S(T2), j j denotes the size of the set. If Score(T1; T2)
is large than the threshold , we determine there are redundant.
Reranking We rerank the related tweets by considering whether the tweet
contains the topic, the length of the tweets and the number of argumentative words
in tweets. In order to obtain lexical feature, we download some English
argumentative vocabularies (e.g. admirable,cool, admire, adorable adore, advantage
and so on) and combine them together. For French, we translate the English
vocabulary into French through Google translation API. Finally, we rerank the
tweet T the for topic T opic according to the following function:
f (T; T opic) =
+</p>
      <p>Tlength + (1
) Narg
(2)
8 0; Topic is not in T
= &lt; 1; Topic is in T and is continuous
: ; Topic is in T and is not continuous
(3)
where 2 [0; 1] represents whether topic T opic contained in tweet T and whether
the topic is continuous in tweet, Tlength is the length of the tweet T after
normalizing, Narg denotes the number of words in argumentative vocabulary after
normalizing, 2 [0; 1] represents the weight between Tlength and Narg.
The complete stream of 70,000,000 microblogs is available. English and French
are a respectively 12 and 4 festival name. They represent a set of some popular
festivals on FlickR for which we have pictures. Topics were carefully selected by
the organizer to ensure that selected topics have enough related argumentative
tweets in our corpus. Such manual selection was conduct to to ensure a possible
evaluation.
3.2</p>
      <p>
        Evaluation
The o cial evaluation measures planned are: NDCG and Pyramid.
{ NDCG This ranking measures will give a score for each retrieved tweet with
a discount function over the rank. As we are mostly interested in top ranked
arguments, this ranking measures meet our expectation. This measure was
also used in TREC Microblog Track [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. A tweet is :
- Highly relevant when it is a personal tweet with an argument that
directly referred to the festival noun (topic) and may contain more then one
justi cation .
- Relevant when it comportes at least two of graduation criteria cited above
- Not relevant if no graduation criteria was found
- Exemple of tweet gradution
{ Pyramid [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] This evaluation protocol was chosen to evaluate how much the
identi ed set of argumentative tweets about a festival name is diversi ed.
In fact, participant results are expected to cover a variety of arguments
given by a user about a culture event. Such an evaluation protocol will allow
us to determine if the identi ed summary of ranked tweets expresses the
same content in di erent words or involve di erent arguments about a given
festival name.
The experiment results are shown in Table 1 and Table 2. Our observation shows
that the proposed model works better than baseline in most cases.
4
      </p>
    </sec>
    <sec id="sec-2">
      <title>Conclusions</title>
      <p>In this paper, we present our work in two scenarios of the MC2 2018 task2 mining
opinion argumentation . We build a tweet retrieval system. It mainly performs
four steps to determine whether to push a tweet or not. We apply three
stateof-the-art IR models for search. Various retrieval results are combined for nal
delivery. Noting that the combination strategy does not work very well, we will
extract more useful features and focus on the learning to rank approaches in the
future.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Amati</surname>
          </string-name>
          , G.,
          <string-name>
            <surname>Van Rijsbergen</surname>
            ,
            <given-names>C.J.</given-names>
          </string-name>
          :
          <article-title>Probabilistic models of information retrieval based on measuring the divergence from randomness</article-title>
          .
          <source>ACM Transactions on Information Systems (TOIS) 20(4)</source>
          ,
          <volume>357</volume>
          {
          <fpage>389</fpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Amigo</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Albornoz</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chugur</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corujo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart n</surname>
          </string-name>
          , T.,
          <string-name>
            <surname>Meij</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Rijke</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spina</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : Overview of replab 2013:
          <article-title>Evaluating online reputation monitoring systems</article-title>
          . In:
          <article-title>International Conference of the Cross-Language Evaluation Forum for European Languages</article-title>
          . pp.
          <volume>333</volume>
          {
          <fpage>352</fpage>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bellot</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moriceau</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SanJuan</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>Tannier</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Inex tweet contextualization task: Evaluation, results and lesson learned</article-title>
          .
          <source>Information Processing &amp; Management</source>
          <volume>52</volume>
          (
          <issue>5</issue>
          ),
          <volume>801</volume>
          {
          <fpage>819</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Latiri</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cossu</surname>
            ,
            <given-names>J.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Latiri</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SanJuan</surname>
          </string-name>
          , E.:
          <article-title>Clef 2018</article-title>
          .
          <article-title>International Conference of the Cross-Language Evaluation Forum for European Languages Proceedings LNCS (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Efron</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sherman</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Voorhees</surname>
          </string-name>
          , E.:
          <article-title>Overview of the trec-2015 microblog track</article-title>
          .
          <source>Tech. rep. (</source>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lippi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarti</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torroni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Argumentative ranking
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Nenkova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Passonneau</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McKeown</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>The pyramid method: Incorporating human content selection variation in summarization evaluation</article-title>
          .
          <source>ACM Transactions on Speech and Language Processing (TSLP) 4</source>
          (
          <issue>2</issue>
          ),
          <volume>4</volume>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Oraby</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reed</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Compton</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rilo</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walker</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Whittaker</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>And that's a fact: Distinguishing factual and emotional argumentation in online dialogue</article-title>
          .
          <source>NAACL HLT</source>
          <year>2015</year>
          p.
          <volume>116</volume>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Ounis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amati</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Plachouras</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Macdonald</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
          </string-name>
          , D.:
          <article-title>Terrier information retrieval platform</article-title>
          .
          <source>In: European Conference on Information Retrieval</source>
          . pp.
          <volume>517</volume>
          {
          <fpage>519</fpage>
          . Springer (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Daxenberger</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kahse</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gurevych</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <article-title>: Multi-task learning for argumentation mining in low-resource settings (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>