<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>CLEF MC2 2018 Lab: Technical 0verview of Cross Language Microblog Search and Argumentative Mining</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jean Valre Cossu</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Julio Gonzalo</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Malek Hajjem</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olivier Hamon</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chiraz Latiri</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eric SanJuan</string-name>
          <email>eric.sanjuang@univ-avignon.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>LIA, Avignon University</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LIPAH, Tunis El Manar University</institution>
          ,
          <country country="TN">Tunisia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>My Local In uence</institution>
          ,
          <addr-line>Aubagne</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Syllabs</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>UNED</institution>
          ,
          <addr-line>Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>MC2 lab mainly focuses on developing processing methods and resources to mine the social media (SM) spheres surrounding cultural events such as festivals, music, books, movies and museums. Two main tasks and one pilot ran in 2018. The rst task was speci c to movies. Topics were extracted from the French VodKaster website that allows readers to get personal short comments (microcritics) about movies. The challenge was to nd related microblogs in four di erent languages in a large archive. The second task, argumentation mining, aimed to automatically identify reason-conclusion structures that can lead to model social web users positions about a cultural event expressed via Twitter microblogs. The idea was to perform a search process on a massive microblog collection that focuses on claims about a given festival. A pilot task was also launched on a new corpus, extending the 2017 language recognition task to handle also dialects.</p>
      </abstract>
      <kwd-group>
        <kwd>Argumentation Mining</kwd>
        <kwd>Microblog Search</kwd>
        <kwd>Cross Language Information Retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>however, they are less speci c to movies and harder to nd. The usual case is to
display to the reader a concise summary of microblogs related to the microcritics
he/she is reading, considering bilingual and trilingual users that would read
microblogs in other languages than French. Summaries were exclusively made of
extracts from microblog contents and should include authors' names if considered
informative, and have to be readable. Codes like external URLs references to
multimedia objects had to be removed as well. Summaries were intended to
provide an idea of all relevant information included in the corpus, and diversity
among top ranked microblogs was considered important.</p>
      <p>Task 2 was about Argumentation Mining, a new problem in corpus-based text
analysis that addresses the challenging task of automatically identifying the
justi cations provided by opinion holders for their judgment. Several approaches
of argumentation mining have been proposed so far in areas such as legal
documents, online debates, product reviews, newspaper articles and court cases, as
well as in dialogical domains. With the popularization of social networks,
argumentation mining is considered as an extension of the opinion mining issue from
social network content. The aim is to automatically identify reason-conclusion
structures that can lead to model social web users positions about a service or
an event expressed through social network platforms like Twitter. Indeed, when
we need to form an opinion on a new topic or make a decision, arguments is
what we are looking for, rather than a mere aggregation of sentiment or stance.
To make argumentation structures available, in the case of Twitter, robust
automatic recognition is required. However, the ambiguity of natural language text
produced in social media, the di erent writing styles, the lack of proper
syntax, the large amount of implicit context and the heterogeneity of sources make
argumentation mining on Twitter a very challenging problem.</p>
      <p>
        Another possible way to identify the argumentation structures from a generic
tweet corpus, is to use approaches based on information extraction. The idea is
to perform a search process that focuses on claims about a given topic within
a massive collection. This approach relates to the eld of focused retrieval, that
aims to provide users with direct access to relevant information in retrieved
documents. In this task, relevant information is expressed in the form of
arguments [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        As in previous MC2 editions, registered participants were given access to
the microblog collection[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] provided by ANR project GAFES7 with their
metainformation and expanded URLs on a MySQL server. Due to legal terms, the
access to this database is restricted to registered participants under a privacy
agreement.
      </p>
      <p>These two tasks are fully described in the remainder of the paper.</p>
    </sec>
    <sec id="sec-2">
      <title>7 http://anr-gafes.univ-avignon.fr/</title>
      <p>Task 1: Cross-Language cultural microblog search
Vodkaster8 is a French social network about movies where participants can share
comments about movies under the form of microcritics no longer than a
microblog. The main di erences are the restricted cultural domains and the form.
The objective of the task is for a given movie or microcritic language among
French, English, Spanish, Portuguese and Arabic to provide a summary of the
related microblogs.</p>
      <p>Microblogs included in a summary should provide relevant information about
at least one of the following aspects:
{ The lm mentioned in the microcritic includes a subject, genre, presence
in festivals, reception, audience, critics or opinions, as well as actors and
producers careers.
{ Events such as festivals mentioned in the microcritics if any, including
opinions and narratives.
{ Comments and critics in Twitter similar to those in the microcritic if any.</p>
      <p>Extended summaries can include microblogs about closely related lms and
events.
{ If promotional, automatic microblogs or retweets are not considered as
relevant. However, retweets by movie a cionados or movie makers are considered
relevant.
2.1</p>
      <sec id="sec-2-1">
        <title>Use Case</title>
        <p>The task's use case is to display a concise summary of microblogs to a (native
French) reader that are related to the microcritics he/she is reading, considering
bilingual and trilingual users that could read microblogs in other languages than
French. Summaries are exclusively made of extracts from microblog contents and
may include authors' names if this additional piece of information is considered as
relevant and informative. Automatically produced summaries should be readable
and coded items like external URLs and references to multimedia objects should
be removed. Three di erent summary lengths in words are considered: 50, 150
and up to 250.</p>
        <p>Summaries are intended to provide an idea of all relevant information
included in the corpus. Diversity among top ranked microblogs is important. If
the summary does not provide any microblog directly related to the topic, it is
implicitly suggesting that there is no relevant information in the corpus.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Topics</title>
        <p>Topics represent a selection from VodKaster microcritics in French mentioning
the term festival. Each topic contains:
{ A topic ID,</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>8 http://www.vodkaster.com/</title>
      <p>{ A title made of the movie name,
{ A narrative showing a microcritic about the movie,
{ A list of nuggets (i.e terms and expressions) manually extracted from
microcritic.</p>
      <p>To facilitate data exploration, an Indri index with a web interface has been
provided to query the whole set of microblogs. Online Indri indexes are also
available.
2.3</p>
      <sec id="sec-3-1">
        <title>Results</title>
        <p>
          Runs are evaluated according to their informativeness following INEX Tweets
Contextualisation [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] guidelines. Seven teams registered for this task, but only
one team (A collaboration between Chedi Bechikh Ali from the Institut Suprieur
de Gestion, Universit de Tunis, Tunisia, and Hatem Haddad from the Universit
Libre de Bruxelles) managed to submit 3 complete runs. A Baseline was
generated based on Indri index. Both the baseline and the index were shared with
participants.
        </p>
        <p>
          A multilingual reference of 2887 unique textual contents that could be
considered of interest by Vodkaster's users according to community managers has
been manually extracted from the corpus. All microblogs in this reference contain
personal opinions about movies or related festivals. Among them, only 229 could
be related to topics in the queries. We used a large textual reference
characterizing interestingness and a reduced reference about relevant microblogs, and then
applied INEX Tweet Contextualisation [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] methodology to compare participant
runs with the provided baseline.
        </p>
        <p>All three runs from the only participant outperformed the baseline. Three
approaches were experimented. One (FR-FR) without translation, another with
translation to English (FR-EN) and a third one using a French to English
dictionary. In terms of interestingness, the monolingual approach (FR-FR) did better,
which is coherent with the fact that the majority of Vodkaster users express
themselves in French. However, the translation approach (FR-EN) outperformed
all others on relevancy. This is again coherent with the fact that a majority of
microblogs in the corpus are in English. Very speci c relevant microblogs can be
found but not in the query original language.</p>
        <p>Table 2.3 shows interestingness and informativeness results for baseline and
participant runs using the context-eval.pl9 program.
3</p>
        <p>Task 2: Mining Opinion Argumentations
Topics for this task are a selection of festival names which are popular on Flickr
in English (14) and French (4). Participants have to search for the most
argumentative tweets in the same collection of microblogs used for Task 1. The identi ed</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>9 http://tc.talne.eu</title>
      <p>
        microblogs must be ranked according to their probability of being
argumentative. This use case was proposed to help festival organisers deal with online
opinions about their festival nding out not only what people liked/disliked but,
most importantly, why. For each language (English and French), a monolingual
scenario is expected. Diversity in the rank is not required, because an argument
that is frequently repeated is assumed to be of higher priority.
To express argumentation, users tend to employ a speci c list of argumentative
keywords [
        <xref ref-type="bibr" rid="ref1 ref6 ref8">1, 8, 6</xref>
        ]. For example:
{ More, less : to compare and contrast ideas
{ Pronouns like my, mine, myself,I are used to make their statement sound
more objective.
{ Verbs like believe, think, agree, should, could play an important role to
identify argument components and express what users were expecting.
{ Adverbs like also,often or really emphasize the importance of some premise.
We also observed that some expressions (such as because ! coz) could be
normalized to match a higher number of microblogs. These lexical features about
opinion and argumentation were provided to participants.
3.3
Argumentative mining received considerable interest, with 31 registered
participants. However, only 5 teams submitted a total of 18 runs per language.
Organizer baselines were added to this pool as well. The NDGC has been adopted as
the main o cial measure; however, precision at 100 gives the same rankings.
      </p>
      <p>Two reference sets of argumentative structures were represented as regular
expressions and have been assigned to each query (festival name). The rst
reference of 97 distinct regular expressions has been extracted a priori from
the manual interactive run provided as baseline. The second one contains 77
expressions and has been extracted from participant runs. To avoid duplicated
content, only microblog textual content has been considered. All meta-data such
as URLs, #hashtags and @replies were removed.</p>
      <p>These steps were both applied to the English and French runs. Table 3.3
provides examples of extracted regular expressions.
almost all queries, there was no overlap with argumentative microblogs found in
the baseline runs.</p>
      <p>Teams relying on language models using queries mixing multiword terms
with argumentative connectors found less argumentative microblogs, but a larger
overlap with the reference extracted from the baseline run. This was the case
of the LIA Team, which found the best overlap with the reference of
organizers by using a convolutional neural network. As no labeled data was provided,
participants from this team constructed their own training dataset.
Concerning ECNUica team, they experimented various re-ranking strategies. Finally the
ISAMM team experimented with a combination of Information Retrieval, Topic
Modeling and Opinion Mining techniques.
The initial challenge for 2018 was, given a short movie review on the French
VodKaster10 Social Media site, to nd related microblogs in the MC2 corpus
in four di erent target languages (French, English, Spanish and Portuguese).
Browsing the VodKaster website, French readers got personal short comments
about movies. Since similar posts can be found on Twitter, we decided to display
10 http://www.vodkaster.com/
to the reader a concise summary of microblogs related to the comment he/she is
reading, considering bilingual and trilingual users that would read microblogs in
other languages than French. In this scenario, personal and argumentative
microblogs are expected to be more relevant than news or o cial announcements.
Microblogs sharing similar arguments can be considered as highly relevant even
though they are about di erent movies. In addition, a second task was created
focusing on argument mining in a multilingual collection. It consisted in
nding personal and argumentative microblogs in the corpus. Public posts about
cultural events such as festivals are frequently promotional announcements by
organizers or artists. Personal argumentative microblogs about speci c festivals,
in contrast, provide real insights into public reception but both their variety
and sparsity make them di cult to locate and aggregate. Argumentative mining
attracted most of the participants' e orts in this edition of the MC2 CLEF Lab.
The cold start scenario of nding them without any speci c learning resources
motivated the use of IR approaches based on language models or specialized
linguistic resources.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sliwa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lui</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borad</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ziyaei</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghobadi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>What works and what does not: Classi er and feature analysis for argument mining</article-title>
          .
          <source>In: Proceedings of the 4th Workshop on Argument Mining</source>
          ,
          <string-name>
            <surname>ArgMining@EMNLP</surname>
          </string-name>
          <year>2017</year>
          . pp.
          <volume>91</volume>
          {
          <fpage>96</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bellot</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moriceau</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SanJuan</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>Tannier</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>INEX tweet contextualization task: Evaluation, results and lesson learned</article-title>
          .
          <source>Inf. Process. Manage</source>
          .
          <volume>52</volume>
          (
          <issue>5</issue>
          ),
          <volume>801</volume>
          {
          <fpage>819</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mulhem</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murtagh</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SanJuan</surname>
          </string-name>
          , E.:
          <article-title>Overview of the CLEF 2016 cultural micro-blog contextualization workshop</article-title>
          . In: Experimental IR Meets Multilinguality, Multimodality, and Interaction - 7th
          <source>International Conference of the CLEF Association, CLEF 2016, Proceedings. Lecture Notes in Computer Science</source>
          , vol.
          <volume>9822</volume>
          , pp.
          <volume>371</volume>
          {
          <fpage>378</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mulhem</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SanJuan</surname>
          </string-name>
          , E.:
          <article-title>Building evaluation datasets for cultural microblog retrieval</article-title>
          .
          <source>In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation</source>
          ,
          <string-name>
            <surname>LREC</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>European Language Resources Association</article-title>
          (ELRA) (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mulhem</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SanJuan</surname>
          </string-name>
          , E.:
          <article-title>CLEF 2017 MC2 search and time line tasks overview</article-title>
          .
          <source>In: Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum</source>
          , Dublin, Ireland,
          <source>September 11-14</source>
          ,
          <year>2017</year>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Mining and summarizing customer reviews</article-title>
          .
          <source>In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          . pp.
          <volume>168</volume>
          {
          <fpage>177</fpage>
          . KDD '04,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Lippi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torroni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Argumentation mining: State of the art and emerging trends</article-title>
          .
          <source>ACM Trans. Internet Technol</source>
          .
          <volume>16</volume>
          (
          <issue>2</issue>
          ),
          <volume>10</volume>
          :1{
          <fpage>10</fpage>
          :25 (Mar
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Stab</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gurevych</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Identifying argumentative discourse structures in persuasive essays</article-title>
          .
          <source>In: EMNLP</source>
          . pp.
          <volume>46</volume>
          {
          <issue>56</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Warriner</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuperman</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brysbaert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Norms of valence, arousal, and dominance for 13,915 english lemmas</article-title>
          .
          <source>Behavior Research Methods</source>
          <volume>45</volume>
          (
          <issue>4</issue>
          ),
          <volume>1191</volume>
          {1207 (Dec
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>