<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>UPV-INAOE-Autoritas - Check That: Preliminary Approach for Checking Worthiness of Claims</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Bilal Ghanem</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Montes-y-Gomez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francisco Rangel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Rosso</string-name>
          <email>prosso@dsic</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Autoritas Consulting</institution>
          ,
          <addr-line>Valencia</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Instituto Nacional de Astrof sica,Optica y Electronica (INAOE)</institution>
          ,
          <addr-line>Puebla</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>PRHLT Research Center, Universitat Politecnica de Valencia</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Journalists usually work for a long time to investigate presidential debates. Their main role is to extract the sentences in the debates that include information about facts or previous events. These sentences are called claims. The investigation process of these claims is important where it can reveal how credible is the speaker or the other candidates. Therefore, proposing systems for extracting these claims can certainly improve the press work. In this paper, we will present our approach for CLEF-2018 Check That lab for Task 1. We propose an approach that uses a text distortion technique to detect claims that are worthy for checking. Our approach has achieved an acceptable results taking into account the complexity of this task.</p>
      </abstract>
      <kwd-group>
        <kwd>Factual Claims</kwd>
        <kwd>English</kwd>
        <kwd>Arabic</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In the recent years, the political situation in many countries started to be more
complex, which made the politicians started to exchange accusations in the
public political debates, especially in the presidential debates. The prevailing
situation in any debate is that each presidential candidate has a short period to
respond to the claims of the other candidates. Each of them has the right to
accuse the other with di erent claims, where the main objective is to convince
the audience about his ability for that political position. During these long
debates, the journalists' role is to investigate and validate the mutual accusations
between those candidates to reveal the truth behind each claim. This task is
complex to be done manually and in a short time, since the debates are very
long, and each candidate declares many claims. Many of these claims are just
opinions while the others contain previous facts that happened before. Starting
from this issue, recent research topics are moving towards detecting check worthy
claims in the presidential debates, where automatic approaches will save a lot of
time to journalists for this process. In this paper, we present our approach for
detecting claims that are worthy for checking, we tried to predict factual claims
by highlighting speci c cue words in a classi cation process. In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] we presented
our approach for task 2 (checking claims factuality). In the following section, we
present the previous works in the literature, then in Section 3, we describe the
task. In Section 4, we describe our approach. Section 5 presents the experimental
results, and nally in Section 6 we draw some conclusions and discuss further
work.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        The main orientation in the literature was focused more on another related issue
on claims, about detecting their factuality. Some researchers have proposed
multiple approaches using di erent types of features to handle this issue. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] the
authors have used a tree kernel approach to detect claims in the argument
mining domain. Their approach used consistency trees of sentences to detect claims
by capturing the trees similarities. They employed a support vector machine
(SVM). In a di erent direction, a set of textual features to detect worthy claims
in the presidential debates was proposed in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. They used sentiment polarity,
number of words in a sentence, Bag-of-Words (BOW) using the Tf-Idf weighting
scheme, Part-of-Speech tags, and named entities types. Since they generated a
large feature set, they used a feature selection technique to extract the top N
important features. Finally, di erent classi ers were tested, where the Random
Forest classi er shows better results. Another related approach was proposed
in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], similarly, also for the presidential debates. The authors proposed di erent
types of features, such as sentence-level features (sentiment, named entities,
linguistic features etc.), contextual features (the position of a claim in the debate),
and other mixed features such as text embedding, the discussed topic, and
contradictions in the debate. For their approach, a SVM and a deep forward neural
network (FNN) were used. In the results, the FNN showed better results
comparing to the SVM. They created a corpus from the USA presidential debates of
2016, with a total of 5,415 sentences and they made it available to the research
community.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Task Description</title>
      <p>
        A set of presidential debates from the US presidential election are presented for
the task, where each claim in the debate text has been tagged manually as worth
to be check (1) or not (0). The text of the debates is used for the task as it is, to
give the opportunity to the participants to exploit also contextual features in the
debates. These debates are provided in two languages, English and Arabic, where
the Arabic text is obtained translating from the English debates. The dataset
that was provided is totally imbalanced, where the total number of claims is
4064: 90 claims are worth to be check and 3970 are not. The task goal is to
detect the claims that are worthy for checking and to rank them, from the most
worthy one for checking to the lowest one. The Average Precision was used as a
performance measure. More details of the task are mentioned in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Proposed Approach</title>
      <p>
        Previously, the authors in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] have proposed a text distortion technique to
enhance thematic text clustering by maintaining the words that have a low
frequency in a document. Later on, a similar research in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] has used the same
text distortion technique for authorship attribution task, where the author has
maintained the words that have the highest frequency in the documents, in an
attempt to detect the author from his writing style.
      </p>
      <p>
        We believe that this type of tasks is more thematic than stylistic, where the
writing style is not important as the thematic words. In our approach, we used
the same text distortion technique to detect worthy claims, where we concealed
words that have high frequency in documents and maintaining (highlighting)
other cue words that are used more in factual claims. Therefore, we followed [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
in their approach by concealing the most frequent words.
4.1
      </p>
      <p>
        Text Distortion with Linguistic Features
In our approach, we have maintained the thematic words (that have the lowest
frequency) using a threshold (C). The higher value of C is, the more thematic
words are maintained. Also, we maintained a set of linguistic cue words (LC)
that were used previously in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] to infer the credibility of news (see Table 1).
Additionally, we maintained also the named entities (NE) from being distorted,
such as: Iraq, Trump, America. Through the manually checking of the claims, we
found the checking worthy claims tended to list di erent types of named entities.
In Table 2, we show an example of the distortion process.
      </p>
      <p>After applying the distortion process, the new version of the text was used by
the char n-gram model using Tf-Idf weighting scheme. The new distorted text,
become less biased by the high frequency words, such as stopwords. Finally,
after preparing the distorted text, there is still one issue which is the value of
C variable. The value of C is crucial, being a threshold between the amount
of thematic and the stylistic words. In Section 5, we show how we select the
most appropriate value of C. For the Arabic language, we employed the same
approach, where the only issue we had was the Arabic version of the linguistic
lexicons. The manual translation of them is a time-consuming process, where
they are quite large. Therefore, we used Google Translation API to translate
these lexicons.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Experiments and Results</title>
      <p>In this section, we present the tuning process of our approach. We carried
out many experiments to test di erent machine learning classi ers. We found
that the K-Nearest-Neighbor (KNN) has achieved the highest Average Precision
value. We had have two parameters to select the best model: the value of
Kneighbors (K) of the KNN classi er and the C value of the distortion ratio. The
selection process of these two values is hard to be set manually, therefore, we
used the Grid Search technique to select the most appropriate values of these
two parameters. The best value of K is 1, where for C value is 1700. The low
value of K is due to the highly imbalanced situation of the dataset; larger values
tend to bias the classi er to the majority class. A similar process was applied
to select the best parameters but using word n-gram rather than character. For
the evaluation, the Average Precision @N was used. The results of both runs
are showed in Table 3. From these runs, we can see that char n-gram model
outperformed clearly the one using word n-gram.
Ranking Claims After the claims have been detected, it is important to rank
them based on their worthiness for checking. For the ranking process, we used
the KNN classi er. We ranked the claims based on the KNN con dence in the
classi cation process. At the beginning, we extracted the distances to the nearest
neighbor (since we used K-neighbor equal to 1) for all the predictions in the test
le. Then we applied a normalization for the distances to range 0-1. For each
predicted instance, we checked the class type of the nearest neighbor: if it was
positive, we subtracted the distance value from 1 and we used it for the ranking.
We subtracted the distance from 1 to take the inverse of it: the small distance
value (near to zero) mean a high classi cation con dence. The highest value
(near to 1) is the one that obtained a higher rank (more worthy for checking).
We applied the same process when the nearest neighbor is from the negative
class, the rank value by -1, in order to discriminate the positive and the negative
instances.</p>
      <p>As we mentioned before, the used measure for this task is the Average
Precision. In the o cial testing phase, multiple testing les were presented. For the
nal results the Mean Average Precision (MAP) was used. The o cial results of
the task 1 are shown in Table 4. In the English part of the task, our approach
has achieved the third position among seven teams, where the results are close
to each other. In the Arabic part, only two teams have submitted their results.
Similar to English, the results are close and there is not big di erence between
them. We believe that the lower results of our approach in the Arabic part is
because of the automatic translation of the lexicons. A manual translation would
have been more reliable.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and Future Work</title>
      <p>Detecting the claims that are worthy to be checked is important being a
preliminary step for detecting their factuality. With these two steps, we can improve
the journalists work of manual investigating for instance presidential debates.
As a result of that, journalists can achieve their work quicker and in an easier
manner. As we saw from the o cial results, performances are low, showing the
di culty of the task. Also, we can conclude that text distortion method worked
better than using the full text in the classi cation process, where it has improved
the results comparing to the baseline with the normal BOW method. As future
work, we will try to test more features that could discriminate the claims that
are worthy to be checked from those that are not.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>The authors acknowledge the SomEMBED TIN2015-71147-C2-1-P MINECO
research project. The work on the data in Arabic as well as this publication were
made possible by NPRP grant #9-175-1-033 from the Qatar National Research
Fund (a member of Qatar Foundation). The statements made herein are solely
the responsibility of the last two authors.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ghanem</surname>
            , Bilal, Manuel Montes-y-Gomez, Francisco Rangel and
            <given-names>Paolo</given-names>
          </string-name>
          <string-name>
            <surname>Rosso. UPVINAOE-Autoritas - Check That</surname>
          </string-name>
          :
          <article-title>An Approach based on External Sources to Detect Claims Credibility</article-title>
          . In Working Notes of CLEF 2018 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , CLEF '18,
          <string-name>
            <surname>Avignon</surname>
          </string-name>
          , France, September.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Lippi</surname>
            , Marco, and
            <given-names>Paolo</given-names>
          </string-name>
          <string-name>
            <surname>Torroni</surname>
          </string-name>
          .
          <article-title>Context-Independent Claim Detection for Argument Mining</article-title>
          .
          <source>In IJCAI</source>
          , vol.
          <volume>15</volume>
          , pp.
          <fpage>185</fpage>
          -
          <lpage>191</lpage>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Hassan</surname>
            , Naeemul,
            <given-names>Chengkai</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            , and
            <given-names>Mark</given-names>
          </string-name>
          <string-name>
            <surname>Tremayne</surname>
          </string-name>
          .
          <article-title>Detecting Check-Worthy Factual Claims in Presidential Debates</article-title>
          .
          <source>In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management</source>
          , pp.
          <fpage>1835</fpage>
          -
          <lpage>1838</lpage>
          . ACM,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gencheva</surname>
          </string-name>
          , Pepa, Preslav Nakov,
          <article-title>Llu s Marquez, Alberto Barron-Ceden~o, and Ivan Koychev. A Context-Aware Approach for Detecting Worth-Checking Claims in Political Debates</article-title>
          .
          <source>In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017</source>
          , pp.
          <fpage>267</fpage>
          -
          <lpage>276</lpage>
          .
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Granados</surname>
          </string-name>
          , Ana, Manuel Cebrian, David Camacho, and Francisco de Borja Rodriguez.
          <article-title>Reducing the Loss of Information Through Annealing Text Distortion</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>23</volume>
          , no.
          <issue>7</issue>
          (
          <year>2011</year>
          ):
          <fpage>1090</fpage>
          -
          <lpage>1102</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Stamatatos</surname>
          </string-name>
          , Efstathios.
          <article-title>Authorship Attribution using Text Distortion</article-title>
          .
          <source>In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume</source>
          <volume>1</volume>
          ,
          <string-name>
            <surname>Long</surname>
            <given-names>Papers</given-names>
          </string-name>
          , vol.
          <volume>1</volume>
          , pp.
          <fpage>1138</fpage>
          -
          <lpage>1149</lpage>
          .
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Mukherjee</surname>
            , Subhabrata, and
            <given-names>Gerhard</given-names>
          </string-name>
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          .
          <article-title>Leveraging Joint Interactions for Credibility Analysis in News Communities</article-title>
          .
          <source>In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management</source>
          , pp.
          <fpage>353</fpage>
          -
          <lpage>362</lpage>
          . ACM,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Atanasova</surname>
          </string-name>
          ,
          <article-title>Pepa and Marquez, Llu s and Barron-Ceden~o, Alberto and Elsayed, Tamer and Suwaileh, Reem and Zaghouani, Wajdi and Kyuchukov, Spas</article-title>
          and Da San Martino, Giovanni and Nakov, Preslav, Overview of the CLEF-2018
          <source>CheckThat! Lab on Automatic Identi cation and Veri cation of Political Claims. Task</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Check-Worthiness</surname>
          </string-name>
          . In Working Notes of CLEF 2018 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , CLEF '18,
          <string-name>
            <surname>Avignon</surname>
          </string-name>
          , France, September.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>