<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Proceedings of the ASSIN 2 Shared Task</article-title>
      </title-group>
      <kwd-group>
        <kwd>Textual Entailment</kwd>
        <kwd>Portuguese</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>ASSIN 2 is the second edition of the Evaluation of Semantic Similarity and
Textual Inference (Avaliação de Similaridade Semântica e Inferência Textual)
in Portuguese, which took place as a parallel event with the STIL conference in
2019. Like the previous edition, it proposed a shared task on Semantic Similarity
and Textual Entailment: the former scores pairs of sentences from 1 to 5,
and the latter labels them as either entailment or non-entailment (but not
paraphrase, in contrast with the first edition).</p>
      <p>There are some notable differences between the first and second editions of
the shared task. Concerning the data, a new corpus of 10 thousand sentences
was presented, but instead of text extracted from news articles, it contains much
simpler sentence pairs, modeled after the SICK corpus. Because the sentences were
written specifically for this task, some linguistic phenomena could be directly
controlled. As a result, a word overlap baseline is not as strong on ASSIN 2 as it
was on ASSIN 1.</p>
      <p>On the systems side, we saw a reflection of the recent development of
neural networks. While hand-engineered features and lexical resources are still
useful, pretrained language models proved very helpful for both tasks
evaluated.</p>
      <p>This volume presents the main findings of the shared task organizers and
the descriptions of the strategies developed by the participants. With a total of
nine participants, we are happy with the results of ASSIN 2. We leave a new dataset
as a benchmark to evaluate the progress of this area in Portuguese, as well as
reflections on its research directions.</p>
    </sec>
    <sec id="sec-2">
      <title>February, 2020 Erick Fonseca Livy Real</title>
      <p>Hugo Gonçalo Oliveira
Erick Fonseca
Reviewers</p>
    </sec>
    <sec id="sec-3">
      <title>B2W Digital/Grupo de Linguística Computacional – USP, Brazil CISUC / DEI, Universidade de Coimbra, Portugal Instituto de Telecomunicações, Lisboa, Portugal</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>Organizing the ASSIN 2 Shared Task</article-title>
          Livy Real, Erick Fonseca, Hugo Gonçalo Oliveira
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>NILC at ASSIN 2: Exploring Multilingual Approaches</article-title>
          Marco A. Sobrevilla Cabezudo, Marcio Inácio, Ana Carolina Rodrigues, Edresson Casanova, Rogério Figueredo de Sousa
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>