<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Collective Elaboration of a Coreference Annotated Corpus for Portuguese Texts</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Evandro Fonseca</string-name>
          <email>evandro.fonseca@acad.pucrs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vinicius Sesti</string-name>
          <email>vinicius.sesti@acad.pucrs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sandra Collovini</string-name>
          <email>sandra.abreu@acad.pucrs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Renata Vieira</string-name>
          <email>renata.vieira@pucrs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ana Luísa Leal</string-name>
          <email>analeal@umac.mo</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paulo Quaresma</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Pontifical Catholic University of Rio Grande do Sul</institution>
          ,
          <addr-line>PUCRS</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Évora</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Macau</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>68</fpage>
      <lpage>82</lpage>
      <abstract>
        <p>This paper describes the collaborative creation of a corpus with coreference annotation for Portuguese. The annotation was performed using the coreference annotation CORP, and the editing tool CorrefVisual. The texts were automatically annotated and manually revised by Portuguese speakers. As a result a new corpus for coreference studies was produced for Portuguese.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Coreference resolution basically consists of finding different references to a same
entity in a text, as in the example: “A França resiste como único país da União</p>
      <p>E. Fonseca et al.</p>
      <p>Européia a não permitir o patenteamento de genes”. The noun phrases [único
país da União Européia a não permitir o patenteamento de genes] and [A França]
are considered coreferent. In other words, they belong to the same coreference
chain.</p>
      <p>
        Coreference resolution may provide important input for other NLP tasks. One
example is the area of entity relation extraction, since coreference links may be
useful for extracting implicit relations [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Consider the following sentence:
“[Barack Obama], said today that the climate changes are a great threat for
the planet”. [The United States president] ...”. When identifying and creating a
coreference relation between [Barack Obama] and [the United States president],
it is possible to infer a relation between the entities [Barack Obama] and [United
States] (in which Barack Obama is the president of the United States). Also,
when we link Barack Obama with the president, it is possible to classify him as
a person, as well as to say that he has a relation with the United States.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Coreference resolution is very important in understanding texts; thus, it is a
crucial step in many high-level natural processing tasks, ranging from
information extraction to text summarization or machine translation [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. In general,
the evaluation of systems devoted to this task depends on reference corpora
(golden standards). There are, for example, English coreference annotated
corpora that have been used in coreference resolution tracks such as SemEval,
ACE and CoNLL [
        <xref ref-type="bibr" rid="ref22 ref23 ref24 ref29 ref3 ref8">3,29,24,8,23,22</xref>
        ]. SemEval (Evaluation Exercises on
Semantic Evaluation) includes, among others tasks, the Coreference Resolution task
[
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], considering multiple languages (Catalan, Dutch, English, German, Italian
and Spanish). This task involved automatically detecting full coreference chains,
composed of named entities, pronouns, and full noun phrases. The datasets used
in SemEval task were extracted from five corpora: 1) the AnCora corpora [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]
for Catalan and Spanish; 2) the KNACK-2002 corpus [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] for Dutch; 3) the
OntoNotes Release 2.0 corpus for English [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]; 4) the TurBa-D/Z corpus [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] for
German; and 5) the LiveMemories corpus [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] for Italian.
      </p>
      <p>
        CoNLL-2011 Coreference Task included a closed (limited to using the
distributed resources) and an open track (unrestricted use of external resources).
The task was to automatically identify mentions of entities and events in texts
and to link the coreferring mentions together to form coreference chains. For
this, the participants could use information from other structural layers
including parsing, semantic roles, word sense and named entities. It was based on
OntoNotes 4.0 [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
      </p>
      <p>
        The OntoNotes is a large-scale corpus of general anaphoric coreference not
restricted to noun phrases or to a specified set of entity types [
        <xref ref-type="bibr" rid="ref22 ref23">23,22</xref>
        ]. In
addition to coreference, the corpus provides other layers of annotation: syntactic
trees; propositions structures of verbs; partial verb and noun word senses; and
18 named entity types. OntoNotes is a multi-lingual resource with annotations
available in three languages: English, Chinese and Arabic.
      </p>
      <p>
        OntoNotes corpus is of crucial important for data modeling of linguistically
easier cases of coreference. Complex cases are being investigated more recently,
one of the main reasons for this is the lack of appropriated datasets [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. The
ARRAU dataset is a multi-domain corpus with large-scale annotations of
various linguistic phenomena related to anaphora. A second release of the ARRAU
is presented in [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], and the authors not only focused on increasing the number
of documents, but also invested a considerable effort into improving the data
quality. The data is manually labeled for tasks such as coreference resolution,
bridging, mention detection, referentiality an genericity. The documents were
annotated for anaphoric information, using the MMAX (Multi-Modal
Annotation in XML) tool, which is specific for corpus annotation, with main focus in
the annotation of coreference [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. The annotation followed the ARRAU
guidelines, which focused on a more detailed representation of linguistic phenomena
related to anaphoric and coreference. The authors present the main differences
between ARRAU and two coreference corpus: ACE and OntoNotes. The
difference between these corpora stands out, ARRAU considers different types of
noun phrases, including markables that do not participate in coreference chains
(singletons and non-referentials). Also, this corpus combines coreference with
bridging, and for the third release of ARRAU, the authors plan to focus on
bridging.
      </p>
      <p>
        One of the difficulties for the creation of annotated corpora is the availability
of specialists for this task. An alternative is crowd-sourcing approach, which
uses a non-expert crowd to annotate text, driven by cost, speed and scalability
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] Phrase Detectives, an interactive online game for creating annotated
anaphoric coreference corpora using GWAP (game-with-a-purpose) approach
is presented. The Phrase Detectives Corpus 1.0 contains 45 documents from
Wikipedia articles and narrative text, with 6,452 markables.
      </p>
      <p>
        HAREM is a joint evaluation effort for Portuguese (Avaliação de Sistemas
de Reconhecimento de Entidades Mencionadas) [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. This contest had the
purpose of studying expressions regarding proper names (mentioned entities). The
Second HAREM took took place in 2008 and it included the task of identifying
the semantic relations between mentioned entities, called ReRelEM track
(Reconhecimento de Relações entre Entidades Mencionadas). This was concerned
with the automatic detection of relations between named entities in a document
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. ReRelEM, although maintaining the restriction to named entities, is also
a source of coreference annotation, since the authors proposed the detection of
relations between named entities, including coreference, represented by the
relation of Identity (entities with the same referent, defined to all the categories
and whose instances must had the same category).
      </p>
      <p>
        Another related Portuguese corpus is the Summ-it corpus [
        <xref ref-type="bibr" rid="ref1 ref4">4,1</xref>
        ]. It is a
corpus gathering annotations of various linguistic levels, including coreference, but
also morphological, syntactic and rhetorical relations. Summ-it has a total of
560 coreference chains with an average of 3 noun phrases for chain, where the
largest chain has 16 members (noun phrases). Recently, a new version of
Summit corpus was enriched with two layers: named entities and the relations that
4
occur between these entities [
        <xref ref-type="bibr" rid="ref5 ref6">6,5</xref>
        ], this version is called Summ-it++1 and is
described in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The coreference information is the same from the original Summ-it
corpus. However, other layers of linguistic (morpho-syntactic) information were
generated by other tools and converted to a new format based on SemEval [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
Garcias’s corpus also contains coreference annotation, but only for Person
entities [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. It is also given in the SemEval format. It is a multilingual corpus2
including Portuguese, Galician and Spanish. One of the motivations for this
collaborative task of creating an annotated corpus is, therefore, to increase the
number of annotated coreference data for Portuguese. Instead of creating such
annotated corpus from scratch, we adopted a different methodology, we proposed
the edition of coreference chains produced by coreference resolution tool.
4
      </p>
    </sec>
    <sec id="sec-3">
      <title>Corpus submission and participating teams</title>
      <p>
        The general objective of the proposed task was a collective elaboration of a
Portuguese annotated corpus for nominal coreference. For that, each participant
team submitted a corpus of their own interest. Seven teams submitted their
corpus. The resulting corpus is composed by journalistic texts [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]; by miscellaneous
texts (books, magazines, journalistic, among others) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]; and Wikipedia dump
articles, selected randomly. The corpus is further described in Section 6.1.
      </p>
      <p>The first phase of the task consisted of corpus submission by participant
teams. Each participant team submitted around 30 texts written in Portuguese,
considering domains of their own interest. The proposed average size for these
texts was 1200 tokens each. Plus, each team justified the reason(s) of corpus
choice, including the related studies. Seven groups submitted texts for
annotation. Three main text sets were submitted, as described below and detailed in
Table 1.</p>
      <p>
        – CSTNews[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] is a corpus developed for multi-document summarization and
used for several studies in Portuguese, mainly for researches on discourse
phenomena. This was divided in five parts, one for each group from USP.
– A sample of the larger corpus PAROLE [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], compiled in the scope of the
European project LE-PAROLE. For each language involved in the project, a 20
million word corpus was built with harmonized design, composition and
codification, including a 250.000 word subcorpus, tagged with POS information
and revised manually.
– Wikipedia articles written in Portuguese language. This corpus is an extract
composed by 30 entire articles, each with more than 1100 and less than 1400
words, randomly selected from the Wikipedia dump from 26/03/2017.
      </p>
      <p>
        There was a training phase, when participants got used to the editing tool[
        <xref ref-type="bibr" rid="ref28">28</xref>
        ].
We provided one text annotated by CORP[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Each team’s members revised the
coreference chains and could ask questions about the task.
      </p>
      <sec id="sec-3-1">
        <title>1 http://www.inf.pucrs.br/linatural/summit_plus_plus.html 2 http://gramatica.usc.es/ marcos/coling14.tar.bz</title>
        <p>Team Corpus Texts
USP 1 1/5 28
USP 2 2/5 28
USP 3 CST-News 3/5 28
USP 4 4/5 28
USP 5 5/5 25
UFBA Wikipedia 30
EVORA Le-Parole 12</p>
        <p>Table 1. Submitted texts</p>
        <p>Finally, there was the annotation phase. First, all texts were annotated with
CORP, then, each team received its own corpus plus a few extra texts included for
measuring team level agreement (according to Table 2). The corpus annotation
phase is described in detail in the next section.
5
5.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Corpus Annotation</title>
      <sec id="sec-4-1">
        <title>Text Distribution among Annotators</title>
        <p>The corpora were received and distributed among team members in a way to
allow agreement measures. For that we used first a set of three texts chosen by
the organizers for calculating inter team level agreement; secondly a subgroup
of four texts from each submitted corpus should be annotated by all members in
its respective team. In Table 2, we exemplify how we organized the distribution
of texts. This example considers a scenario of a team with three annotators and
a corpus of sixteen texts. Each member annotated one of our chosen texts for
inter team agreement (TK1, TK2 and TK3), whereas four texts of the submitted
corpus were replicated to all annotators of that team (TG1, TG2, TG3 and TG4).</p>
        <p>
          The texts were then annotated with coreference and distributed among each
team. The annotation consisted in editing the generated chains. Next, we
de6
scribe CORP, the coreference resolution tool [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], and CorrefVisual [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ], the
editing tool used in this task.
The annotation task is based on previous annotation generated by a coreference
resolution tool and the edition of the generated chains with the help of an editing
tool, as described below.
        </p>
        <p>
          CORP is a coreference resolution tool for Portuguese [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] which was built on the
basis of deterministic rules, in the line with previous tools proposed for English
[
          <xref ref-type="bibr" rid="ref18 ref19">19,18</xref>
          ]. An important difference from these previous works for English is,
however, the inclusion of semantic knowledge, which is provided by Onto.PT [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
The tool produces 2 outputs: the first in XML, containing the original text, the
list of sentences, tokens, part-of-speech, coreference chains (Figure 1) and single
mentions. This format allows the interoperability with other applications. The
second output format is given in HTML for the visualization of generated
coreference chains, which can be seen through the tool’s web interface3. A desktop
version is also available for download4.
        </p>
        <p>CorrefVisual is a tool developed in order to allow the edition of coreference
chains annotated with CORP. It provides a user-friendly graphical interface for
visualizing and replacing NPs in other coreference chains. It also allows the
editing of noun phrases, creation and deletion of chains and persistency of changes.</p>
        <p>The interface displays information in three different main panels: the first
displays the text and selected noun phrases; the second displays coreference chains,
each in a particular subpanel; and the third displays single (non-coreferent)
nounphrases (unique mentions). Each chain is associated with one color in order to
show the different chains.</p>
        <p>Upon selection of noun phrases, they are highlighted in the text according to
their chain’s color. In figure 2, one chain is highlighted. CorrefVisual is available
for download5.</p>
        <sec id="sec-4-1-1">
          <title>3 http://ontolp.inf.pucrs.br/corref/</title>
          <p>4
http://www.inf.pucrs.br/linatural/wordpress/index.php/recursos-eferramentas/corp-coreference-resolution-for-portuguese/
5
http://www.inf.pucrs.br/linatural/wordpress/index.php/recursos-eferramentas/correfvisual/
We measured annotation agreement on the basis of Kappa statistics. Kappa is
usually used to measure concordance among canonical elements. For the
coreference task, we need to calculate the agreement of complex elements: coreference
8
chains. Basically, a coreference chain may have two or more noun phrases. Thus,
for the correct calculation of agreement, we need to transform these chains into
items that may be analysed as a category.</p>
          <p>One way of perform that is to transform each chain into coreference pairs.
That is, for the chain C={a,b,c}, wherein ‘a’, ‘b’ and ‘c’ represent noun phrases,
we represent it as follows: P={(a,b),(a,c),(b,c)}.</p>
          <p>To perform the calculation, we need to consider the set of documents (D) and
the set of annotators (A). For example, for a set of documents D={d1, d2, d3}
and set of annotators A={a1, a2, a3}, we create, for each document dx belonging
to the set of documents D, the set [dx, where [dx is the union of all coreference
chains annotated for that document; such that Ux={dxa1 [ dxa2 [dxa3}.</p>
          <p>Assuming that annotator a1 has created two coreference chains: c1a1={a, b,
c}, c2a1={d, f}, and annotators a2 and a3 have considered only one, c1a2={a,b,c},
c1a3={a,b,c}, while d and f are annotated as non-coreferent by both, the
resulting union set is Ud1 = {a, b, c, d, f }.</p>
          <p>Then we transform the union set into pairs and determine which pairs are
considered coreferent or not by each annotator. The set of pairs is PU d1= {(a,b),
(a,c), (a,d), (a,f), (b,c), (b,d), (b,f), (c,d), (c,f), (d,f)}.</p>
          <p>
            In Table 3, we can see the Kappa calculation of this example. Each pair
represents an item to be classified as Coreferent or Non-Coreferent. The pairs
(a, b), (a, c) and (b, c) appear in the same coreference chain for three annotators,
indicating that they considered them coreferent. The pairs (a, f), (b, d), (b, f),
(c, d) and (c, f) were considered non-coreferent by all anotators. For the pair
(d, f), there was a disagreement between the annotators. Thus, the Coreferent
class receives ‘1’ and the Non-Coreferent class receives ‘2’. This process made
for document d1 is repeated for other documents. We calculate Kappa [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ] from
the values represented in Table.
5.4
          </p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>Kappa Results</title>
        <p>
          As a result of this IBEREVAL task, we obtained a coreference corpus for
Portuguese: Corref-PT. The corpus was annotated as an effort made by seven teams,
with a total of twenty-one Portuguese native speakers annotators, varying among
students and professors in the area of computational linguists. The corpus is
available in CORP’s XML (Figure 1) and SemEval format [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] used by other
well known coreference corpora, such as Ontonotes [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], Summ-it++ [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] and
Garcia’s corpus [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Corref-PT is available for download6.
6
http://www.inf.pucrs.br/linatural/wordpress/index.php/recursos-eferramentas/corref-pt/
10
        </p>
        <p>
          In Table 6, we show the SemEval format. It is available in a single file,
containing all texts. Each text document is contained within a “#begin document
ID” line and another line containing only “#end document”. Each sentence’s
information is organized vertically, with one token per line, and a blank line
after the last token of each sentence. The information associated with each token
is available in columns (separated by a tab character - “\t”). The annotation
columns contain, respectively: Token’s ID in sentence; the word or multiword
itself; lemma; each word’s Part-of-speech tagging; features (gender and number);
Head, denoting if the word is a head word in the NP (if so, this field receives
’0’) and coreference information, where each coreferent noun phrase starts with
“( ”, followed by the chain’s ID. Note that the “) ” just occurs in the last NP
token. Basically, coreferent NPs receives the same chain ID.
Corref-PT is composed by texts from the CSTNews corpus [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]; from the Parole
corpus (miscellaneous texts from books, magazines, journalistic, among others)
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]; Wikipedia articles, selected randomly; and also a few scientific texts from
Collective Elaboration of a Coreference Annotated Corpus
11
Fapesp Magazine7. Metrics about number of texts, tokens, mentions,
coreferential mentions, coreference chains and chains sizes are shown in Table 7.
        </p>
        <p>Corpus
CST-News
Le-Parole</p>
        <p>Wikipedia
Fapesp Magazine</p>
        <p>Total
The annotators evaluated the task regarding a few issues inquired through a
survey on Google Forms. Fifteen of the 21 participants sent their answers. They
were asked about their confidence level in the annotation, whether the previous
automatic annotation was helpful for the task and about the necessity of noun
phrase edition for the task (considering that noun phrase identification was made
automatically by a parser). We can see in Figure 3 that few annotators had high
confidence in their annotation. Most participants were not sure about this issue.</p>
        <p>Regarding previous annotation (Figure 4), most participants were ambivalent
whether this helps or not the process, but a greater number thought it was
helpful.</p>
        <p>Regarding noun phrase edition (Figure 5), 60% of participants strongly agreed
that is indispensable for the annotation task. That is indeed a crucial
preprocessing requirement for build the chains, that the references are correctly
identified. The main problem here was that the task was in fact mostly fixed
regarding mention detection, and it was based on the parser’s NP chunks.
Suggestions given by the annotators were most related to CorrefVisual’s usability
one major problem was related to noun phrase edition. That was very difficult to
handle by the annotators, since the mention detection is required for identifying
coreference chains correctly, but the tool was not primarily meant for that.
In this paper, we presented a collaborative coreference annotation task which
resulted in a coreference corpus for Portuguese with nearly 4000 chains.
Considering Summ-it++, a previous available resource of the kind, with around
500 chains, we now have a coreference annotated corpus with 8 times as many
chains. The resource is available both in the SemEval format and in CORP’s
XML8. The annotated corpus can be visualized in the CorrefVisual tool9. For
the next steps, we have to improve questions regarding automatic mention
detection, which seems to be a major pre-processing issue for this task, and similarly
we have also to improve the ways for their manual editing, if we consider
further annotation tasks. Regarding the annotation agreement, we can see that
there is mainly moderate agreement. However, as future work, a revision of this
annotation should be done in order to improve the quality of annotation.
8
http://www.inf.pucrs.br/linatural/wordpress/index.php/recursos-eferramentas/corref-pt/
9
http://www.inf.pucrs.br/linatural/wordpress/index.php/recursos-eferramentas/correfvisual/
Collective Elaboration of a Coreference Annotated Corpus
13</p>
        <p>E. Fonseca et al.
Collective Elaboration of a Coreference Annotated Corpus
15</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>A.</given-names>
            <surname>Antonitsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Figueira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Amaral</surname>
          </string-name>
          , E. Fonseca,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vieira</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Collovini.</surname>
          </string-name>
          Summ-it++
          <article-title>: an enriched version of the summ-it corpus</article-title>
          . In N. Calzolari,
          <string-name>
            <given-names>K.</given-names>
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Grobelnik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Odijk</surname>
          </string-name>
          , and S. Piperidis, editors,
          <source>Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ), pages
          <fpage>2047</fpage>
          -
          <lpage>2051</lpage>
          , Paris, France,
          <year>2016</year>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association (ELRA).</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Carletta</surname>
          </string-name>
          .
          <article-title>Assessing agreement on classification tasks: the kappa statistic</article-title>
          .
          <source>Computational linguistics</source>
          ,
          <volume>22</volume>
          (
          <issue>2</issue>
          ):
          <fpage>249</fpage>
          -
          <lpage>254</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Chamberlain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Poesio</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Kruschwitz</surname>
          </string-name>
          .
          <article-title>Phrase detectives corpus 1.0 crowdsourced anaphoric coreference</article-title>
          . In N. Calzolari,
          <string-name>
            <given-names>K.</given-names>
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Grobelnik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Odijk</surname>
          </string-name>
          , and S. Piperidis, editors,
          <source>Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ), pages
          <fpage>2039</fpage>
          -
          <lpage>2046</lpage>
          , Paris, France,
          <year>2016</year>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association (ELRA).</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>S.</given-names>
            <surname>Collovini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. I.</given-names>
            <surname>Carbonel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Fuchs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Coelho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rino</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Vieira</surname>
          </string-name>
          .
          <article-title>Summ-it: Um corpus anotado com informações discursivas visando a sumarização automática</article-title>
          .
          <source>In Proceedings of V Workshop em Tecnologia da Informação e da Linguagem Humana</source>
          , Rio de Janeiro, RJ, Brasil, pages
          <fpage>1605</fpage>
          -
          <lpage>1614</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. S. Collovini de Abreu and
          <string-name>
            <given-names>R.</given-names>
            <surname>Vieira</surname>
          </string-name>
          . Relp:
          <article-title>Portuguese open relation extraction</article-title>
          .
          <source>Knowledge Organization</source>
          ,
          <volume>44</volume>
          (
          <issue>3</issue>
          ):
          <fpage>163</fpage>
          -
          <lpage>177</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>D. O. F. do Amaral</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Vieira</surname>
          </string-name>
          .
          <article-title>NERP-CRF: uma ferramenta para o reconhecimento de entidades nomeadas por meio de conditional random fields</article-title>
          .
          <volume>6</volume>
          (
          <issue>1</issue>
          ):
          <fpage>41</fpage>
          -
          <lpage>49</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>M. F. B. do Nascimento</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mendes</surname>
            , and
            <given-names>L. Pereira.</given-names>
          </string-name>
          <article-title>Providing on-line access to portuguese language resources: Corpora and lexicons</article-title>
          .
          <source>In Proceedings of the International Conference on Language Resources and Evaluation</source>
          , Portugal,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>G.</given-names>
            <surname>Doddington</surname>
          </string-name>
          , A. Mitchell,
          <string-name>
            <given-names>M.</given-names>
            <surname>Przybocki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ramshaw</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Strassel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Weischedel</surname>
          </string-name>
          .
          <article-title>The automatic content extraction (ace) program: Tasks, data, and evaluation</article-title>
          . In M. T. Lino,
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Xavier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ferreira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Costa</surname>
          </string-name>
          , and R. Silva, editors,
          <source>Proceedings of the 4th International Conference on Language Resources and Evaluation - LREC</source>
          <year>2004</year>
          , pages
          <fpage>837</fpage>
          -
          <lpage>840</lpage>
          , Lisboa,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>E. B.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sesti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Antonitsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Vanin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Vieira</surname>
          </string-name>
          .
          <article-title>Corp - uma abordagem baseada em regras e conhecimento semântico para a resolução de correferências</article-title>
          .
          <source>Linguamatica</source>
          ,
          <volume>9</volume>
          (
          <issue>1</issue>
          ):
          <fpage>3</fpage>
          -
          <lpage>18</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. E. B.
          <string-name>
            <surname>Fonseca</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Vieira</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Vanin</surname>
          </string-name>
          . Corp:
          <article-title>Coreference resolution for portuguese</article-title>
          .
          <source>In 12th International Conference on the Computational Processing of Portuguese, Demo Session (PROPOR)</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>C. Freitas</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Mota</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>H. G.</given-names>
          </string-name>
          <string-name>
            <surname>Oliveira</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Carvalho. Second HAREM</surname>
          </string-name>
          <article-title>: advancing the state of the art of named entity recognition in portuguese</article-title>
          .
          <source>In Proceedings of the International Conference on Language Resources and Evaluation</source>
          ,
          <string-name>
            <surname>LREC</surname>
          </string-name>
          , Valletta, Malta,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>R.</given-names>
            <surname>Gabbard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Freedman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Weischedel</surname>
          </string-name>
          .
          <article-title>Coreference for learning to extract relations: yes, virginia, coreference matters</article-title>
          .
          <source>In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers-Volume</source>
          <volume>2</volume>
          , pages
          <fpage>288</fpage>
          -
          <lpage>293</lpage>
          . Association for Computational Linguistics,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>M.</given-names>
            <surname>Garcia</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Gamallo</surname>
          </string-name>
          .
          <article-title>Multilingual corpora with coreferential annotation of person entities</article-title>
          .
          <source>In Proceedings of the 9th edition of the Language Resources and Evaluation Conference - LREC</source>
          , pages
          <fpage>3229</fpage>
          -
          <lpage>3233</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>H. Gonçalo</given-names>
            <surname>Oliveira</surname>
          </string-name>
          . Onto. PT:
          <article-title>Towards the Automatic Construction of a Lexical Ontology for Portuguese</article-title>
          .
          <source>PhD thesis</source>
          ,
          <source>Ph. D. thesis, Univ. of Coimbra/FST</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>E. W.</given-names>
            <surname>Hinrichs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kübler</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Naumann</surname>
          </string-name>
          .
          <article-title>A unified representation for morphological, syntactic, semantic, and referential annotations</article-title>
          .
          <source>In Proceedings of the Workshop on Frontiers in Corpus</source>
          Annotations II:
          <article-title>Pie in the Sky</article-title>
          ,
          <source>CorpusAnno '05</source>
          , pages
          <fpage>13</fpage>
          -
          <lpage>20</lpage>
          , Stroudsburg, PA, USA,
          <year>2005</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>V.</given-names>
            <surname>Hoste and G. De Pauw.</surname>
          </string-name>
          Knack-2002:
          <article-title>a richly annotated corpus of dutch written text</article-title>
          .
          <source>In Proceedings of The Fifth international conference on Language Resources and Evaluation</source>
          , pages
          <fpage>1432</fpage>
          -
          <lpage>1437</lpage>
          , Genoa, Italy,
          <year>2006</year>
          . European Language Resources Association, European Language Resources Association.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>J.</given-names>
            <surname>Howe</surname>
          </string-name>
          .
          <article-title>Crowdsourcing: Why the Power of the Crowd Is Driving the Future of Business</article-title>
          . Crown Publishing Group, New York, NY, USA,
          <volume>1</volume>
          <fpage>edition</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>H.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Peirsman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chambers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Surdeanu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Jurafsky</surname>
          </string-name>
          .
          <article-title>Deterministic coreference resolution based on entity-centric, precision-ranked rules</article-title>
          . volume
          <volume>39</volume>
          , pages
          <fpage>885</fpage>
          -
          <lpage>916</lpage>
          . Computational Linguistics - MIT Press,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>H.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Peirsman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chambers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Surdeanu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Jurafsky</surname>
          </string-name>
          .
          <article-title>Stanford's multi-pass sieve coreference resolution system at the conll-2011 shared task</article-title>
          .
          <source>In Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task. Association for Computational Linguistics</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>E. G.</given-names>
            <surname>Maziero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L. del Rosario</given-names>
            <surname>Castro Jorge</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. A. S.</given-names>
            <surname>Pardo</surname>
          </string-name>
          .
          <article-title>Identifying multidocument relations</article-title>
          .
          <source>In Natural Language Processing and Cognitive Science, Proceedings of the 7th International Workshop on Natural Language Processing and Cognitive Science, NLPCS</source>
          <year>2010</year>
          ,
          <article-title>In conjunction with ICEIS 2010, Funchal</article-title>
          , Madeira, Portugal,
          <year>June 2010</year>
          , pages
          <fpage>60</fpage>
          -
          <lpage>69</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <given-names>C.</given-names>
            <surname>Müller</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Strube</surname>
          </string-name>
          .
          <article-title>Mmax: A tool for the annotation of multi-modal corpora</article-title>
          .
          <source>In Proceedings of the 2nd IJCAI Workshop on Adaptive Text Extraction and Mining - IJCAI</source>
          <year>2001</year>
          , Seattle, Washington,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <given-names>S.</given-names>
            <surname>Pradhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ramshaw</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marcus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Palmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Weischedel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Xue</surname>
          </string-name>
          . Conll
          <article-title>-2011 shared task: Modeling unrestricted coreference in ontonotes</article-title>
          .
          <source>In Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>27</lpage>
          . Association for Computational Linguistics,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Pradhan</surname>
          </string-name>
          , E. Hovy,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marcus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Palmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ramshaw</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Weischedel</surname>
          </string-name>
          .
          <article-title>Ontonotes: A unified relational semantic representation</article-title>
          .
          <source>In Proceedings of the International Conference on Semantic Computing, ICSC '07</source>
          , pages
          <fpage>517</fpage>
          -
          <lpage>526</lpage>
          , Washington, DC, USA,
          <year>2007</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>M. Recasens</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Màrquez</surname>
            , E. Sapena,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Martí</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Taulé</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Hoste</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Poesio</surname>
            , and
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Versley</surname>
          </string-name>
          . Semeval
          <article-title>-2010 task 1: Coreference resolution in multiple languages</article-title>
          .
          <source>In Proceedings of the 5th International Workshop on Semantic Evaluation</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . Association for Computational Linguistics,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <given-names>M.</given-names>
            <surname>Recasens</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Martí.</surname>
          </string-name>
          Ancora-co:
          <article-title>Coreferentially annotated corpora for spanish and catalan</article-title>
          .
          <source>Language Resources and Evaluation</source>
          ,
          <volume>44</volume>
          (
          <issue>4</issue>
          ):
          <fpage>315</fpage>
          -
          <lpage>345</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>K. J. Rodríguez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Delogu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Versley</surname>
            , E. Stemle, and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Poesio</surname>
          </string-name>
          .
          <article-title>Anaphoric annotation of wikipedia and blogs in the live memories corpus</article-title>
          . In N. Calzolari,
          <string-name>
            <given-names>K.</given-names>
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Piperidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rosner</surname>
          </string-name>
          , and D. Tapias, editors,
          <source>Proceedings of the International Conference on Language Resources and Evaluation - LREC. European Language Resources Association</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <given-names>D.</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Cardoso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Seco</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Vilela</surname>
          </string-name>
          .
          <article-title>Breve introduçao ao harem. HAREM, a primeira avaliaçao conjunta de sistemas de reconhecimento de entidades mencionadas para português: documentaçao e actas do encontro</article-title>
          ,
          <source>Linguateca</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>M. d</surname>
            . O. Tubino and
            <given-names>M. M. S.</given-names>
          </string-name>
          <string-name>
            <surname>Silva</surname>
          </string-name>
          . Visualização, manipulação e refinamento de correferência em língua portuguesa. Trabalho de conclusão de curso,
          <source>Pontifícia Universidade Católica do Rio Grande do Sul</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <given-names>O.</given-names>
            <surname>Uryupina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Artstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bristot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cavicchio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Rodriguez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Poesio</surname>
          </string-name>
          . ARRAU:
          <article-title>Linguistically-Motivated Annotation of Anaphoric Descriptions</article-title>
          .
          <source>In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ), pages
          <fpage>2058</fpage>
          -
          <lpage>2062</lpage>
          , Portorož, Slovenia,
          <year>2016</year>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association (ELRA).</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>K. van Deemter</surname>
            and
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Kibble</surname>
          </string-name>
          .
          <article-title>What is coreference, and what should coreference annotation be</article-title>
          ?
          <source>In Proceedings of the Workshop on Coreference and Its Applications</source>
          ,
          <source>CorefApp '99</source>
          , pages
          <fpage>90</fpage>
          -
          <lpage>96</lpage>
          , Stroudsburg, PA, USA,
          <year>1999</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>A. J. Viera</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          <string-name>
            <surname>Garrett</surname>
          </string-name>
          , et al.
          <article-title>Understanding interobserver agreement: the kappa statistic</article-title>
          .
          <source>Fam Med</source>
          ,
          <volume>37</volume>
          (
          <issue>5</issue>
          ):
          <fpage>360</fpage>
          -
          <lpage>363</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>