<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Adapting NER (CRF+LG) for Many Textual Genres ?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Juliana Pirovani</string-name>
          <email>juliana.campos@ufes.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James Alves</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marcos Spalenza</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wesley Silva</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cristiano da Silveira Colombo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elias Oliveira</string-name>
          <email>eliasg@lcad.inf.ufes.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Programa de Pos-Graduaca~o em Informatica Universidade Federal do Esp rito Santo (UFES)</institution>
          ,
          <addr-line>29.075-910 - Vitoria - ES -</addr-line>
          <country country="BR">Brasil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universidade Federal do Esp rito Santo (UFES)</institution>
          ,
          <addr-line>29.500-000 - Alegre - ES -</addr-line>
          <country country="BR">Brasil</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>421</fpage>
      <lpage>433</lpage>
      <abstract>
        <p>Named Entity Recognition is the task of automatically identifying named entities and classifying them into prede ned categories such as person, place, organization, among other categories considered relevant in speci c domains. This task is important and challenging, especially when the system must be able to recognize named entities in many textual genres, including genres that di er from those for which it was trained. CRF+LG is a hybrid system for Named Entity Recognition in Portuguese texts that combines a labeling obtained by a Conditional Random Fields with a term classi cation obtained by a Local Grammar as an additional informed feature. This paper aims to report the initial e orts made to adapt CRF+LG system for many textual genres in accordance with the proposed Portuguese Named Entity Recognition task in IberLEF 2019. We adapted the LG to capture rules of textual genres that do not appear in the examples of the training corpus and thus assist the Named Entity Recognition, even when there is no training set of an available textual genre. CRF+LG was also trained in an augmented training corpus.</p>
      </abstract>
      <kwd-group>
        <kwd>Named Entity Recognition</kwd>
        <kwd>Local Grammars</kwd>
        <kwd>Domain Adaptation</kwd>
        <kwd>Conditional Random Fields</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Named Entity Recognition (NER) is a task for identifying and classifying
automatically named entities (NEs) in free written texts. These NEs correspond to
names of person, places, organizations, among other categories considered
relevant in speci c domains. This task is important because it is a fundamental step
of preprocessing for several applications such as question answering systems [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ],
relation and event extraction [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and entity-oriented search [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Indeed, NEs are
an essential source of information in textual information retrieval.
      </p>
      <p>
        NER is a very challenging task as several categories of named entities are
written similarly and they appear in similar contexts. In addition, NER depends
on the language, the training corpus and a given domain [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Considering the
domain dependency, the same category of NE can be written in di erent ways
depending on the textual genre under analysis. For example, in e-mail texts it is
common to see person names after words as Hello and Good afternoon, whereas
in memorandum texts it is common to see person names after words as Public
servants and Professor. Consistent training sets including texts from di erent
genres are not always available.
      </p>
      <p>
        In 1995, the Message Understanding Conference [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] included the NER task
for the rst time for the English, carrying out a joint assessment of the area.
Thereafter, several similar events have emerged such as the ACE [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], CoNLL [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ],
HAREM [
        <xref ref-type="bibr" rid="ref12 ref27">12, 27</xref>
        ] and TAC [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. HAREM was an initiative for the Portuguese
organized by Linguateca [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The annotated corpora used in the First and Second
HAREM, known as the Golden Collections (GC), are used as a golden standard
reference for NER systems in Portuguese.
      </p>
      <p>
        This year (2019), the Portuguese NER task was one of the tasks proposed in
the Iberian Languages Evaluation Forum (IberLEF) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. The objective of this
task is to evaluate the submitted systems in many textual genres. The
participants were free to choose their own training datasets. The categories person,
place, organization, value and time were evaluated in datasets that have as main
textual genres: news, memorandums, e-mails, interviews and magazine articles;
and the person category was evaluated in clinical notes and police texts.
      </p>
      <p>
        This paper presents the initial e orts to adapt the system CRF+LG [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] for
many textual genres in accordance with this proposed task in IberLEF 2019.
CRF+LG is a hybrid system for Portuguese NER that combines a labeling
obtained by a Conditional Random Fields (CRF) with a term classi cation
obtained by a Local Grammar (LG) as an additional informed feature. The idea
of this system was to study a way to improve the performance of NER systems
that use the machine learning approach using less training corpus. In order to
participate in the IberLEF 2019, we observed some datasets from di erent
textual genres, we also adapted the LG and retrained the model with an augmented
training corpus.
      </p>
      <p>The remaining of this paper is organized as follows. In Section 2 we discuss
some of the more related works which both support some of our arguments and
complement some point of view we discuss in this paper. The methodology is
explained in the Section 3. Within this section we enumerate each of the necessary
steps to perform the training and testing and we describe the adaptations made
in this architecture to the IberLEF. We also introduce some challenges we had
found within the datasets used for training which decrease the performance of
the learning process. The Section 4 discusses the results yielded by our algorithm
which was run by the IberLEF organizers. We also discuss some aspects faced
when dealing with cross domain datasets. Our conclusions are presented in the
Section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        Named Entity Recognition systems can be developed using the following
approaches: linguistics [
        <xref ref-type="bibr" rid="ref17 ref22">17, 22</xref>
        ], machine learning [
        <xref ref-type="bibr" rid="ref25 ref29 ref4">4, 25, 29</xref>
        ] or hybrid [
        <xref ref-type="bibr" rid="ref19 ref30">19, 30</xref>
        ]. Some
of the main NER systems for Portuguese will be described below.
      </p>
      <p>
        The system proposed by [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] is based on the CharWNN Deep Neural
Network, which uses word-level and character-level representations to perform
sequential classi cation. The system was tested for the Portuguese and Spanish
and, for the Portuguese, the GC of the First HAREM was used as training
set and the MiniHAREM as the test set. The approach was compared to the
ETLC M T system [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], an ensemble method based on Entropy Guided
Transformation Learning (ETL) and outperformed this system in both total (10
categories of HAREM) and selective (categories person, place, organization, time
and value) scenarios.
      </p>
      <p>
        A Deep Neural Network architecture with word-level and character-level
representations was also used in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. A combination of these representations is fed
into a bidirectional Long Short-Term Memory with Conditional Random Fields
(Bi-LSTM-CRF) to perform sequential classi cation. The authors evaluated
different combinations of hyperparameters for training such as word embeddings
model, tagging schemes, word capitalization feature and number of hidden units
for each LSTM, obtaining the optimal values for the parameters that had a
greatest impact in the performance of the model. A very similar architecture was used
by [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] for two sequence labeling tasks (POS-tagging and NER) obtaining very
close results.
      </p>
      <p>
        A hybrid approach to Portuguese NER is presented in [
        <xref ref-type="bibr" rid="ref18 ref21">18, 21</xref>
        ] using the
machine learning approach CRF [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and the linguistics approach LG [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The
classi cation obtained from LG was sent as an additional feature for the learning
process of the CRF prediction model. The CRF model assigns the nal label of
the NEs. This approach is a good way to take into account the human expertise
for capturing the rules that do not appear in examples of the annotated corpus
used for training by the CRF. A study about the boundaries of CRF's
performance when using a result coming from any other classi er as an additional
feature was also presented.
      </p>
      <p>
        The systems that used Neural Networks [
        <xref ref-type="bibr" rid="ref25 ref4 ref7">4, 7, 25</xref>
        ] presented superior results
using massive corpora for unsupervised learning of features, which was not the
case of the work presented in [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. However, the results obtained by [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]
outperform the results of systems reported in the literature that were evaluated under
equivalent conditions: a system that uses only CRF [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and the system based on
the CharWNN presented in [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] without the unsupervised pre-training.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>
        In order to participate in the IberLEF 2019, we have used the architecture of our
system CRF+LG[
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. CRF+LG does not use massive corpora for unsupervised
learning of features. The LG is a good way to take into account the human
expertise for capturing the rules and a way to perform the NER using the linguistic
approach when there is no available training corpus. The Figure 1 presents an
overview of the methodology used, demonstrating how the steps to perform the
training occur.
      </p>
      <p>Initially, each input le goes through the sentence segmentation process (step
1). Segmentation was performed using the Unitex (http://unitexgramlab.org/)
tool. Unitex uses LGs to describe the di erent ways that indicate the end of a
sentence. For this work, the LG that performs sentence segmentation in Unitex
has been changed so as not to segment sentences in a colon (:) and semicolon
(;). This exibility is a strength of the tool.</p>
      <p>A copy of the targeted les has their tags removed since the CD used has
the NEs markings (step 2). The LG built in this work is applied to these
les without any marking and the NEs identi ed by it are annotated (step
3). On the other hand, the segmented les are tokenized using the OpenNLP
(http://opennlp.apache.org/) library (step 4). This library is based on machine
learning and performs common NLP tasks such as segmentation, tokenization,
POS-Tagging, etc.</p>
      <p>
        In order to represent the NER as a sequence labeling problem, a label must be
assigned to each token of the text. The BIO notation was used (steps 4 and 5). In
the sequence, several features [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] are added for each token of the les, including
the NE label previously assigned by the LG (step 6). These characteristics are
used during supervised learning of the CRF prediction model (step 7).
      </p>
      <p>The methodology used for testing is similar, but the input les do not have
the NEs tags. In addition to the les containing the tokens and features, the
CRF receives the previously trained model to predict a label for each token.</p>
      <p>The next two sections have a short description of how the system obtains a
tip by the LG and explain how CRF works. In the last section we described the
adaptations made to participate in the IberLEF event.
3.1</p>
      <sec id="sec-3-1">
        <title>Local Grammars (LG)</title>
        <p>An LG created in Unitex is represented as a set of one or more graphs. The
LG used by CRF+LG consists of 10 graphs, one for each of the NEs categories
considered by HAREM.</p>
        <p>We observed in the training le in which context each type of NE appeared,
what words could somehow indicate the existence of NE to construct each graph.
We observed that, for example, words with the rst letter capitalized preceded
by the preposition em (in) were labeled as place. We also observed that some
NEs of the person category are preceded by words such as diz (say), explicou
(explained), a rmou (said), etc.</p>
        <p>Thus, the graphs created capture some simple heuristics to the recognition of
NEs in the training set. An example of rule in the graph created for the person
category is presented in Figure 2.
This graph recognizes words such as diz (say) or a rmou (said) followed by
words with the rst letter capitalized, as identi ed by the code &lt; FIRST &gt; in
Unitex dictionaries. Among words with the rst letter capitalized, prepositions
may appear whose recognition has been previously detailed in graph
Preposicao.grf included as subgraph. Examples of occurrences identi ed by this graph
were: diz &lt; PESSOA &gt; Moncef Kaabi &lt; =PESSOA &gt;
a rmou &lt; PESSOA &gt; Jose SOCRATES &lt; =PESSOA &gt;
a rma &lt; PESSOA &gt; Jason Knight &lt; =PESSOA &gt; .</p>
        <p>Note that identi ed person will appear between the tags &lt; PESSOA &gt; (
&lt; PERSON &gt; ) and &lt; =PESSOA &gt; in the concordance le containing the list
of occurrences identi ed.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Conditional Random Fields (CRF)</title>
        <p>
          Conditional Random Fields (CRF) is a machine learning method for structured
prediction proposed by [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. It is used for labeling of sequential data based on a
conditional approach.
        </p>
        <p>Let X = (x1; x2; :::; xn) be a sequence of words in a text, we want to determine
the best sequence of labels Y = (y1; y2; :::; yn) for these words, corresponding
to the categories of NEs (10 categories of the HAREM or the label O in this
work). The CRF models a conditional distribution p(Y jX) that represents the
probability of obtaining the output Y given the input X.</p>
        <p>
          In this work, we used a linear-chain CRF that predict the output variables
Y as a sequence for sequences of input variables X. According to [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ], a
linearchain CRF is a conditional distribution that takes the form shown in Equation
1:
where Z(x) is a normalization function given by Equation 2
p(yjx) =
        </p>
        <p>1 YT exp
Z(x) t=1
( K</p>
        <p>X kfk (yt; yt 1; xt)</p>
        <p>)
k=1</p>
        <p>T
Z(x) = X Y exp
( K</p>
        <p>X kfk (yt; yt 1; xt)</p>
        <p>)</p>
        <p>F = ffk(yt; yt 1; xt)gkK=1 is a set of feature functions that must be xed
according to the problem. An example is a function which takes the value 1
when the word begins with a capitalized letter (component of the input vector
xt ), its label is Person (yt ) and the previous label (yt 1) is Other and 0
otherwise. The vector xt contains all the components of the global observations
x that are needed for computing features at time t. = k is a vector of weights
that must be estimated from the training set. This is usually done by maximum
likelihood learning. The weights depend on each feature function and the more
discriminating the function, the higher its computed weight will be.</p>
        <p>The MALLET (http://mallet.cs.umass.edu/) toolkit was used in this work
to estimate the vector of weights and then apply the CRF model obtained to
label the test set. This CRF model combines the weights of each feature function
to determine the probability of a certain value (yt).
CRF+LG was built to recognize the 10 named entities categories of the HAREM
(person, place, organization, value, time, event, abstraction, work, thing and
other ). Then, the system was initially adaptated to consider only the ve
categories of the IberLEF (person, place, organization, value and time ) during the
CRF training phase. Nevertheless, we have kept the recognition of the 10
categories by the LG because we believe that this helps the system to disambiguate
NEs.</p>
        <p>
          The Golden Collection of the First and Second HAREM, considered as a
reference for Named Entity Recognition systems in Portuguese, were used in
previous experiments [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] as training and testing sets, respectively, for evaluation
of the CRF+LG. Several errors occurred due to some inconsistencies in the GC
of the First HAREM and Second HAREM. For example, in the GC of the First
HAREM, strings as 2004 preceded by the preposition em (in) are considered
NEs of the Time category and the CRF+LG learned this and labeled all similar
strings preceded by em as Time. However, in the GC of the Second HAREM,
the preposition em is part of the NE. So all these NEs were wrongly labeled.
The same happened in other situations of the categories time, value and person.
        </p>
        <p>
          Some of these major inconsistencies were removed by Pirovani [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] and others
were removed during this work. The goal was to get a more consistent dataset,
normalized, composed of the three GCs of the HAREM (First HAREM, Mini
HAREM and Second HAREM) to use as training.
        </p>
        <p>
          The GCs of the HAREM include documents from di erent textual genres
such as news, web texts, literary ction, transcribed oral interviews, technical
texts, journalistic and personal blog, essays and FAQ questions [
          <xref ref-type="bibr" rid="ref12 ref27">12, 27</xref>
          ].
However, the task of the IberLEF proposes to evaluate the systems in other speci c
textual genres such as memorandums, e-mails, magazine articles, clinical notes
and police texts.
        </p>
        <p>
          In order to train CRF+LG to this task, we have researched and reviewed
other corpus from di erent textual genres in Portuguese:
1. SIGARRA [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]: SIGARRA corpus has 905 articles, manually annotated
using eight NEs categories: hour, event, organization, course, person, location,
date and organic unit.
2. WikiNER [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]: This corpus is a silver-standard automatically annotated
containing three di erent NEs annotated: person, location and organization. We
created 592 subsets and reviewed 40 parts including annotation for value and
time for NEs and adjusting the automatic annotation mistakes.
3. LeNER-BR [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]: LeNER-BR was manually annotated with a focus on legal
documents. This dataset has 70 documents with the following categories of
NEs: organization, person, time, locations, law and decisions regarding law
cases.
4. aTribuna [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]: This dataset has 100 newspaper documents with 2714 NEs
person manually annotated.
5. administrative orders (http://gedoc.ifes.edu.br/): We also annotated
manually 20 administrative orders of the Instituto Federal de Educaca~o, Ci^encia
e Tecnologia do Esp rito Santo (IFES).
        </p>
        <p>Our initial intention was to use these datasets to 1) identify new rules to
insert into LG and 2) combine them to increase the training set and thus improve
the model prediction. However, some inconsistencies observed between the GCs
of the HAREM and others such as LeNER and SIGARRA made it di cult to
integrate all these datasets to create a unique training set.</p>
        <p>The LG used in CRF+LG was built by analyzing only the CD of the First
HAREM. By analyzing some texts of these new domains, we observed some very
strict patterns for writing of NEs and several adaptations have been introduced
at LG to recognize these patterns. Here are some examples:
1. Sequences of words with the rst letter capitalized or numbers beginning
with words such as Sala, Sala~o, Auditorio and An teatro as place category.
2. Recognition of dates (time category) with dots (25.12.2010).
3. Recognition of dates preceded by words such as ate, a partir de, entre, dia
and desde.
4. Recognition of values preceded by abbreviations or words such as num. N.,
art. Art., matr cula and siape.</p>
        <p>One of the main inconsistencies observed among the datasets was the di
erent categories of NEs annotated. For example, the SIGARRA corpus does not
contain the value category annotated, however there are NEs of this category in
the texts. Another example of inconsistency are the NEs annotated in di erent
ways. Sometimes speci c words in lowercase letters should form part of NEs
and other times not. For example, rainha (queen) in rainha Elizabeth (queen
Elizabeth) and mais de (more than) in mais de 30 (more than 30). This
certainly deteriorate the model learning because of the lack of correct or consistent
annotation.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experiment Result</title>
      <p>
        Before submitting the system to the IberLEF, we repeated some of the
experiments performed in [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Initially, the LG built in [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and the new version of our
LG submitted to IberLEF were applied individually to the GC of the Second
HAREM to evaluate the new rules inserted.
      </p>
      <p>Although the precision value obtained by adapted LG was lower indicating
that more NEs have been misidenti ed (false positives) due to the new rules,
these rules also increased the recall value in 9 percentage points. Thus, the gain
obtained by adapted LG in comparison to the original LG was approximately 7
percentage points in F-measure. The decrease in the precision metric is some of
the e ect faced when we change the domain of the dataset used for testing. This
experiment only suggests that the continuing adaption of the LG is a necessity.</p>
      <p>CRF+LG was also rerun using the adapted LG. The GCs of the First HAREM
and Second HAREM were used as training and testing sets respectively. The nal
gain in F-measure was about 4 percentage points achieving 63.11% in F-measure.
These results are another example of how the combination CRF+LG can
improve the NER. In this experiment we were able to identify 31 more entities due
to the new version of the LG.</p>
      <p>We also performed some experiments combining several of the datasets
presented in the previous Section (GCs HAREM normalized, SIGARRA, selected
sentences from WikiNER, aTribuna and administrative orders) for use as
training set. The CRF+LG prediction models were obtained for all combinations
and applied in a testing set that we have created for this purpose. This dataset
contains only 15 texts from di erent textual genres annotated. The model that
presented the best results in this initial test was submitted to IberLEF. This
model was trained with the GCs HAREM normalized and the 20 administrative
orders.
4.1</p>
      <sec id="sec-4-1">
        <title>IberLEF Task Results</title>
        <p>The IberLEF organizers evaluated the submitted systems in two manually
annotated datasets: the Clinical dataset with 50 sentences and 77 NEs and the Police
dataset from the Brazil's Federal Police with 1388 sentences and 916 NEs. Both
datasets were annotated with only the person category. The systems were also
evaluated in the General dataset containing the SIGARRA dataset with NEs
categories date and time mapped to as a single category time and a subset of
sentences from the GC of the Second HAREM (SecHAREM) annotated with
only the value category since SIGARRA does not have this category annotated.</p>
        <p>
          The IberLEF organizers used the precision (P), recall (R) and F-measure
(F) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] metrics and computed the results using the CoNLL-2002's standard
evaluation script (http://www.cnts.ua.ac.be/conll2002/ner/bin/conlleval.txt). The
results to our model are exposed in Table 1.
        </p>
        <p>Corpus Category
Police Dataset PER
Clinical Dataset PER</p>
        <p>In the rst column, we have the list of datasets: the Police dataset in the rst
line, followed by Clinical dataset, and the combined SIGARRA +SecHAREM.
Whereas for the two rst datasets only the person entity was evaluated, for the
combined dataset all the ve entities were evaluated: ORG { organization, PER
{ person, PLC { location, TME { time and VAL { value.</p>
        <p>The best result obtained by our approach was on the identi cation of the
value category (81.34% in F-measure) in the last line of the Table 1 for the
General dataset, whereas our worst result was on identifying the person category
for the Clinical dataset, in the second line (11.83% in F-measure).</p>
        <p>Note that, based on the results depicted in Table 1, our approach did not
achieve the same gures level on the two rst datasets as we could get on
the Overall evaluation when testing on the combined dataset. Although these
datasets (Police and Clinical) were not divulged by the IberLEF organization
because the information is of a sensitive nature, we imagine that these results are
due to NEs with structures very di erent from those for which this system was
trained. The Clinical dataset, for example, has a textual structure with words
that should be separated by a space and they are not, several medical
abbreviations of unusual terms and odd sequences of special characters (AnaR1 and
###Paulo as person names). In order to recognize these very speci c structures
the system would need to be trained in texts from that same domain or have
knowledge of those structures to insert into LG.</p>
        <p>The results obtained in the General dataset were a bit better. The results
for the value category exceeded 81 percentage points in F-measure and for the
time category exceeded 73 percentage points in the same metric. NEs of these
categories have better de ned structures that are easier to capture in the LG
rules and easier to learn by the CRF.</p>
        <p>In order to understand our results better, we applied the CRF+LG model
to the General dataset (https://github.com/jneto04/iberlef-2019) released by
the organization. By analyzing the results obtained, we observed that many
of the NEs of the value category have words such as mais de (more than), cerca
de (about), aproximadamente (approximately) and until (ate) which should be
part of the NE. However, with the purpose of normalizing the three GCs of
the HAREM to use as a training set, these words were removed. So, instead
of recognizing sequences such as mais de 800 milh~oes, cerca de 600 km,
aproximadamente 1,4 tonelada e ate 120 kg, CRF+LG recognized 800 milh~oes, 600
km, 1,4 tonelada e 120 kg, decreasing the value of the metrics.</p>
        <p>CRF+LG recognized sequences preceded by words such as Faculdade
(College), Universidade (University), Instituto (Institute) and Departamento
(Department) as organization (Faculdade de Ci^encias Medicas da Universidade Nova
de Lisboa, Departamento de Qu mica). However, the IberLEF organization did
not consider the organic unit category of the SIGARRA as an organization.</p>
        <p>We also believe that the use of the 20 administrative orders as training set
may have somewhat impaired the recognition of words in capital letters since
many NEs are written in uppercase in this dataset.</p>
        <p>It is important to note that the results obtained by the systems should not
be directly compared as the participants used di erent training corpora. In this
case, the CRF+LG also did not use massive corpora for unsupervised learning
of features. In order to compare the techniques used by the systems, they must
be trained in the same dataset and under equivalent conditions.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>
        This paper is a result of the IberLEF task force which the objective is to evaluate
intelligent algorithm models on the NER problem in many textual genres. Our
proposed model used the combination of two strategies: a supervised learning
algorithm named CRF, and a tailored set of LGs used here to give tips to the
former algorithm. In [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] we discussed that the more valuable tips we o er to
the CRF algorithm, the better is its performance.
      </p>
      <p>In this paper we present the results yielded by the IberLEF organizers when
running our model over the three datasets used to compare the participating
systems. Two of these datasets were used solely to compute the performance
of the submitted algorithms on automatically annotating the person entity on
police texts and clinical notes.</p>
      <p>The LG adapted in this work for use with the CRF+LG approach obtained
a gain of 7 percentage points in F-measure in comparison to the original LG
and a nal gain of approximately 4 percentage points combined with the CRF
according to the experiments presented in Section 4. These results show the
potential of LG for use in the NER task and the necessity of the continuous
adaptation of the LG.</p>
      <p>The results obtained by the CRF+LG in the IberLEF task, especially for
the Police and Clinical datasets, show the di culty of the NER in new textual
genres containing very speci c structures that di er from those for which the
system was trained. Our F-measure metric was below 12 percentage points in
the Clinical dataset that presents particular challenges.</p>
      <p>We observed some errors when analyzing the result obtained by the CRF+LG
in the General dataset that could be avoided if we knew in advance which words
should or should not be part of the NEs. In this way, LG and the training dataset
could be tailored for this.</p>
      <p>We claim that the IberLEF is a milestone towards on building a more uniform
and better way to compare di erent approaches, measure their results and build
better datasets for experimentation.</p>
      <p>As a possible future work we think of better understanding how to decrease
the impact of increasingly learning from a di erent domain. The idea is that
a learning model from one domain can be cheaply used into another domain
without a great impact observed in this paper. Besides, the preprocessing stage
of the algorithms has also a great impact on the results. We are working on a way
to introduce an intelligence layer within this stage in order to quickly learn the
di erent textual genres and thus reduce the mistakes we also could nd during
the experiments carried out in this work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Amaral</surname>
            ,
            <given-names>D.O.F.: O</given-names>
          </string-name>
          <string-name>
            <surname>Reconhecimento de Entidades Nomeadas por Meio de Conditional Random</surname>
          </string-name>
          <article-title>Fields para a L ngua Portuguesa</article-title>
          .
          <article-title>Master's thesis, Pontif cia Universidade Catolica do Rio Grande do Sul, Porto Alegre</article-title>
          ,
          <string-name>
            <surname>Brasil</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Araujo</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campos</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliveira</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Stau er, M.,
          <string-name>
            <surname>Couto</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bermejo</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>LeNERBr: a Dataset for Named Entity Recognition in Brazilian Legal Text</article-title>
          .
          <source>In: International Conference on the Computational Processing of Portuguese (PROPOR)</source>
          . pp.
          <volume>313</volume>
          {
          <fpage>323</fpage>
          .
          <source>Lecture Notes on Computer Science (LNCS)</source>
          , Springer, Canela,
          <string-name>
            <surname>RS</surname>
          </string-name>
          ,
          <source>Brazil (September</source>
          <volume>24</volume>
          -26
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ribeiro-Neto</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          : Recuperaca~o de Informaca~
          <fpage>o</fpage>
          - 2ed
          <string-name>
            <surname>: Conceitos e Tecnologia das Maquinas de Busca. Bookman Editora</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Castro</surname>
            ,
            <given-names>P.V.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>da Silva</surname>
            ,
            <given-names>N.F.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>da Silva</surname>
            <given-names>Soares</given-names>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Portuguese Named Entity Recognition Using LSTM-CRF</article-title>
          .
          <article-title>In: Villavicencio A</article-title>
          . et al. (
          <article-title>eds) Computational Processing of the Portuguese Language</article-title>
          .
          <source>PROPOR 2018. Lecture Notes in Computer Science</source>
          , vol
          <volume>11122</volume>
          . pp.
          <volume>83</volume>
          {
          <fpage>92</fpage>
          . Springer, Cham, Canela,
          <string-name>
            <surname>RS</surname>
          </string-name>
          (Sep
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Chan</surname>
            ,
            <given-names>Y.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roth</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Exploiting Syntactico-Semantic Structures for Relation Extraction</article-title>
          . In:
          <article-title>Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1</article-title>
          . pp.
          <volume>551</volume>
          {
          <fpage>560</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. Cheng, T.,
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>K.C.C.</given-names>
          </string-name>
          :
          <article-title>Supporting Entity Search: a Large-scale Prototype Search Engine</article-title>
          .
          <source>In: Proceedings of the 2007 ACM SIGMOD international conference on Management of data</source>
          . pp.
          <volume>1144</volume>
          {
          <fpage>1146</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Costa</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paetzold</surname>
            ,
            <given-names>G.H.:</given-names>
          </string-name>
          <article-title>E ective Sequence Labeling with Hybrid Neural-CRF Models</article-title>
          .
          <article-title>In: Villavicencio A</article-title>
          . et al. (
          <article-title>eds) Computational Processing of the Portuguese Language</article-title>
          .
          <source>PROPOR 2018. Lecture Notes in Computer Science</source>
          , vol
          <volume>11122</volume>
          . pp.
          <volume>490</volume>
          {
          <fpage>498</fpage>
          . Springer, Cham, Canela,
          <string-name>
            <surname>RS</surname>
          </string-name>
          (Sep
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Doddington</surname>
            , G.R., Mitchell,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Przybocki</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramshaw</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strassel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weischedel</surname>
            ,
            <given-names>R.M.:</given-names>
          </string-name>
          <article-title>The Automatic Content Extraction (ACE) Program-Tasks, Data, and Evaluation</article-title>
          . In: LREC. vol.
          <volume>2</volume>
          , p.
          <fpage>1</fpage>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources</surname>
          </string-name>
          <article-title>Association (ELRA), Lisboa</article-title>
          ,
          <string-name>
            <surname>PORTUGAL</surname>
          </string-name>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Gross</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Construction of Local Grammars</article-title>
          . In ROCHE, E.; SCHABES,
          <string-name>
            <surname>Y</surname>
          </string-name>
          . (eds.).
          <article-title>Finite-state language processing</article-title>
          , Language, Speech, and Communication, Cambridge, Mass. pp.
          <volume>329</volume>
          {
          <issue>354</issue>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>La</surname>
            <given-names>erty</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pereira</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data</article-title>
          .
          <source>In: Proceedings of the Eighteenth International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2001</year>
          . vol.
          <volume>1</volume>
          , pp.
          <volume>282</volume>
          {
          <fpage>289</fpage>
          . ACM, San Francisco, CA, USA (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. Linguateca: (
          <year>2018</year>
          ), http://www.linguateca.pt/HAREM/, acesso em:
          <volume>02</volume>
          /03/2018
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Mota</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Desa os na Avaliac~ao Conjunta do Reconhecimento de Entidades Mencionadas: O Segundo HAREM</article-title>
          .
          <string-name>
            <surname>Linguateca</surname>
          </string-name>
          (
          <year>2008</year>
          ), https://www.linguateca.pt/LivroSegundoHAREM/
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. MUC-7: MUC-7
          <string-name>
            <surname>Proceedings</surname>
          </string-name>
          (
          <year>2016</year>
          ), acesso em:
          <volume>11</volume>
          /10/2018
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <source>NIST: Text Analysis Conference (TAC)</source>
          (
          <year>2018</year>
          ), https://tac.nist.gov/2018/index.html, acesso em:
          <volume>24</volume>
          /05/2018
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Nothman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ringland</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radford</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Curran</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Learning Multilingual Named Entity Recognition from Wikipedia</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>194</volume>
          ,
          <fpage>151</fpage>
          {
          <fpage>175</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Pires</surname>
            ,
            <given-names>A.R.O.</given-names>
          </string-name>
          :
          <article-title>Named Entity Extraction from Portuguese Web Text</article-title>
          .
          <source>Ph.D. thesis</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Pirovani</surname>
            ,
            <given-names>J.P.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliveira</surname>
          </string-name>
          , E.: Extrac~ao de Nomes de
          <article-title>Pessoas em Textos em Portugu^es: uma Abordagem Usando Gramaticas Locais</article-title>
          . In: Computer on the Beach
          <year>2015</year>
          . pp.
          <volume>1</volume>
          {
          <fpage>10</fpage>
          . SBC, Florianopolis,
          <source>SC (March</source>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Pirovani</surname>
            ,
            <given-names>J.P.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliveira</surname>
          </string-name>
          , E.:
          <article-title>CRF+LG: A Hybrid Approach for the Portuguese Named Entity Recognition</article-title>
          . In: Abraham A.,
          <string-name>
            <surname>Muhuri</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muda</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gandhi</surname>
            <given-names>N</given-names>
          </string-name>
          . (
          <article-title>eds) Intelligent Systems Design and Applications (ISDA</article-title>
          <year>2017</year>
          ).
          <source>Advances in Intelligent Systems and Computing</source>
          . vol.
          <volume>736</volume>
          , pp.
          <volume>102</volume>
          {
          <fpage>113</fpage>
          . Springer, Cham, Delhi, India (
          <year>2017</year>
          ). https://doi.org/https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -76348-4 11
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Pirovani</surname>
            ,
            <given-names>J.P.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliveira</surname>
          </string-name>
          , E.:
          <article-title>Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars</article-title>
          . In: chair),
          <string-name>
            <given-names>N.C.C.</given-names>
            ,
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Cieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Goggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Hasida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Isahara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Mazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Piperidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Tokunaga</surname>
          </string-name>
          , T. (eds.)
          <source>Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ).
          <article-title>European Language Resources Association (ELRA), Miyazaki</article-title>
          , Japan (may
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Pirovani</surname>
            ,
            <given-names>J.P.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spalenza</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliveira</surname>
          </string-name>
          , E.: Gerac~ao Automatica de Questo~
          <article-title>es a Partir do Reconhecimento de Entidades Nomeadas em Textos Didaticos</article-title>
          .
          <source>In: XXVIII Brazilian Symposium on Computers in Education (Simposio Brasileiro de Informatica na Educaca~o - SBIE</source>
          <year>2017</year>
          ). vol.
          <volume>28</volume>
          , pp.
          <volume>1147</volume>
          {
          <fpage>1156</fpage>
          .
          <string-name>
            <surname>Sociedade Brasileira de Computao - SBC</surname>
          </string-name>
          , Recife, Brasil (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Pirovani</surname>
            ,
            <given-names>J.P.C.</given-names>
          </string-name>
          :
          <article-title>CRF+ LG: Uma Abordagem H brida para o Reconhecimento de Entidades Nomeadas em Portugu^es</article-title>
          .
          <source>Ph.D. thesis</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Rocha</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jorge</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sionara</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brito</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pimenta</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rezende</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>PAMPO: Using Pattern Matching and Pos-tagging for E ective Named Entities Recognition in Portuguese (</article-title>
          <year>2016</year>
          ), http://arxiv.org/abs/1612.09535
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Sandra</surname>
            <given-names>Collovini</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Joaquim</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.C.J.T.R.V.P.</given-names>
            <surname>Q.M.S.D.B.C.R.G.C.C.</surname>
          </string-name>
          <article-title>a.X.: Portuguese Named Entity Recognition and Relation Extraction Tasks at IberLEF</article-title>
          <year>2019</year>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Sang</surname>
            ,
            <given-names>E.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meulder</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Introduction to the CoNLL-2003 Shared Task: LanguageIndependent Named Entity Recognition</article-title>
          .
          <source>In: Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4</source>
          . pp.
          <volume>142</volume>
          {
          <fpage>147</fpage>
          . Association for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>C.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guimaraes</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Boosting Named Entity Recognition with Neural Character Embeddings</article-title>
          .
          <source>In: Proceedings of the Fifth Named Entities Workshop</source>
          ,
          <string-name>
            <surname>ACL</surname>
          </string-name>
          <year>2015</year>
          . pp.
          <volume>25</volume>
          {
          <fpage>33</fpage>
          . Association for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>C.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Milidiu</surname>
            ,
            <given-names>R.L.</given-names>
          </string-name>
          :
          <article-title>Entropy Guided Transformation Learning: Algorithms</article-title>
          and Applications. Springer-Verlag London, London, United
          <string-name>
            <surname>Kingdom</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cardoso</surname>
          </string-name>
          , N.:
          <string-name>
            <surname>Reconhecimento de Entidades</surname>
          </string-name>
          <article-title>Mencionadas em Portugu^es: Documentaca~o e Actas do HAREM, a Primeira Avaliaca~o Conjunta na Area</article-title>
          .
          <source>Linguateca</source>
          (
          <year>2007</year>
          ), http://www.linguateca.pt/aval conjunta/LivroHAREM/LivroSantosCardoso2007.pdf
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Sutton</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>An Introduction to Conditional Random Fields</article-title>
          .
          <source>Foundations and Trends R in Machine Learning</source>
          <volume>4</volume>
          ,
          <volume>267</volume>
          {
          <fpage>373</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          :
          <article-title>Neural Reranking for Named Entity Recognition</article-title>
          .
          <source>arXiv preprint arXiv:1707.05127</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Blissett</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Kazemi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Whitehead</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Ji</surname>
          </string-name>
          , H.:
          <article-title>RPI BLENDER TAC-KBP2017 13 Languages EDL System</article-title>
          .
          <source>In: Proceedings of the Tenth Text Analysis Conference (TAC2017)</source>
          . NIST, Maryland, USA (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>