<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ER
no-case-sensitive .</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pierpaolo Basile</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Semeraro</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pierluigi Cassotti</string-name>
          <email>pierluigicassotti@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Bari Aldo Moro Via</institution>
          ,
          <addr-line>E. Orabona, 4 - 70125 Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>8286</year>
      </pub-date>
      <volume>8182</volume>
      <abstract>
        <p>English. In this paper, we propose a Deep Learning architecture for sequence labeling based on a state of the art model that exploits both word- and characterlevel representations through the combination of bidirectional LSTM, CNN and CRF. We evaluate the proposed method on three Natural Language Processing tasks for Italian: PoS-tagging of tweets, Named Entity Recognition and Super-Sense Tagging. Results show that the system is able to achieve state of the art performance in all the tasks and in some cases overcomes the best systems previously developed for the Italian. Italiano. In questo lavoro viene descritta un'architettura di Deep Learning per l'etichettatura di sequenze basata su un modello allo stato dell'arte che utilizza rappresentazioni sia a livello di carattere che di parola attraverso la combinazione di LSTM, CNN e CRF. Il metodo e` stato valutato in tre task di elaborazione del linguaggio naturale per la lingua italiana: il PoS-tagging di tweet, il riconoscimento di entita` e il Super-Sense Tagging. I risultati ottenuti dimostrano che il sistema e` in grado di raggiungere prestazioni allo stato dell'arte in tutti i task e in alcuni casi riesce a superare i sistemi precedentemente sviluppati per la lingua italiana.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        ing (NLP) several DL architectures have been
proposed to solve many tasks, ranging from speech
recognition to parsing. Some typical NLP tasks
can be solved as sequence labeling problem, such
as part-of-speech (PoS) tagging and Named Entity
Recognition (NER). Traditional high performance
NLP methods for sequence labeling are linear
statistical models, including Conditional Random
Fields (CRF) and Hidden Markov Models (HMM)
        <xref ref-type="bibr" rid="ref15 ref17 ref18 ref20 ref21 ref26">(Ratinov and Roth, 2009; Passos et al., 2014; Luo
et al., 2015)</xref>
        , which rely on hand-crafted features
and task/language specific resources. However,
developing such task/language specific resources
has a cost, moreover it makes difficult to adapt
the model to new tasks, new domains or new
languages. In
        <xref ref-type="bibr" rid="ref11 ref16 ref8">(Ma and Hovy, 2016)</xref>
        , the authors
propose a state of the art sequence labeling method
based on a neural network architecture that
benefits from both word- and character-level
representations through the combination of bidirectional
LSTM, CNN and CRF. The method is able to
achieve state of the art performance in sequence
labeling tasks for the English without the use of
hand-crafted features.
      </p>
      <p>In this paper, we exploit the aforementioned
architecture for solving three NLP tasks in Italian:
PoS-tagging of tweets, NER and Super Sense
Tagging (SST). Our research question is to prove the
effectiveness of the DL architecture in a different
language, in this case Italian, without using
language specific features. The results of the
evaluation prove that our approach is able to achieve
state of the art performance and in some cases it is
able to overcome the best systems developed for
the Italian without the usage of specific language
resources.</p>
      <p>
        The paper is structured as follows: Section 2
provides details about our methodology and
summarizes the DL architecture proposed in
        <xref ref-type="bibr" rid="ref11 ref16 ref8">(Ma and
Hovy, 2016)</xref>
        , while Section 3 shows the results of
the evaluation. Final remarks are reported in
Section 4.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Methodology</title>
      <p>
        Our approach relies on the DL architecture
proposed in
        <xref ref-type="bibr" rid="ref11 ref16 ref8">(Ma and Hovy, 2016)</xref>
        , where the authors
combine two aspects previously exploited
separately: 1) the use of a character-level
representation
        <xref ref-type="bibr" rid="ref13 ref5 ref7">(Chiu and Nichols, 2015)</xref>
        ; 2) the addition
of an output layer based on CRF
        <xref ref-type="bibr" rid="ref13 ref15">(Huang et al.,
2015)</xref>
        . The architecture is sketched in Figure 1:
the input level of the Convolution Neural Network
is represented by the character-level
representation. A dropout layer
        <xref ref-type="bibr" rid="ref24">(Srivastava et al., 2014)</xref>
        is
applied before feeding the CNN with character
embeddings. Then the character embeddings are
concatenated with the word embeddings to form
the input for the Bi-directional LSTM layer. The
dropout layer is also applied to output vectors from
the LSTM layer. The output layer is based on
Conditional Random Fields and it modifies the output
vectors of the LSTM in order to find the best
output sequence. The CRF layer is useful for
learning correlations between labels in neighborhoods,
for example generally a noun follows an article in
PoS-tagging, or the I-ORG tag1 cannot follow the
I-PER tag in the NER task.
      </p>
      <p>The aforementioned architecture can be easily
adapted to other languages since it does not rely
on language dependent features. The only
components outside the architecture are the word
embeddings that can be built by relying on a corpus
of documents of the specific language. In
Section 3, we provide details about the setup of the
architecture parameters and the building of word
embeddings for Italian, in particular we adopt two
different word embeddings: ones for PoS-tagging
and ones for NER and SST. Moreover, we
reimplement2 the architecture by using the Keras3
framework and Tensorflow4 as back-end.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Evaluation</title>
      <p>
        We provide an evaluation in the context of three
sequence labeling tasks: 1) PoS tagging of
Italian tweets; 2) NER of Italian news and 3) Super
Sense Tagging. All tasks are performed using
Italian datasets, in particular we exploit data coming
from the last edition (2016) of EVALITA5
        <xref ref-type="bibr" rid="ref3">(Basile
et al., 2016)</xref>
        and previous ones (2009
        <xref ref-type="bibr" rid="ref17 ref18 ref21 ref26">(Magnini and
Cappelli, 2009)</xref>
        and 20116). EVALITA7 is a
periodic evaluation campaign of Natural Language
Processing (NLP) and speech tools for the Italian
language. The usage of a standard benchmark
allows to compare our system with the state of the
art approaches for the Italian language.
      </p>
      <p>
        Each task has its specific parameters, but there
are some ones that are in common as reported in
Table 1. We do not perform any parameters
optimization and we use the values proposed in the
English evaluation
        <xref ref-type="bibr" rid="ref11 ref16 ref8">(Ma and Hovy, 2016)</xref>
        . We
choose this strategy in order to not reduce the
training set since validation set is not provided in
all the tasks.
3.1
      </p>
      <sec id="sec-3-1">
        <title>PoS tagging of Tweets</title>
        <p>
          The goal of the task is to perform PoS-tagging
of tweets. The task is more challenging with
respect to the classical PoS-tagging due to the short
and noisy nature of tweets. For the evaluation
we adopt the dataset used during the EVALITA
2016 PoSTWITA task
          <xref ref-type="bibr" rid="ref6">(Bosco et al., 2016)</xref>
          in order
2The code is available on line: https://github.
com/pippokill/bilstm-cnn-crf-seq-ita
3https://keras.io/
4https://www.tensorflow.org/
5https://github.com/evalita2016/data
6http://www.evalita.it/2011/working_
notes
7http://www.evalita.it/
to compare our system with the other EVALITA
participants. The dataset contains 6,438 tweets
(114,967 tokens) for training and 300 tweets
(4,759 tokens) for test. The metric used for the
evaluation is the classical tagging accuracy: it is
defined as the number of correct PoS tag
assignment divided by the total number of tokens in the
test set. Participants can predict only one tag for
each token.
        </p>
        <p>All the top-performing PoSTWITA systems are
based on Deep Neural Networks and, in
particular, on LSTM, moreover most systems use word or
character embeddings as inputs for their systems.
This makes other systems more similar to the one
proposed in this paper.</p>
        <p>
          Results of the evaluation are reported in
Table 2, our best approach (UNIBA-twita) is able to
overcome the first three PoSTWITA participants.
(UNIBA-twita) exploits a corpus of 70M tweets
randomly extracted from Twita, a collection of
about 800M tweets, for building the word
embeddings. It is important to underline that the best
system (ILC-CNR)
          <xref ref-type="bibr" rid="ref11 ref16 ref8">(Cimino and Dell’orletta, 2016)</xref>
          in
PoSTWITA uses a biLSTM and a RNN by
exploiting both word and char embeddings, moreover
it use further features based on morpho-syntactic
category and spell checker. The good performance
of our system probably depends by the CRF layer
and the corpus used for building the word
embeddings. This hypothesis is supported by the fact
that the configuration (UNIBA-itwiki) based on
word embeddings extracted from Wikipedia
obtains the worst result. The configuration
UNIBArandom300 adopts random embeddings, we report
this result in order to underline the importance of
pre-trained word embeddings. Moreover, the
second best system (UniDuisburg)
          <xref ref-type="bibr" rid="ref11 ref16 ref8">(Horsmann and
Zesch, 2016)</xref>
          in PoSTWITA exploits a CRF
classifier using several features without a DL
architecture, while the system UniBologna UnOFF
          <xref ref-type="bibr" rid="ref25 ref6">(Tamburini, 2016)</xref>
          uses a BiLSTM with a CRF layer by
exploiting word embeddings and additional
morphological features.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>NER Task</title>
        <p>
          Three tasks about named entities have been
organized during the EVALITA evaluation
campaigns, respectively in 2007
          <xref ref-type="bibr" rid="ref22">(Speranza, 2007)</xref>
          ,
2009
          <xref ref-type="bibr" rid="ref23">(Speranza, 2009)</xref>
          , and 2011
          <xref ref-type="bibr" rid="ref14">(Lenzi et al.,
2013)</xref>
          . In this paper we take into account the 2009
edition since the I-CAB dataset 8used in the
evaluation is the same adopted in 2009. In 2007 a
different version of I-CAB was used, while in 2011 the
task was focused on data transcribed by an ASR
system. The I-CAB dataset consists of a set of
news manually annotated with four kinds of
entities: GPE (geo-political), LOC (location), ORG
(organization) and PER (person). The dataset
contains 525 news for training and 180 for testing for a
total number of 11,410 annotated entities for
training and 4,966 ones for testing. The dataset is
provided in the IOB2 format.
        </p>
        <p>We build word embeddings by exploiting the
Italian version of Wikipedia. Word2vec is used
for creating embeddings with a dimension of 300,
we remove all words that have less than 40
occurrences in Wikipedia, for the other parameters we
adopt the standard values provided by word2vec.</p>
        <p>Results of the evaluation are reported in Table
3 and Table 4. Table 3 reports precision (P),
recall (R) and F1-measure (F1) for different
configurations of the system. In particular:
no-casesensitive does not perform lowercase of words
for both word embeddings and the lookup table,
while case-sensitive does it. The random
configuration randomly initializes embeddings without
using pre-trained embeddings, while no char does
not adopt char embeddings. The results show that
8http://ontotext.fbk.eu/icab.html
the best performance is obtained by applying
lowercase, moreover the contribution of char
embeddings is significant.</p>
        <p>
          Table 4 reports the result of our best
configuration (no-case-sensitive) with respect to the other
EVALITA 2009 participants. The system is able
to outperform the first three EVALITA
participants thanks to the best performance in recall. All
the first three participants adopt classical
classification methods: the first system
          <xref ref-type="bibr" rid="ref26">(Zanoli et al.,
2009)</xref>
          combines two classifiers (HMM and CRF),
the second participant
          <xref ref-type="bibr" rid="ref10">(Gesmundo, 2009)</xref>
          uses a
Perceptron algorithm, while the third participant
          <xref ref-type="bibr" rid="ref18">(Mehdad et al., 2009)</xref>
          adopts Support Vector
Machine and feature selection. We can conclude that
the DL architecture is more effective in the model
generalization and in tackling the data sparsity
problem. This behavior is supported by the good
performance in recognizing LOC entities, in fact
the LOC class represents about the 3% of
annotated entities in both training and test. Other two
systems
          <xref ref-type="bibr" rid="ref19 ref5">(Nguyen and Moschitti, 2012;
Bonadiman et al., 2015)</xref>
          able to overcome the EVALITA
2009 participants have been proposed in the
literature. The former
          <xref ref-type="bibr" rid="ref19">(Nguyen and Moschitti, 2012)</xref>
          achieves the 84.33% of F1 by using re-ranking
techniques and the combination of two
state-ofthe-art NER learning algorithms: conditional
random fields and support vector machines. The latter
          <xref ref-type="bibr" rid="ref5">(Bonadiman et al., 2015)</xref>
          exploits a Deep Neural
Network with a log-likelihood cost function and
a recurrent feedback mechanism to ensure the
dependencies between the output tags. This system
is able to achieves the 82.81% of F1, a
performance comparable with our DL architecture.
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Super Sense Tagging</title>
        <p>
          The Super-Sense Tagging (SST) task
          <xref ref-type="bibr" rid="ref1 ref9">(Dei Rossi
et al., 2011)</xref>
          consists in annotating each significant
entity in a text, like nouns, verbs, adjectives and
adverbs, within a general semantic taxonomy
defined by the WordNet lexicographer classes (called
super-senses, for a total of 45 senses). SST can be
considered as a task half-way between NER and
Word Sense Disambiguation (WSD): it is an
extension of NER, since it uses a larger set of
semantic categories, and it is an easier and more
practical task with respect to WSD. The dataset has been
tagged using the IOB2 format as for the NER task
and contains about 276,000 tokens for training and
about 50,000 for testing. The metric adopted for
the evaluation is the F1, results of the evaluation
are reported in Table 5. As word embeddings we
use the same ones adopted for the NER task and
built upon Wikipedia with lowercase.
        </p>
        <p>System F1
UNIBA-pos-Adagrad .7871
UNIBA-pos .7787
UNIBA .7453
UNIBA-SVMcat .7866</p>
        <p>UNIPI-run3 .7827</p>
        <p>
          The best performance (UNIBA-pos-Adagrad)
is obtained using Adagrad instead of Adadelta
(UNIBA-pos) as optimization method.
Moreover, we exploits PoS-tags as additional features,
while UNIBA uses only tokens and word/char
embeddings. The difference in performance
between UNIBA-pos and UNIBA proves the
effectiveness of the PoS-tag in this task. The best
system in EVALITA 2011 SST task, UNIBA-SVMcat
          <xref ref-type="bibr" rid="ref2 ref4">(Basile, 2013, 2011)</xref>
          , is very close to our best
configuration. This system combines lexical and
distributional features through an SVM classifier,
while the second system (UNIPI-run3)
          <xref ref-type="bibr" rid="ref1">(Attardi
et al., 2011)</xref>
          exploits lexical features and a
Maximum Entropy classifier.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and Future Work</title>
      <p>We propose an evaluation of a state of the art DL
architecture for sequence labeling in the context
of the Italian language. In particular, we consider
three tasks: PoS-tagging of tweets, Named Entity
Recognition and Super-Sense tagging. All tasks
exploit data coming from EVALITA a standard
benchmark for the evaluation of Italian NLP
systems. Our system is able to achieve good
performance in all the tasks without using hand-crafted
features. Analyzing the results, we observe the
importance of building word embeddings on
appropriate corpora and we note that the system in the
SST task is not able to generalize a good model
without the pos-tag feature, this underline the
importance of this kind of feature in the SST task.
As future work, we plan to perform a parameters
optimization by reducing the training set and
using a portion as validation set. Using less data for
training could affect the final performance and it
could be interesting to have insights on the
tradeoff between training on more examples versus the
parameters optimization.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work is partially supported by the project
“Multilingual Entity Liking” funded by the
Apulia Region under the program FutureInResearch.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Giuseppe</given-names>
            <surname>Attardi</surname>
          </string-name>
          , Luca Baronti, Stefano Dei Rossi, and
          <string-name>
            <given-names>Maria</given-names>
            <surname>Simi</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>SuperSense Tagging with a Maximum Entropy Classifier and Dynamic Programming</article-title>
          .
          <source>In Working Notes of EVALITA</source>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>P.</given-names>
            <surname>Basile</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Super-sense tagging using support vector machines and distributional features</article-title>
          .
          <source>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          7689 LNAI:
          <fpage>176</fpage>
          -
          <lpage>185</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>P.</given-names>
            <surname>Basile</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cutugno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nissim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Patti</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>EVALITA 2016: Overview of the 5th evaluation campaign of natural language processing and speech tools for Italian</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <source>CEUR Workshop Proceedings</source>
          , volume
          <volume>1749</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Pierpaolo</given-names>
            <surname>Basile</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>UNIBA: Super-sense Tagging at EVALITA 2011</article-title>
          .
          <source>In Working Notes of EVALITA</source>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Daniele</given-names>
            <surname>Bonadiman</surname>
          </string-name>
          , Aliaksei Severyn, and
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Moschitti</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Deep neural networks for named entity recognition in italian</article-title>
          . In CLiC-it
          <source>2015 Proceedings of the second Italian Conference on Computational Linguistics. page 51.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>C.</given-names>
            <surname>Bosco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Tamburini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bolioli</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Mazzei</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Overview of the EVALITA 2016 Part of speech on twitter for Italian task</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <source>CEUR Workshop Proceedings</source>
          , volume
          <volume>1749</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>Jason PC Chiu and Eric Nichols</source>
          .
          <year>2015</year>
          .
          <article-title>Named entity recognition with bidirectional LSTM-CNNs</article-title>
          .
          <source>arXiv preprint arXiv:1511</source>
          .
          <fpage>08308</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Cimino</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Dell</surname>
          </string-name>
          <article-title>'orletta</article-title>
          .
          <year>2016</year>
          .
          <article-title>Building the state-of-the-art in POS tagging of Italian Tweets</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <source>CEUR Workshop Proceedings</source>
          , volume
          <volume>1749</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Stefano</given-names>
            <surname>Dei</surname>
          </string-name>
          <string-name>
            <surname>Rossi</surname>
          </string-name>
          , Giulia Di Pietro, and
          <string-name>
            <given-names>Maria</given-names>
            <surname>Simi</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>EVALITA 2011: Description and Results of the SuperSense Tagging Task</article-title>
          .
          <source>In Working Notes of EVALITA</source>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Andrea</given-names>
            <surname>Gesmundo</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Bidirectional sequence classification for named entities recognition</article-title>
          .
          <source>In Proceedings of the Workshop Evalita</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>T.</given-names>
            <surname>Horsmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Zesch</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Building a social media adapted PoS tagger using flexTag - A case study on Italian tweets</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>CEUR Workshop Proceedings</source>
          , volume
          <volume>1749</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Zhiheng</given-names>
            <surname>Huang</surname>
          </string-name>
          , Wei Xu,
          <string-name>
            <given-names>and Kai</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Bidirectional LSTM-CRF models for sequence tagging</article-title>
          .
          <source>arXiv preprint arXiv:1508</source>
          .
          <year>01991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Valentina</given-names>
            <surname>Bartalesi</surname>
          </string-name>
          <string-name>
            <surname>Lenzi</surname>
          </string-name>
          , Manuela Speranza, and
          <string-name>
            <given-names>Rachele</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Named entity recognition on transcribed broadcast news at EVALITA 2011</article-title>
          .
          <source>In Revised Papers from EVALITA11: International Workshop on the Evaluation of Natural Language and Speech Tools for Italian</source>
          . Springer, volume
          <volume>7689</volume>
          , pages
          <fpage>86</fpage>
          -
          <lpage>97</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>G.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.-Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Nie</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Joint named entity recognition and disambiguation</article-title>
          . In
          <source>Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing</source>
          . pages
          <fpage>879</fpage>
          -
          <lpage>888</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Xuezhe</given-names>
            <surname>Ma</surname>
          </string-name>
          and
          <string-name>
            <given-names>Eduard</given-names>
            <surname>Hovy</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>End-to-end sequence labeling via bi-directional lstm-cnns-crf</article-title>
          .
          <source>arXiv preprint arXiv:1603</source>
          .
          <fpage>01354</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Bernardo</given-names>
            <surname>Magnini</surname>
          </string-name>
          and
          <string-name>
            <given-names>Amedeo</given-names>
            <surname>Cappelli</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Introduction to Evalita 2009</article-title>
          .
          <source>In Proceedings of the Workshop Evalita</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Yashar</given-names>
            <surname>Mehdad</surname>
          </string-name>
          , Vitalie Scurtu, and
          <string-name>
            <given-names>Evgeny</given-names>
            <surname>Stepanov</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Italian named entity recognizer participation in NER task @ Evalita 09</article-title>
          .
          <source>In Proceedings of the Workshop Evalita</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Truc-Vien T Nguyen</surname>
            and
            <given-names>Alessandro</given-names>
          </string-name>
          <string-name>
            <surname>Moschitti</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Structural reranking models for named entity recognition</article-title>
          .
          <source>Intelligenza Artificiale</source>
          <volume>6</volume>
          (
          <issue>2</issue>
          ):
          <fpage>177</fpage>
          -
          <lpage>190</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Alexandre</given-names>
            <surname>Passos</surname>
          </string-name>
          , Vineet Kumar, and
          <string-name>
            <surname>Andrew McCallum</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Lexicon infused phrase embeddings for named entity resolution</article-title>
          .
          <source>arXiv preprint arXiv:1404</source>
          .
          <fpage>5367</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Lev</given-names>
            <surname>Ratinov</surname>
          </string-name>
          and
          <string-name>
            <given-names>Dan</given-names>
            <surname>Roth</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Design challenges and misconceptions in named entity recognition</article-title>
          .
          <source>In Proceedings of the Thirteenth Conference on Computational Natural Language Learning. Association for Computational Linguistics</source>
          , pages
          <fpage>147</fpage>
          -
          <lpage>155</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Manuela</given-names>
            <surname>Speranza</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Evalita 2007: the named entity recognition task</article-title>
          .
          <source>In Proceedings of the Workshop Evalita</source>
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Manuela</given-names>
            <surname>Speranza</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>The named entity recognition task at evalita 2009</article-title>
          .
          <source>In Proceedings of the Workshop Evalita</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Nitish</given-names>
            <surname>Srivastava</surname>
          </string-name>
          , Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and
          <string-name>
            <given-names>Ruslan</given-names>
            <surname>Salakhutdinov</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Dropout: a simple way to prevent neural networks from overfitting</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          <volume>15</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1929</fpage>
          -
          <lpage>1958</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>F.</given-names>
            <surname>Tamburini</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A BiLSTM-CRF PoS-tagger for Italian tweets using morphological information</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <source>CEUR Workshop Proceedings</source>
          , volume
          <volume>1749</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Zanoli</surname>
          </string-name>
          , Emanuele Pianta, and
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Giuliano</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Named entity recognition through redundancy driven classifiers</article-title>
          .
          <source>In Proceedings of the Workshop Evalita</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>