<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Overview of the EVALITA 2016 Named Entity rEcognition and Linking in Italian Tweets (NEEL-IT) Task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pierpaolo Basile</string-name>
          <email>1pierpaolo.basile@uniba.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Annalina Caputo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anna Lisa Gentile</string-name>
          <email>3annalisa@informatik.uni-mannheim.de</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giuseppe Rizzo</string-name>
          <email>4giuseppe.rizzo@ismb.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre, Trinity Collge Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, University of Bari Aldo Moro</institution>
          ,
          <addr-line>Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Istituto Superiore Mario Boella</institution>
          ,
          <addr-line>Turin</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Mannheim</institution>
          ,
          <addr-line>Mannheim</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. This report describes the main outcomes of the 2016 Named Entity rEcognition and Linking in Italian Tweet (NEEL-IT) Challenge. The goal of the challenge is to provide a benchmark corpus for the evaluation of entity recognition and linking algorithms specifically designed for noisy and short texts, like tweets, written in Italian. The task requires the correct identification of entity mentions in a text and their linking to the proper named entities in a knowledge base. To this aim, we choose to use the canonicalized dataset of DBpedia 201510. The task has attracted five participants, for a total of 15 runs submitted.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Italiano. In questo report descriviamo
i principali risultati conseguiti nel primo
task per la lingua Italiana di Named Entity
rEcognition e Linking in Tweet
(NEELIT). Il task si prefigge l’obiettivo di offrire
un framework di valutazione per gli
algoritmi di riconoscimento e linking di entità
a nome proprio specificamente disegnati
per la lingua italiana per testi corti e
rumorosi, quali i tweet. Il task si compone di
una fase di riconoscimento delle menzioni
di entità con nome proprio nel testo e del
loro successivo collegamento alle
opportune entità in una base di conoscenza. In
questo task abbiamo scelto come base di
conoscenza la versione canonica di
DBpedia 2015. Il task ha attirato cinque
partecipanti per un totale di 15 diversi run.
1</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>
        Tweets represent a great wealth of information
useful to understand recent trends and user
behaviours in real-time. Usually, natural language
processing techniques would be applied to such
pieces of information in order to make them
machine-understandable. Named Entity
rEcongition and Linking (NEEL) is a particularly useful
technique aiming aiming to automatically
annotate tweets with named entities. However, due to
the noisy nature and shortness of tweets, this
technique is more challenging in this context than
elsewhere. International initiatives provide evaluation
frameworks for this task, e.g. the Making Sense of
Microposts workshop (Dadzie et al., 2016) hosted
the 2016 NEEL Challenge
        <xref ref-type="bibr" rid="ref11">(Rizzo et al., 2016)</xref>
        , or
the W-NUT workshop at ACL 2015
        <xref ref-type="bibr" rid="ref2">(Baldwin et
al., 2015)</xref>
        , but the focus is always and strictly on
the English language. We see an opportunity to
(i) encourage the development of language
independent tools for for Named Entity Recognition
(NER) and Linking (NEL) systems and (ii)
establish an evaluation framework for the Italian
community. NEEL-IT at EVALITA has the vision to
establish itself as a reference evaluation
framework in the context of Italian tweets.
2
      </p>
    </sec>
    <sec id="sec-3">
      <title>Task Description</title>
      <p>
        NEEL-IT followed a setting similar to NEEL
challenge for English Micropost on Twitter
        <xref ref-type="bibr" rid="ref11">(Rizzo et
al., 2016)</xref>
        . The task consists of annotating each
named entity mention (like people, locations,
organizations, and products) in a text by linking it to
a knowledge base (DBpedia 2015-10).
      </p>
      <p>Specifically, each task participant is required to:
1. Recognize and typing each entity mention
that appears in the text of a tweet;
id
begin</p>
      <p>end link
288... 0
288... 73
288... 89
290... 1
2. Disambiguate and link each mention to the
canonicalized DBpedia 2015-10, which is
used as referent Knowledge Base. This
means that if an entity is present in the
Italian DBpedia but not in the canonicalized
version, this mention should be tagged as NIL.
For example, the mention Agorà can only
be referenced to the Italian DBpedia entry
Agorà &lt;programma televisivo&gt;1, but this
entry has no correspondence into the
canonicalized version of DBpedia. Then, it has been
tagged as a NIL instance.
3. Cluster together the non linkable entities,
which are tagged as NIL, in order to provide
a unique identifier for all the mentions that
refer to the same named entity.</p>
      <p>In the annotation process, a named entity is a
string in the tweet representing a proper noun that:
1) belongs to one of the categories specified in a
taxonomy and/or 2) can be linked to a DBpedia
concept. This means that some concepts have a
NIL DBpedia reference2.</p>
      <p>The taxonomy is defined by the following
categories:
Thing languages, ethnic groups, nationalities,
religions, diseases, sports, astronomical
objects;
Event holidays, sport events, political events,
social events;
Character fictional character, comics character,
title character;
Location public places, regions, commercial
places, buildings;
Organization companies, subdivisions of
companies, brands, political parties, government
1http://it.dbpedia.org/resource/
AgorÃa˘\_(programma\_televisivo)</p>
      <p>2These concepts belong to one of the categories but they
have no corresponding concept in DBpedia
bodies, press names, public organizations,
collection of people;
Person people’s names;
Product movies, tv series, music albums, press
products, devices.</p>
      <p>From the annotation are excluded the preceding
article (like il, lo, la, etc.) and any other prefix
(e.g. Dott., Prof.) or post-posed modifier. Each
participant is asked to produce an annotation file
with multiple lines, one for each annotation. A
line is a tab separated sequence of tweet id, start
offset, end offset, linked concept in DBpedia, and
category. For example, given the tweet with id
288976367238934528:
Chameleon Launcher in arrivo anche per
smartphone: video beta privata su Galaxy Note 2
e Nexus 4: Chameleon Laun...</p>
      <p>the annotation process is expected to produce
the output as reported in Table 1.</p>
      <p>The annotation process is also expected to link
Twitter mentions (@) and hashtags (#) that
refer to a named entities, like in the tweet with id
290460612549545984:
@CarlottaFerlito io non ho la forza di alzarmi e
prendere il libro! Help me
the correct annotation is also reported in Table 1.</p>
      <p>Participants were allowed to submit up to three
runs of their system as TSV files. We encourage
participants to make available their system to the
community to facilitate reuse.
3</p>
      <p>Corpus Description and Annotation
Process
The NEEL-IT corpus consists of both a
development set (released to participants as training set)
and a test set. Both sets are composed by two
TSV files: (1) the tweet id file, this is a list of all
tweet ids used for training; (2) the gold standard,
containing the annotations for all the tweets in the
development set following the format showed in
Table 1.</p>
      <p>
        The development set was built upon the dataset
produced by Basile et al. (2015). This dataset is
composed by a sample of 1,000 tweets randomly
selected from the TWITA dataset
        <xref ref-type="bibr" rid="ref4">(Basile and
Nissim, 2013)</xref>
        . We updated the gold standard links
to the canonicalized DBpedia 2015-10.
Furthermore, the dataset underwent another round of
annotation performed by a second annotator in order
to maximize the consistency of the links. Tweets
that presented some conflicts were then resolved
by a third annotator.
      </p>
      <p>
        Data for the test set was generated by randomly
selecting 1,500 tweets from the SENTIPOLC test
data
        <xref ref-type="bibr" rid="ref3">(Barbieri et al., 2016)</xref>
        . From this pool, 301
tweets were randomly chosen for the annotation
process and represents our Gold Standard (GS).
This sub-sample was choose in coordination with
the task organisers of SENTIPOLC
        <xref ref-type="bibr" rid="ref3">(Barbieri et
al., 2016)</xref>
        , POSTWITA
        <xref ref-type="bibr" rid="ref13">(Tamburini et al., 2016)</xref>
        and FacTA
        <xref ref-type="bibr" rid="ref7">(Minard et al., 2016b)</xref>
        with the aim of
providing a unified framework for multiple layers
of annotations.
      </p>
      <p>
        The tweets were split in two batches, each of
them was manually annotated by two different
annotators. Then, a third annotator intervened in
order to resolve those debatable tweets with no exact
match between annotations. The whole process
has been carried out by exploiting BRAT3
webbased tool
        <xref ref-type="bibr" rid="ref12">(Stenetorp et al., 2012)</xref>
        .
      </p>
      <p>Table 2 reports some statistics on the two sets:
in both the most represented categories are
“Person”, “Organization” and “Location”. “Person”
is also the most populated category among the
NIL instances, along to “Organization” and
“Product”. In the development set, the least represented
category is “Character” among the NIL instances
and both “Thing” and “Event” between the linked
ones. A different behaviour can be found in the
test set where the least represented category is
“Thing” in both NIL and linked instances.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Evaluation Metrics</title>
      <p>Each participant was asked to submit up to three
different run. The evaluation is based on the
following three metrics:
STMM (Strong_Typed_Mention_Match). This
metrics evaluates the micro average F-1 score
3http://brat.nlplab.org/
for all annotations considering the mention
boundaries and their types. This is a measure
of the tagging capability of the system.</p>
      <p>SLM (Strong_Link_Match). This metrics is the
micro average F-1 score for annotations
considering the correct link for each mention.
This is a measure of the linking performance
of the system.</p>
      <p>MC (Mention_Ceaf ). This metrics, also known
as Constrained Entity-Alignment F-measure
(Luo, 2005), is a clustering metric developed
to evaluate clusters of annotations. It
evaluates the F-1 score for both NIL and non-NIL
annotations in a set of mentions.</p>
      <p>The final score for each system is a combination
of the aforementioned metrics and is computed as
follows:
score = 0:4 M C +0:3 ST M M +0:3 SLM:
(1)</p>
      <p>All the metrics were computed by using the
TAC KBP scorer4.</p>
      <p>4https://github.com/wikilinks/neleval/</p>
    </sec>
    <sec id="sec-5">
      <title>Systems Description</title>
      <p>The task was well received by the NLP
community and was able to attract 17 participants who
expressed their interest in the evaluation. Five
groups participated actively to the challenge by
submitting their system results, each group
presented three different runs, for a total amount of
15 runs submitted. In this section we briefly
describe the methodology followed by each group.
5.1</p>
      <sec id="sec-5-1">
        <title>UniPI</title>
        <p>
          The system proposed by the University of Pisa
          <xref ref-type="bibr" rid="ref1">(Attardi et al., 2016)</xref>
          exploits word embeddings
and a bidirectional LSTM for entity recognition
and linking. The team produced also a training
dataset of about 13,945 tweets for entity
recognition by exploiting active learning, training data
taken from the PoSTWITA task
          <xref ref-type="bibr" rid="ref13">(Tamburini et al.,
2016)</xref>
          and manual annotation. This resource, in
addition to word embeddings built on a large
corpus of Italian tweets, is used to train a bidirectional
LSTM for the entity recognition step. In the
linking step, for each Wikipedia page its abstract is
extracted and the average of the word embeddings is
computed. For each candidate entity in the tweet,
the word embedding for a context of words of size
c before and after the entity is created. The
linking is performed by comparing the mention
embedding with the DBpedia entity whose lc2
distance is the smallest among those entities whose
abstract embeddings were computed at the
previous step. The Twitter mentions were resolved by
retrieving the real name with the Twitter API and
looking up in a gazetteer in order to identify the
Person-type entities.
5.2
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>MicroNeel</title>
        <p>
          MicroNeel
          <xref ref-type="bibr" rid="ref8">(Corcoglioniti et al., 2016)</xref>
          investigates the use on microposts of two standard
NER and Entity Linking tools originally
developed for more formal texts, namely Tint
          <xref ref-type="bibr" rid="ref1 ref10 ref13 ref7 ref8 ref9">(Palmero
Aprosio and Moretti, 2016)</xref>
          and The Wiki
Machine
          <xref ref-type="bibr" rid="ref1 ref10 ref13 ref7 ref8 ref9">(Palmero Aprosio and Giuliano, 2016)</xref>
          .
Comprehensive tweet preprocessing is performed
to reduce noisiness and increase textual context.
Existing alignments between Twitter user
profiles and DBpedia entities from the Social Media
Toolkit
          <xref ref-type="bibr" rid="ref8">(Nechaev et al., 2016)</xref>
          resource are
exploited to annotate user mentions in the tweets.
wiki/Evaluation
Rule-based and supervised (SVM-based)
techniques are investigated to merge annotations from
different tools and solve possible conflicts. All the
resources listed as follows were employed in the
evaluation:
        </p>
        <p>
          The Wiki Machine
          <xref ref-type="bibr" rid="ref1 ref10 ref13 ref7 ref8 ref9">(Palmero Aprosio and
Giuliano, 2016)</xref>
          : an open source entity
linking for Wikipedia and multiple languages.
Tint
          <xref ref-type="bibr" rid="ref1 ref10 ref13 ref7 ref8 ref9">(Palmero Aprosio and Moretti, 2016)</xref>
          : an
open source suite of NLP modules for Italian,
based on Stanford CoreNLP, which supports
named entity recognition.
        </p>
        <p>
          Social Media Toolkit (SMT)
          <xref ref-type="bibr" rid="ref8">(Nechaev et al.,
2016)</xref>
          : a resource and API supporting the
alignment of Twitter user profiles to the
corresponding DBpedia entities.
        </p>
        <p>
          Twitter ReST API5: a public API for
retrieving Twitter user profiles and tweet metadata.
Morph-It!
          <xref ref-type="bibr" rid="ref14">(Zanchetta and Baroni, 2005)</xref>
          : a
free morphological resource for Italian used
for preprocessing (true-casing) and as source
of features for the supervised merging of
annotations.
tagdef6: a website collecting user-contributed
descriptions of hashtags.
        </p>
        <p>list of slang terms from Wikipedia7.
5.3</p>
        <p>
          FBK-HLT-NLP
The system proposed by the FBK-HLT-NLP team
          <xref ref-type="bibr" rid="ref7">(Minard et al., 2016a)</xref>
          follows 3 steps: entity
recognition and classification, entity linking to
DBpedia and clustering. Entity recognition and
classification is performed by the EntityPro
module (included in the TextPro pipeline), which is
based on machine learning and uses the SVM
algorithm. Entity linking is performed using the
named entity disambiguation module developed
within the NewsReader and based on DBpedia
Spotlight. The FBK team exploited a specific
resource to link the Twitter profiles to DBpedia: the
Alignments dataset. The clustering step is
stringbased, i.e. two entities are part of the same cluster
if they are equal.
        </p>
        <p>5https://dev.twitter.com/rest/public
6https://www.tagdef.com/
7https://it.wikipedia.org/wiki/Gergo_
di_Internet</p>
        <p>Moreover, the FBK team exploits active
learning for domain adaptation, in particular to adapt
a general purpose Named Entity Recognition
system to a specific domain (tweets) by creating new
annotated data. In total they have annotated 2,654
tweets.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.4 Sisinflab</title>
        <p>The system proposed by Sisinflab (Cozza et al.,
2016) faces the neel-it challenge through an
ensamble approach that combines unsupervised and
supervised methods. The system merges results
achieved by three strategies:
1. DBpedia Spotlight for span and URI
detection plus SPARQL queries to DBpedia for
type detection;
2. Stanford CRF-NER trained with the
challenge train corpus for span and type detection
and DBpedia lookup for URI detection;
3. DeepNL-NER, a deep learning classifier
trained with the challenge train corpus for
span and type detection, it exploits ad-hoc
gazetteers and word embedding vectors
computed with word2vec trained over the Twita
dataset8 (a subset of 12,000,000 tweets).
DBpedia is used for URI detection.</p>
        <p>Finally, the system computes NIL clusters for
those mentions that do not match with an entry
in DBpedia, by grouping in the same cluster
entities with the same text (no matter the case). The
Sisinflab team submitted three runs combining the
previous strategies, in particular: run1) combines
(1), (2) and (3); run2 involves strategies (1) and
(3); run3 exploits strategies (1) and (2).
5.5</p>
      </sec>
      <sec id="sec-5-4">
        <title>UNIMIB</title>
        <p>
          The system proposed by the UNIMIB team
          <xref ref-type="bibr" rid="ref6">(Cecchini et al., 2016)</xref>
          is composed of three steps: 1)
Named Entity Recognition using Conditional
Random Fields (CRF); 2) Named Entity Linking by
considering both Supervised and Neural-Network
Language models and 3) NIL clustering by
using a graph-based approach. In the first step two
kinds of CRF are exploited: 1) a simple CRF on
the training data and 2) CRF+Gazetteers, in this
8http://www.let.rug.nl/basile/files/
proc/
configuration the model has been induced by
exploiting several gazetteers, i.e. products,
organizations, persons, events and characters. Two
strategies are adopted for the linking. A decision
strategy is used to select the best link by exploiting a
large set of supervised methods. Then, word
embeddings built on Wikipedia are used to compute a
similarity measure used to select the best link for
a list of candidate entities. NIL clustering is
performed by a graph-based approach; in particular,
a weighted indirect co-occurrence graph where an
edge represents the co-occurrence of two terms in
a tweet is built. The ensuing word graph was then
clustered using the MaxMax algorithm.
6
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Results</title>
      <p>The performance of the participant systems were
assessed by exploiting the final score measure
presented in Eq. 1. This measure combines the
three different aspects evaluated during the task,
i.e. the correct tagging of the mentions (STMM),
the proper linking to the knowledge base (SLM),
and the clustering of the NIL instances (MC).
Results of the evaluation in terms of the final score
are reported in Table 3.</p>
      <p>The best result was reported by Uni.PI.3, this
system obtained the best final score of 0:5034
with an improvement with respect to the Uni.PI.1
(second classified) of +1:27. The difference
between these two runs lays on the different
vector dimension (200 in Uni.PI.3 rather than 100
in Uni.Pi.1) combined with the use of Wikipedia
embeddings and a specific training set for
geographical entities (Uni.PI.3) rather than a mention
frequency strategy for disambiguation (Uni.PI.1).
MicroNeel.base and FBK-HLT-NLP obtain
remarkable results very close to the best system.
Indeed, MicroNeel.base reported the highest
linking performance (SLM = 0:477) while
FBK-HLTNLP showed the best clustering (MC = 0:585) and
tagging (STMM = 0:516) results. It is
interesting to notice that all these systems (UniPI,
MicroNeel and FBK-HLT-NLP) developed specific
techniques for dealing with Twitter mentions
reporting very good results for the tagging metric
(with values always above 0:46).</p>
      <p>All participants have made used of
supervised algorithms at some point of their
tagging/linking/clustering pipeline. UniPi,
Sisinflab and UNIMIB have exploited word
embeddings trained on the development set plus some
other external resources (manual annotated
corpus, Wikipedia, and Twita). UniPI and
FBK-HLTNLP built additional training data obtained by
active learning and manual annotation. The use of
additional resources is allowed by the task
guidelines, and both the teams have contributed to
develop additional data useful for the research
community.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Conclusions</title>
      <p>We described the first evaluation task for entity
linking in Italian tweets. The task evaluated the
performance of participant systems in terms of (1)
tagging entity mentions in the text of tweets; (2)
linking the mentions with respect to the
canonicalized DBpedia 2015-10; (3) clustering the entity
mentions that refer to the same named entity.</p>
      <p>The task has attracted many participants who
specifically designed and developed algorithm for
dealing with both Italian language and the specific
peculiarity of text on Twitter. Indeed, many
participants developed ad-hoc techniques for
recognising Twitter mentions and hashtag. In addition,
the participation in the task has fostered the
building of new annotated datasets and corpora for the
purpose of training learning algorithms and word
embeddings.</p>
      <p>We hope that this first initiative has set up the
scene for further investigations and developments
of best practises, corpora and resources for the
Italian name entity linking on Tweets and other
microblog contents.</p>
      <p>As future work, we plan to build a bigger dataset
of annotated contents and to foster the release of
state-of-the-art methods for entity linking in
Italian language.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>This work is supported by the project
“Multilingual Entity Liking” co-funded by the Apulia
Region under the program FutureInResearch, by
the ADAPT Centre for Digital Content
Technology, which is funded under the Science
Foundation Ireland Research Centres Programme (Grant
13/RC/2106) and is co-funded under the
European Regional Development Fund, and by H2020
FREME project (GA no. 644771).</p>
      <p>0.5034
0.4971
0.4967
0.4962
0.4932
0.4894
0.4822
0.4751
0.4736
0.3418
0.3418
0.3343
0.2224
0.2031
0.1924
+1.27
+0.08
+0.10
+0.61
+0.78
+1.49
+1.49
+0.32
+38.56</p>
      <p>0.00
+2.24
+50.31
+9.50
+5.56
0.00
name
UniPI.3
UniPI.1
MicroNeel.base
UniPI.2
FBK-HLT-NLP.3
FBK-HLT-NLP.2
FBK-HLT-NLP.1
MicroNeel.merger
MicroNeel.all
sisinflab.1
sisinflab.3
sisinflab.2
unimib.run_02
unimib.run_03
unimib.run_01</p>
      <p>MC
Francesco Corcoglioniti, Alessio Palmero Aprosio,
Yaroslav Nechaev, and Claudio Giuliano. 2016.
MicroNeel: Combining NLP Tools to Perform Named
Entity Detection and Linking on Microposts. In
Pierpaolo Basile, Anna Corazza, Franco Cutugno,
Simonetta Montemagni, Malvina Nissim, Viviana
Patti, Giovanni Semeraro, and Rachele Sprugnoli,
editors, Proceedings of Third Italian Conference on
Computational Linguistics (CLiC-it 2016) &amp; Fifth
Evaluation Campaign of Natural Language
Processing and Speech Tools for Italian. Final
Workshop (EVALITA 2016). Associazione Italiana di
Linguistica Computazionale (AILC).</p>
      <p>Vittoria Cozza, Wanda La Bruna, and Tommaso
Di Noia. 2016. sisinflab: an ensemble of
supervised and unsupervised strategies for the neel-it
challenge at Evalita 2016. In Pierpaolo Basile, Anna
Corazza, Franco Cutugno, Simonetta Montemagni,
Malvina Nissim, Viviana Patti, Giovanni
Semeraro, and Rachele Sprugnoli, editors, Proceedings of
Third Italian Conference on Computational
Linguistics (CLiC-it 2016) &amp; Fifth Evaluation Campaign
of Natural Language Processing and Speech Tools
for Italian. Final Workshop (EVALITA 2016).
Associazione Italiana di Linguistica Computazionale
(AILC).</p>
      <p>Aba-Sah Dadzie, Daniel PreoÅcˇiuc-Pietro, Danica
RadovanoviÄG˘ , Amparo E. Cano Basave, and
Katrin Weller, editors. 2016. Proceedings of the 6th
Workshop on Making Sense of Microposts, volume
1691. CEUR.</p>
      <p>Xiaoqiang Luo. 2005. On coreference resolution
performance metrics. In Proceedings of the
conference on Human Language Technology and
Empirical Methods in Natural Language Processing, pages
25–32. Association for Computational Linguistics.
Anne-Lyse Minard, R. H. Mohammed Qwaider, and
Bernardo Magnini. 2016a. FBK-NLP at
NEELIT: Active Learning for Domain Adaptation. In
Pierpaolo Basile, Anna Corazza, Franco Cutugno,
Simonetta Montemagni, Malvina Nissim, Viviana
Patti, Giovanni Semeraro, and Rachele Sprugnoli,
editors, Proceedings of Third Italian Conference on
Computational Linguistics (CLiC-it 2016) &amp; Fifth
Evaluation Campaign of Natural Language
Processing and Speech Tools for Italian. Final
Workshop (EVALITA 2016). Associazione Italiana di
Linguistica Computazionale (AILC).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Giuseppe</given-names>
            <surname>Attardi</surname>
          </string-name>
          , Daniele Sartiano, Maria Simi, and
          <string-name>
            <given-names>Irene</given-names>
            <surname>Sucameli</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Using Embeddings for Both Entity Recognition and Linking in Tweets</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <article-title>Associazione Italiana di Linguistica Computazionale (AILC).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Timothy</given-names>
            <surname>Baldwin</surname>
          </string-name>
          ,
          <string-name>
            <surname>Young-Bum</surname>
            <given-names>Kim</given-names>
          </string-name>
          , Marie Catherine de Marneffe, Alan Ritter, Bo Han, and
          <string-name>
            <given-names>Wei</given-names>
            <surname>Xu</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Shared tasks of the 2015 workshop on noisy user-generated text: Twitter lexical normalization and named entity recognition</article-title>
          .
          <source>ACL-IJCNLP</source>
          ,
          <volume>126</volume>
          :
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Francesco</given-names>
            <surname>Barbieri</surname>
          </string-name>
          , Valerio Basile, Danilo Croce, Malvina Nissim, Nicole Novielli, and
          <string-name>
            <given-names>Viviana</given-names>
            <surname>Patti</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Overview of the EVALITA 2016 SENTiment POLarity Classification Task</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <article-title>Associazione Italiana di Linguistica Computazionale (AILC).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Valerio</given-names>
            <surname>Basile</surname>
          </string-name>
          and
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Sentiment analysis on italian tweets</article-title>
          .
          <source>In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</source>
          , pages
          <fpage>100</fpage>
          -
          <lpage>107</lpage>
          , Atlanta, Georgia, June. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Pierpaolo</given-names>
            <surname>Basile</surname>
          </string-name>
          , Annalina Caputo, and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Entity Linking for Italian Tweets</article-title>
          . In Cristina Bosco, Sara Tonelli, and Fabio Massimo Zanzotto, editors,
          <source>Proceedings of the Second Italian Conference on Computational Linguistics CLiCit</source>
          <year>2015</year>
          , Trento, Italy, December 3-
          <issue>8</issue>
          ,
          <year>2015</year>
          ., pages
          <fpage>36</fpage>
          -
          <lpage>40</lpage>
          . Accademia University Press.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Flavio</given-names>
            <surname>Massimiliano</surname>
          </string-name>
          <string-name>
            <surname>Cecchini</surname>
          </string-name>
          , Elisabetta Fersini, Enza Messina Pikakshi Manchanda, Debora Nozza, Matteo Palmonari, and
          <string-name>
            <given-names>Cezar</given-names>
            <surname>Sas</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>UNIMIB@NEEL-IT : Named Entity Recognition and Linking of Italian Tweets</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <article-title>Associazione Italiana di Linguistica Computazionale (AILC).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Anne-Lyse</surname>
            <given-names>Minard</given-names>
          </string-name>
          , Manuela Speranza, and
          <string-name>
            <given-names>Tommaso</given-names>
            <surname>Caselli</surname>
          </string-name>
          .
          <year>2016b</year>
          .
          <article-title>The EVALITA 2016 Event Factuality Annotation Task (FactA)</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <article-title>Associazione Italiana di Linguistica Computazionale (AILC).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Yaroslav</given-names>
            <surname>Nechaev</surname>
          </string-name>
          , Francesco Corcoglioniti, and
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Giuliano</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Linking knowledge bases to social media profiles</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Alessio</given-names>
            <surname>Palmero</surname>
          </string-name>
          Aprosio and
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Giuliano</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>The Wiki Machine: an open source software for entity linking and enrichment</article-title>
          . ArXiv e-prints.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Alessio</given-names>
            <surname>Palmero</surname>
          </string-name>
          Aprosio and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Moretti</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Italy goes to Stanford: a collection of CoreNLP modules for Italian</article-title>
          . ArXiv e-prints,
          <year>September</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Giuseppe</given-names>
            <surname>Rizzo</surname>
          </string-name>
          , Marieke van Erp,
          <string-name>
            <surname>Julien Plu</surname>
            , and
            <given-names>Raphaël</given-names>
          </string-name>
          <string-name>
            <surname>Troncy</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Making Sense of Microposts (#Microposts2016) Named Entity rEcognition and Linking (NEEL) Challenge</article-title>
          .
          <source>In 6th Workshop on Making Sense of Microposts (#Microposts2016).</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Pontus</given-names>
            <surname>Stenetorp</surname>
          </string-name>
          , Sampo Pyysalo, Goran Topic´,
          <string-name>
            <surname>Tomoko</surname>
            <given-names>Ohta</given-names>
          </string-name>
          , Sophia Ananiadou, and
          <string-name>
            <surname>Jun'ichi Tsujii</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Brat: A web-based tool for nlp-assisted text annotation</article-title>
          .
          <source>In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, EACL '12</source>
          , pages
          <fpage>102</fpage>
          -
          <lpage>107</lpage>
          , Stroudsburg, PA, USA. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Tamburini</surname>
          </string-name>
          , Cristina Bosco, Alessandro Mazzei, and
          <string-name>
            <given-names>Andrea</given-names>
            <surname>Bolioli</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Overview of the EVALITA 2016 Part Of Speech on TWitter for ITAlian Task</article-title>
          . In Pierpaolo Basile, Anna Corazza, Franco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <article-title>Associazione Italiana di Linguistica Computazionale (AILC).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Eros</given-names>
            <surname>Zanchetta</surname>
          </string-name>
          and
          <string-name>
            <given-names>Marco</given-names>
            <surname>Baroni</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Morph-it! a free corpus-based morphological resource for the Italian language</article-title>
          .
          <source>Corpus Linguistics</source>
          <year>2005</year>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>