<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Named Entity Recognition in Albanian Based on CRFs Approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gridi Kono</string-name>
          <email>gridi.kono@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Klesti Hoxha</string-name>
          <email>klesti.hoxha@fshn.edu.al</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Informatics, Faculty of Natural Sciences, University of Tirana</institution>
          ,
          <addr-line>1001 Tirana</addr-line>
          ,
          <country country="AL">Albania</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Named Entity Recognition (NER) refers to the process of extracting named entities (people, locations, organizations, sport teams, etc.) from text documents. In this work we describe our NER approach for documents written in Albanian. We explore the use of Conditional Random Fields (CRFs) for this purpose. Adequate annotated training corpora are not yet publicly available for Albanian. We have created our own corpus annotated manually by humans. The domain of this corpus is based on Albanian news documents published in 2015 and 2016. We have tested our trained model with two test sets. Overall precision, recall and F-score are 83.2%, 60.1% and 69.7% respectively.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Democratic Party since 15 December 2013.",
"Matteo Renzi", "Italy" and "Democratic Party" can be
classi ed as person, location and organization entities,
respectively.</p>
      <p>In this work we describe a machine learning
approach for recognizing named entities in Albanian text
documents. The Albanian language lacks of publicly
available annotated training corpora for NER. We have
created a custom annotated corpus consisting of news
articles written in Albanian published in various
online news media. The corpus has been created using a
custom built web application software that allowed for
n-gram based annotation sessions. Experiments were
conducted using Standford CRF based NER toolkit1.
Results were promising despite the small size of the
created corpus.</p>
      <p>The rest of this paper is structured as follows.</p>
      <p>In Section 2 we will present previous works in NER
and related approaches. In Section 3 the Conditional
Random Fields approach is described. In Section 4
we will describe our corpus and the methodology used
for creating it. In Section 5 we will present
experiments and their results. Finally, Section 6 concludes
the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related</title>
    </sec>
    <sec id="sec-3">
      <title>Works</title>
      <p>NER approaches have been reported since the early
90s. One of the rst works has been described by Rau
in [Rau91]. This paper describes the idea of a system
that extracts and recognizes company names. It relied
on handcrafted rules and heuristics.</p>
      <p>Since NER is language dependent, many systems
have been presented for di erent languages. In
[DBG+00] is described a NER system that
recognizes named entities in texts written in Greek. This
1http://nlp.stanford.edu/software/CRF-NER.html
approach followed the MUC-7 NER task de nition
[CR97] with certain adaptions. Entity classes captured
in this paper are people, organizations, location names,
date and time expressions, and percent and money
expressions. This system is based on nite state
machine techniques. The achieved precision and recall
were 0.86 and 0.81 respectively.</p>
      <p>An interesting study by Pathak et al. [PGJ+13]
focuses in clinical named entities. It recognizes three
types of named entities like Problem, Test and
Treatment. In this study, authors proposed an approach
which uses domain speci c knowledge in the form of
clinical features along with textual and linguistic
features. The used textual features are stemming,
prex, su x and orthographical features. The used
linguistics features are part-of-speech(POS), chunks and
NP Head. While the used clinical features are section
headers, customized stop words, dictionary search,
abbreviations and acronyms. They performed
experiments with i2b2 shared task using CRF++2. The
evaluation task was done using micro-averaged
precision, recall, and F-Score for exact and inexact matches.
For exact matches they achieved 0.889 precision, 0.813
recall and 0.849 F-score respectively. For inexact
matches they achieved 0.966 precision, 0.883 recall and
0.923 F-Score.</p>
      <p>An approach for German language is presented by
Faruqui et al. in [FPS10]. Their work consists of
training an existing Stanford NER system on various
German semantic generalization corpora. Semantic
generalization refers to acquiring semantic similarities from
large, unlabelled corpora that can support the
generalization of predictions to new, unseen words in the
test set while avoiding over- tting. Corpora was
evaluated on both in-domain and out-of-domain data,
assessing the impact of generalization corpus size and
quality. The F-score of this system improves by 6%
(in-domain) and 9% (out-of-domain) over supervised
training approaches.</p>
      <p>Benajiba et al. in [BDR+08] have developed a NER
system for Arabic language. The features used are
contextual, lexical, morphological, geographical
dictionaries (gazzeters), Part-of-speech tags and
Base-phrasechunking, nationality and the corresponding English
capitalization. The system has been evaluated using
ACE Corpora3 and ANERcorp4. The aggregate
Fscore for this system (when all the features are
considered) is 82.71%.</p>
      <p>A valuable approach for Albanian Language is
presented for the rst time by Skenduli and Biba in
[SB13]. Their work uses a human annotated corpus.
The domain of this corpus is focused in Politics and</p>
      <sec id="sec-3-1">
        <title>2https://taku910.github.io/crfpp/</title>
      </sec>
      <sec id="sec-3-2">
        <title>3http://corpus.ied.edu.hk/ace/Corpus.html</title>
      </sec>
      <sec id="sec-3-3">
        <title>4http://users.dsic.upv.es/ ybenajiba/</title>
        <p>History documents. The corpus is a collection of three
sub-corpora: People corpus, Locations corpus and
Organizations corpus. They performed experiments with
these corpora using Apache OpenNLP5 as a
framework for running their machine learning based NER
approach. The achieved results of this approach were
as follows:</p>
        <p>The People corpus produced values of Precision,
Recall and F-score as 0.85, 0.70 and 0.76 respectively.
The Locations corpus produced values of Precision,
Recall and F-score as 0.83, 0.66 and 0.73 respectively.
While Organizations corpus produced values of
Precision, Recall and F-score as 0.69, 0.60 and 0.64
respectively.</p>
        <p>In general, NER approaches reported for most
languages belong to these categories:</p>
        <sec id="sec-3-3-1">
          <title>1. Rule Based</title>
        </sec>
        <sec id="sec-3-3-2">
          <title>2. Machine Learning</title>
        </sec>
        <sec id="sec-3-3-3">
          <title>3. Hybrid Models</title>
          <p>The rst one is based on handcrafted rules,
linguistic approaches and Gazzeters. The second is based
on statistical methods. The most used methods for
statistical NER are Maximum Entropy Model [SB13],
Conditional Random Fields [PGJ+13,FPS10,LMP01],
Hidden Markov Models [ZS02] and Support
Vector Machines [BDR+08]. The third one combines
Rule based and Machine learning methods [Rau91,
DBG+00, BDR+08]. Machine learning based methods
depend on preliminary training. The training methods
can be divided into three groups: Supervised learning,
Semi-supervised learning and Unsupervised learning
method. Supervised methods need annotated
training data to retrieve optimal results from the classi er.
Semi-supervised learning methods require some data
which are used as a help for the training.
Unsupervised learning methods do not depend on training data
and are mostly clustering based.
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conditional Random Fields</title>
      <p>In this work we used a linear chain CRF sequence
classi er. Conditional Random Fields is a probabilistic
framework used to segment and label sequence data.
Conditional Random Fields are undirected graphical
models, used to calculate the conditional probability
of values on designated output nodes, given already
assigned values to the input nodes. The conditional
probability of a state sequence x = (x1; : : : ; xT ) given
an observation sequence y = (y1; : : : ; yT ) calculated
as:</p>
      <sec id="sec-4-1">
        <title>5https://opennlp.apache.org/</title>
        <p>p (yjx) =</p>
        <p>1
Z (x)
exp
( T K</p>
        <p>X X kfk(yt 1; yt; xt)</p>
        <p>)
where, fk(yt 1; yt; xt) is a feature function whose
weight k , is to be learned via training. The values of
feature functions may range between 1 to +1 , but
usually they are binary. Usually, when applying CRFs
to the named entity recognition problem, an
observation sequence is a sequence of tokens or a raw text
and the state sequence is its corresponding sequence
of labels [LMP01]. By Hammersley-Cli ord theorem,
the conditional probability of a state sequence given
an input sequence will be:
(1)
Z (x) = X exp
( T K )</p>
        <p>X X kfk(yt 1; yt; xt) : (2)
y2Y T</p>
        <p>t=1 k=1
where Z is a normalisation factor over the all state
sequences, which ensures that the probability
distribution sums up to 1.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Corpus</title>
      <p>There are no publicly available NER annotated
corpora for Albanian texts. Hence we decided to
create a corpus of Albanian based on news articles
published online from di erent local newspapers. We have
used the news aggregator for Albanian news, built by
[HBN16] using Scrapy6. News articles retrieved by this
news aggregator are stored in a MySQL database. We
used Python NLTK toolkit to generate all n-grams(for
n=1,2,3,4) for each news article. All generated
ngrams are stored in the same database with
corresponding news articles. In this paper we have
considered only unigrams. Figure 1 shows the work ow
diagram of building our corpus.</p>
      <sec id="sec-5-1">
        <title>6https://scrapy.org/</title>
        <p>Tokenizer
N-Grams
Generator
Annotated
N-Grams
Annotated</p>
        <p>Corpus</p>
        <p>In order to add labels to the generated unigrams,
we have built a web application using ASP.NET
WebForms7, C#8, JQuery/Ajax9 and Javascript
technologies.</p>
        <p>Our application has two simple user interfaces. The
rst user interface (Figure 2) lists titles of news articles
and allows selecting each of them for n-gram labeling.</p>
        <p>The second user interface (Figure 3) consists of two
parts. The rst part displays raw content of a selected
news article and the second part displays all unigrams
of it. For each unigram, annotators are able to set
a corresponding label from a list of prede ned entity
classes. Actually, our web application o ers interfaces
for also labeling bigrams and trigrams, but because the
NER training model that we used for our experiments
depends on labeled unigrams we were limited to these.</p>
        <p>In order to visually aid the entity identi cation
process, each word which starts with an uppercase
charac</p>
      </sec>
      <sec id="sec-5-2">
        <title>7https://www.asp.net/web-forms/</title>
      </sec>
      <sec id="sec-5-3">
        <title>8https://msdn.microsoft.com/en-us/library/67ef8sbd.aspx</title>
      </sec>
      <sec id="sec-5-4">
        <title>9https://jquery.com/</title>
        <p>ter inside the news content is highlighted with yellow
color.</p>
        <p>This web application allows annotators to work on
the same news item without overriding previous
ngram labels, but storing each annotation instead
allowing so for quality control processes. However, we
avoided this for the experiments reported in this work,
leaving the experimentation with annotation quality
assurance techniques for future works.</p>
        <p>Our corpus consists of 130 documents. The selected
news documents were published in two di erent years
(2015 and 2016). They belong to eight categories:
Politics News, Economic News, Sport News, Health News,
Technology News, Culture News, Chronics and
Opinions.</p>
        <p>This corpus has been manually annotated by
humans. We have organized three sessions with
volunteer annotators in order to annotate more n-grams. In
the rst and second sessions, volunteers annotated all
news articles designated for the training set. In the
third session we used di erent annotators that have
not participated in previous sessions, in order to
annotate test sets. The annotation has been done
according to the Inside Outside(IO) format10 with four
tags as described in Table 1.</p>
        <p>NE tag
PER
LOC
ORG
O</p>
        <p>Meaning
person name
location
name
organization
name
Not an entity</p>
        <p>Example
George PER
Bush PER
Tirana LOC</p>
        <sec id="sec-5-4-1">
          <title>OSCE ORG</title>
          <p>76% O
5
5.1</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Experiments and Results</title>
      <p>Experimental Set-up
We performed our experiments in Stanford NER.
Stanford NER is a Java implementation of a Named Entity
10http://nlp.stanford.edu/software/crf-faq.html
Recognizer. It comes with well-engineered feature
extractors for Named Entity Recognition, and many
options for de ning feature extractors. Stanford NER
is also known as CRFClassi er. The software
provides a general implementation of (arbitrary order)
linear chain Conditional Random Field (CRF) sequence
models. That is, by training your own models on
labeled data, you can actually use this code to build
sequence models for NER or any other task [FGM05].
5.1.1</p>
      <p>Evaluation Metrics
We have evaluated the results of our experiments
with well-accepted standard measures in evaluation of
trained NER models. This can be performed by
annotating a corpus and then compare the human
annotations with a gold standard corpus. Thus, each
annotation must be classi ed as being a:
1. True Positive (TP): the system provides an
annotation that exists in the gold standard corpus.
2. True Negative (TN): the non existence of an
annotation is correct according to the gold standard
corpus.
3. False Positive (FP): the system provides an
annotation that does not exist in the gold standard
corpus;
4. False Negative (FN): the system does not provide
an annotation that is present in the gold standard
corpus.</p>
      <p>Concretely we used Precision, Recall and F-score as
used by other authors in [DBG+00] [BDR+08] [SB13].</p>
      <p>Recall measures the ability of a NE trained model
to present all relevant entities, and is formulated as:
Recall =</p>
      <p>T P</p>
      <p>T P + F P</p>
      <p>Precision measures the ability of a NE trained
model to present only relevant entities, and it is
formulated as:</p>
      <p>P recision =</p>
      <p>T P</p>
      <p>T P + F N</p>
      <p>These two measures of performance can be
combined as one performance metrics, the F-score, which
is computed by the weighted harmonic mean of
precision and recall.
Our corpus is further divided into the training and the
test set, which contain 100 and 30 documents
respectively.</p>
      <p>The training set contains news documents published
in 2015, in total around 50.000 words.</p>
      <p>The test set is divided into two subsets. The rst
subset contains news documents published in 2015,
while the second subset contains news documents
published in 2016. Each subset contains 15 documents
respectively.</p>
      <p>We have conducted two experiments, the rst using
the rst subset of test data and the second makes use
of the second subset.
6</p>
    </sec>
    <sec id="sec-7">
      <title>Results</title>
      <p>The evaluation task for each experiment as described
above was done using three di erent metrics:
Precision, Recall and F-score. The following tables show
results for each test set that has been used. The
used training model is the same for both experiments.
These calculations were carried out automatically by
Stanford NER.</p>
      <p>In the rst experiment the NE class with highest
F-score is Locations class, 81.1%. The NE class with
lowest value is Organizations class, 47.1%. Overall for
the rst experiment we have got Precision of 80.8%,
Recall of 64.0% and F-score of 71.4% (see Table 2).</p>
      <p>The overall average Precision, Recall and F-score
are 83.2%, 60.1% and 69.7% respectively (see Table
4).
In this paper we presented the results of a machine
learning approach for identifying named entities in text
documents written in Albanian. It is based in
Conditional Random Fields and was evaluated against two
di erent test sets on a corpus of Albanian news
documents. The corpus was created by annotating news
articles through the use of a custom built web
application software. Volunteer annotators manually
performed this process by using a n-gram based news
visualization interface. The experiments were restricted
in the recognition of three entity classes: people,
locations, and organizations.</p>
      <p>Even though the size of the annotated corpus is
modest, we got promising results, showing that the
experimented model can be used for successfully
extracting named entities from Albanian text documents.
The relatively low recall values for organization
entities may be improved by using a larger corpus and
expand it beyond news text documents written in
Albanian.</p>
      <p>In the future we intend to increase the size of the
corpus in order to get more signi cant results.
Furthermore, we aim to improve the quality of the
annotated data by switching to a semiautomatic corpus
creation approach [ACS14]. It would need to use a
publicly available knowledge base of people, locations,
and organizations. This way we may aid human
annotators in better recognizing possible named entities in
the provided texts. Also we want to improve the user
interface involved in the annotation process and also
tweak it in order to avoid confusion and produce
annotation results better suited for the NLP toolkit that is
being used. Another aspect that we want to improve in
the future, is the inclusion of a quality control scheme
in the annotation process. This way we will be able
to avoid false or ambiguous tagging of named entities
present in the text documents in question.</p>
      <p>Experimenting with other NER machine learning
techniques like Hidden Markov Model (HMM),
Support Vector Machine (SVM) and studying the
behaviour of these approaches for Albanian written
documents is also in our future plans.</p>
      <p>A NER tool for Albanian texts will also enable
concrete applications like the creation of a knowledge base
that stores facts about named entities present in news
articles [HBN16].
[ACS14]</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>In Proceedings of the First Italian Conference on Computational Linguistics</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [BDR+08]
          <string-name>
            <surname>Yassine</surname>
            <given-names>Benajiba</given-names>
          </string-name>
          , Mona Diab,
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Rosso</surname>
          </string-name>
          , et al.
          <article-title>Arabic named entity recognition: An svm-based approach</article-title>
          .
          <source>In Proceedings of 2008 Arab International Conference on Information Technology (ACIT)</source>
          , pages
          <fpage>16</fpage>
          {
          <fpage>18</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [CR97]
          <string-name>
            <given-names>Nancy</given-names>
            <surname>Chinchor</surname>
          </string-name>
          and
          <string-name>
            <given-names>Patricia</given-names>
            <surname>Robinson</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>Muc-7 named entity task de nition</article-title>
          .
          <source>In Proceedings of the 7th Conference on Message Understanding, page 29</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [DBG+00]
          <string-name>
            <surname>Iason</surname>
            <given-names>Demiros</given-names>
          </string-name>
          , Sotiris Boutsis, Voula Giouli, Maria Liakata, Harris Papageorgiou, and
          <string-name>
            <given-names>Stelios</given-names>
            <surname>Piperidis</surname>
          </string-name>
          .
          <article-title>Named entity recognition in greek texts</article-title>
          .
          <source>In LREC</source>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>[FGM05] Jenny Rose</surname>
            <given-names>Finkel</given-names>
          </string-name>
          , Trond Grenager, and
          <string-name>
            <given-names>Christopher</given-names>
            <surname>Manning</surname>
          </string-name>
          .
          <article-title>Incorporating nonlocal information into information extraction systems by gibbs sampling</article-title>
          .
          <source>In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics</source>
          , pages
          <volume>363</volume>
          {
          <fpage>370</fpage>
          . Association for Computational Linguistics,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [FPS10] [HBN16] [LMP01]
          <string-name>
            <given-names>Manaal</given-names>
            <surname>Faruqui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Pado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Maschinelle</given-names>
            <surname>Sprachverarbeitung</surname>
          </string-name>
          .
          <article-title>Training and evaluating a german named entity recognizer with semantic generalization</article-title>
          .
          <source>In KONVENS</source>
          , pages
          <volume>129</volume>
          {
          <fpage>133</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Klesti</given-names>
            <surname>Hoxha</surname>
          </string-name>
          , Artur Baxhaku, and
          <string-name>
            <given-names>Ilia</given-names>
            <surname>Ninka</surname>
          </string-name>
          .
          <article-title>Bootstrapping an online news knowledge base</article-title>
          .
          <source>In International Conference on Web Engineering</source>
          , pages
          <volume>501</volume>
          {
          <fpage>506</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Springer</surname>
          </string-name>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          John La erty,
          <string-name>
            <surname>Andrew McCallum</surname>
            ,
            <given-names>and Fernando</given-names>
          </string-name>
          <string-name>
            <surname>Pereira</surname>
          </string-name>
          .
          <article-title>Conditional random elds: Probabilistic models for segmenting and labeling sequence data</article-title>
          .
          <source>In Proceedings of the eighteenth international conference on machine learning</source>
          , ICML, volume
          <volume>1</volume>
          , pages
          <fpage>282</fpage>
          {
          <fpage>289</fpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [PGJ+13]
          <string-name>
            <surname>Parth</surname>
            <given-names>Pathak</given-names>
          </string-name>
          , Raxit Goswami, Gautam Joshi,
          <string-name>
            <given-names>Pinal</given-names>
            <surname>Patel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Amrish</given-names>
            <surname>Patel</surname>
          </string-name>
          .
          <article-title>Crfbased clinical named entity recognition using clinical nlp</article-title>
          .
          <source>In Proceedings of 10th [Rau91] [SB13] [ZPZ04] [ZS02] International Conference on Natural Language Processing</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>L. F.</given-names>
            <surname>Rau</surname>
          </string-name>
          .
          <article-title>Extracting company names from text</article-title>
          .
          <source>In Proc. Seventh IEEE Conf Arti cial Intelligence Applications</source>
          , volume i, pages
          <volume>29</volume>
          {
          <fpage>32</fpage>
          ,
          <year>February 1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Marjana</given-names>
            <surname>Prifti</surname>
          </string-name>
          <article-title>Skenduli and Marenglen Biba. A named entity recognition approach for albanian</article-title>
          .
          <source>In Advances in Computing, Communications and Informatics (ICACCI)</source>
          , 2013 International Conference on, pages
          <volume>1532</volume>
          {
          <fpage>1537</fpage>
          . IEEE,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Li</given-names>
            <surname>Zhang</surname>
          </string-name>
          , Yue Pan, and Tong Zhang.
          <article-title>Focused named entity recognition using machine learning</article-title>
          .
          <source>In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '04</source>
          , pages
          <fpage>281</fpage>
          {
          <fpage>288</fpage>
          , New York, NY, USA,
          <year>2004</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>GuoDong</given-names>
            <surname>Zhou and Jian Su</surname>
          </string-name>
          .
          <article-title>Named entity recognition using an hmm-based chunk tagger</article-title>
          .
          <source>In proceedings of the 40th Annual Meeting on Association for Computational Linguistics</source>
          , pages
          <volume>473</volume>
          {
          <fpage>480</fpage>
          . Association for Computational Linguistics,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>