<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Monolingual and Multilingual Question Answering on European Legislation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Radu ION</string-name>
          <email>radu@racai.ro</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexandru CEAUŞU</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dan ŞTEFĂNESCU</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dan TUFIŞ</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elena IRIMIA</string-name>
          <email>elena@racai.ro</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Verginica BARBU MITITELU</string-name>
          <email>vergi@racai.ro</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Research Institute for Artificial Intelligence, Romanian Academy Calea 13 Septembrie no. 13</institution>
          ,
          <addr-line>Bucharest 050711</addr-line>
          ,
          <country country="RO">Romania</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper documents the participation of the Research Institute for Artificial Intelligence to the CLEF 2010 ResPubliQA lab. We answered questions in Romanian and English from Romanian documents of Acquis Communautaire and the European Parliament Proceedings. We extend the report from the previous ResPubliQA participation by introducing multi-factored paragraph relevance score training onto English-Romanian QA. We also investigate how our monolingual parametric QA system developed for the last year's ResPubliQA track scales up to current challenges.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Research Institute for Artificial Intelligence (RACAI) is at the 5th participation in the
CLEF (http://www.clef-campaign.org/) series of Question Answering systems
evaluation. We have built Question Answering (QA) systems for the
EnglishRomanian or the Romanian-Romanian tracks experimenting with each passing year.
Beginning with last year, the QA task simplified in that the organizers asked for the
single most relevant paragraph containing the answer to the user’s natural language
question. Also, questions were independent one from the other and no anaphora
resolution was required in order to find referents of the question pronouns in previous
questions and/or answers. Thus, the road for a reliable QA system development was
opened and continues to be opened for the 2010 edition of the popular QA systems
evaluation forum.</p>
      <p>
        This year we participated to the Romanian-Romanian track of the ResPubliQA lab
as we did in 2009, but we also enrolled in the English-Romanian cross-lingual QA in
order to check some hypotheses that we put forward [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The approach that we took
was to use the last year’s test set comprised of 500 questions from the JRC Acquis
corpus in order to train our paragraph relevance weights. The difference is that for
ResPubliQA 2010, the Europarl corpus was added along with a new type of question
for which we did not have any training data. This type of question (dubbed OPINION
in the “ResPubliQA 2010 - Track Guidelines” document) was specific to the Europarl
corpus in which each speaker in the European Parliament expresses an opinion about
a given state of affairs.
      </p>
      <p>
        In what follows, we will describe the document collection, our QA systems (both
monolingual and cross-lingual) and the results we have obtained. In doing so, we will
be brief on subjects that have already been presented at length elsewhere [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], focusing
on new aspects and discussions pertaining to the task at hand.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. The Document Collection</title>
      <p>
        The document collection was composed from two corpora: the JRC Acquis [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] that
was introduced last year and the new addition of Europarl [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The latter corpus
consists of 142 documents in both English and Romanian containing almost 8.6M
tokens in English and almost 9.3M tokens in Romanian (including punctuation). Both
parts of the corpus were preprocessed using the TTL web service [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] to obtain POS
tagging, lemma and chunking information (the same annotations as for the JRC
Acquis corpus). As with the JRC Acquis corpus, we paragraph-aligned the Europarl
corpus using the 1:1 sentence aligner developed by Moore [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. We managed to obtain
a percent of 98.76% English paragraphs that were 1:1 aligned to Romanian
paragraphs which means that the corpus was already “almost” aligned with paragraph
counts differing very little between English or Romanian parts for each pair of
documents.
      </p>
      <p>
        For this year’s ResPubliQA competition, the JRC Acquis and the Europarl corpora
were also word sense disambiguated using one of the algorithms with which we
participated to the “Task #17: All-words Word Sense Disambiguation (WSD) on a
Specific Domain” of the SemEval-2010 semantic evaluations forum1. We wanted to
evaluate the impact of WSD onto the accuracy of our QA system by doing an
informed, WSD-driven query expansion and WSD-enhanced document retrieval. The
algorithm that we used to sense-annotate the document collection was the variant of
RACAI-1 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] which outputs the best two senses for the target word and whose
reported accuracy is around 82.5% if it is applied onto highly domain-specific content
words.
      </p>
      <p>Figure 1 exemplifies the level of corpus annotation used by our present QA
system:</p>
      <sec id="sec-2-1">
        <title>1 http://semeval2.fbk.eu/semeval2.php</title>
        <p>In order to check for the query expansion benefits, for each sense disambiguated
word, we also indexed its synonyms as given by respective ILIs2. For instance, for the
lemma “proposal”, the index contained the synonym “proposition” since this literal
appears next to “proposal” in the synset which is identified by ILI
“ENG2006719629-n”.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. The QA System</title>
      <p>
        The QA system has no significant modifications since the ResPubliQA 2009 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. It is
based on a flow of web services that takes a user’s natural language question,
preprocess it on the fly to obtain all the annotations from Figure 1, transforms it into a
Boolean query using one of the two query generation algorithms [2; pages 7, 8] and
then retrieves a list of relevant paragraphs that are very likely to contain the answer to
the user’s question.
      </p>
      <p>The way in which the paragraph list is sorted (in order to extract and return the first
paragraph as the single answer to the question) is the key to the trainable quality of
our QA system. Thus, the sort key S of a paragraph p is a linear combination of
paragraph relevance scores:</p>
      <p>S ( p) = ∑λi si , ∑λi = 1 (1)</p>
      <p>
        i i
where the weights sum to 1 and are estimated by the following MERT-like procedure
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]: given a training set of questions for which the correct paragraphs are known, run
the QA system for all possible values of weights such that the increment step is 0.01
and compute the MRR of each run. Retain that set of weights for which the MRR is
the highest.
      </p>
      <p>In order to comply with the organizers’ suggestion that an “I don’t know/I’m not
sure” answer (identified with NOA – standing for “NO Answer”) is better than a
wrong answer, we introduced the combined QA system which considers the outputs
of the two different query generation algorithms in the following manner:
arg min (rank1( p) + rank2 ( p)),
p</p>
      <p>
        rank1( p) ≤ K , rank2 ( p) ≤ K , K ≤ 50 (2)
where rank ( p) is the rank of the paragraph p in the list returned by the search
engine and subscripts 1 and 2 indicate the paragraph lists returned by the respective
query generation algorithm (1 for the TF-IDF one and 2 for the chunk-based one).
Intuitively, the paragraph that is preferred by the combined QA system is the lowest
numbered one that is common to both lists. When no such paragraph exists, the QA
system returns NOA.
2 The Inter-Lingual Index (ILI) was a major outcome of the EuroWordNet project [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and it
ensures the cross-linguistic alignment of wordnets.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. The Monolingual Runs</title>
      <p>We participated in the Romanian-Romanian section of the ResPubliQA 2010
Paragraph Selection (PS) track. As in the previous year, the requirement was to return
exactly one paragraph containing the correct answer to each natural language question
in the 200 questions test set. If the system is not sure, the NOA answer may be
returned with an option to record an actual answer (paragraph) with the NOA so that
the organizers may compute additional performance figures such as the percent of
correct/incorrect answers out of the NOA ones.</p>
      <p>
        We have submitted two Romanian-Romanian runs: icia101PSroro and
icia102PSroro with the following characteristics:
• The first run is simply the QA system from ResPubliQA 2009 with global
weights training as described in that paper [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and using the TF-IDF query
generation algorithm. We wanted to see how our 2009 QA system scales up
to current challenges without any modifications;
• The second run is a more elaborate one. We trained the weights of the QA
system on ResPubliQA 2009 500 questions test set in order to derive a set of
weights for each question class for each query generation algorithm (see
tables 1 and 2). Then, we combined the outputs of the TF-IDF and CHUNK
QA systems, using K=1 from eq. 2.
      </p>
      <p>TF-IDF Q. Gen.</p>
      <p>DEFINITION
FACTOID
PROCEDURE
REASON-PURPOSE</p>
      <sec id="sec-4-1">
        <title>Lexical Chains</title>
        <p>0.03
0.06
0.09
0.48</p>
      </sec>
      <sec id="sec-4-2">
        <title>CHUNK Q. Gen.</title>
        <p>DEFINITION
FACTOID
PROCEDURE
REASON-PURPOSE</p>
      </sec>
      <sec id="sec-4-3">
        <title>Lexical Chains</title>
        <p>0.11
0.11
0.59
0
The official results of our two runs are given in Table 3.</p>
      </sec>
      <sec id="sec-4-4">
        <title>ANSWERED</title>
        <p>UNANSWERED
ANSWERED with RIGHT
ANSWERED with WRONG
UNANSWERED with RIGHT
UNANSWERED with WRONG
UNANSWERED with EMPTY
Overall accuracy
c@1 measure
icia102PSroro
92
108
63
29
0
0
108
0.32
0.49
Table 3 reveals the fact that MERT training procedure is rather sensitive to the
training data: a c@1 measure of 0.49 is significantly lower than the one of 0.68 we
obtained last year. Still, there is also the issue of the size of the test data which was
200 questions vs. 500 questions last year (more than double). This translates in a
reduced margin of error.</p>
        <p>For this year’s ResPubliQA competition we also wanted to test the influence the
WSD has on both document/paragraph retrieval and query generation. We have
already explained how we index a term using its assigned senses.</p>
        <p>
          For the query side, we opted to implement a query expansion mechanism based on
performing WSD to the user’s question and generate all synonyms from the
Romanian WordNet for each semantically disambiguated term. In order to do that, we
used the RACAI-1 WSD algorithm [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] with which we have obtained an 82.5%
accuracy on a domain-limited lexical sample if we allowed it to output the first two
senses for each target word. The query expansion algorithm works in the following
way:
1. obtain a query from the natural language question using the TF-IDF query
generation algorithm [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ];
2. for each term in that query, apply RACAI-1 WSD algorithm (which uses a
WSD model derived from the document collection and always outputs the
domain-computed most frequent sense of the term according to the model –
the “One Sense per Domain” hypothesis) to obtain the most probable 2
senses of the term; using Romanian WordNet, generate all its synonyms for
each disambiguated sense.
        </p>
        <p>Evaluating the results of the WSD-enhanced QA system, we differentiated between
several types of runs: with/without query expansion and with/without WSD-enhanced
indexing. We tested both query generation algorithms (TF-IDF and chunk-based) but
we could only expand queries produced by the TF-IDF algorithm. Table 4
summarizes the results that we have obtained on the ResPubliQA 2010 official 200
questions test set. The QA system ran with the same global weights (see eq. 1) as per
last year ResPubliQA.</p>
      </sec>
      <sec id="sec-4-5">
        <title>Q. Gen. Algorithms</title>
        <p>TF-IDF</p>
        <sec id="sec-4-5-1">
          <title>CHUNK</title>
        </sec>
        <sec id="sec-4-5-2">
          <title>Query Expansion</title>
          <p>WSD Idx. No WSD Idx.
0.2577/ 0.2602/
0.3084/ 0.3054/
0.4690 0.4438
–
–</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. The Cross-lingual Runs</title>
      <p>
        The cross-lingual system uses the same index as the monolingual Romanian QA but
for the query generation it uses an already available statistical machine translation
(SMT) system experimented in several RACAI projects [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The SMT system is based
on Moses [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], an open source framework for rapid prototyping of machine translation
systems.
      </p>
      <p>
        The training data for the translation system consisted of the JRC Acquis corpus and
EMEA - European Medicines Agency documents [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The abundant foreign
languages fragments were filtered-out as well as the translation units with significant
length differences. After filtering, the remaining corpus had a total of 1.4 million
translation units. Also, the translation pairs from the Romanian Wordnet [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] aligned
to the Princeton Wordnet (http://wordnet.princeton.edu/) were added to the training
data.
      </p>
      <p>From the training data only the lemmas of the content words were kept. The first
two letters of the morpho-syntactical description were added to the lemma in order to
syntactically disambiguate the terms. For example, the sentence “The medicine can
only be obtained with a prescription.” is transformed into the sequence: “medicine^Nc
obtain^Vm prescription^Nc”. Using the Moses scripts and Giza++ alignments we
extracted a phrase-table (3-gram maximum length) for content-words lemmas.</p>
      <p>We used different query translation algorithms for the two submitted runs. For the
first run, we selected from the translation table the translation equivalents for each
content word lemma. For example, the question “Why was Perwiz Kambakhsh
sentenced to death?” is translated into the query:</p>
      <p>de_ce Perwiz Kambakhsh (condamna OR condamnat OR condamnare) (deces OR
moarte)</p>
      <p>For the second run, we used the Moses decoder to generate the n-best translation
list. The terms from the lists were collected into a single query. For example, the same
English question as above is translated into the query:
de_ce Perwiz Kambakhsh condamna condamnat condamnare deces moarte fi
The two approaches have similar results as can be seen in the official results shown
in Table 5:</p>
      <sec id="sec-5-1">
        <title>ANSWERED</title>
        <p>UNANSWERED
ANSWERED with RIGHT candidate
ANSWERED with WRONG candidate
UNANSWERED with RIGHT candidate
UNANSWERED with WRONG candidate
UNANSWERED with EMPTY candidate
Overall accuracy
c@1 measure
The performance measures are significantly lower than those of the monolingual
counterpart suggesting the fact that either the translation can be improved or the issue
of noise introduction with alternate translations for a term is to be addressed.</p>
        <p>Radu ION, Alexandru CEAUŞU, Dan ŞTEFĂNESCU, Dan TUFIŞ, Elena IRIMIA
and Verginica BARBU MITITELU</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>RACAI participated in the Romanian-Romanian and English-Romanian settings of
the ResPubliQA 2010 Paragraph Selection track using the QA system that it has
developed for the last year’s ResPubliQA. The main aim of our participation this year
was to test the scalability of our QA system to new challenges given the fact that it
performed the best last year in the Romanian-Romanian setting out of 4 runs
belonging to 2 participating groups but also overall out of the 28 runs in all languages.
Even if the results were lower than those of the last year, we acquired important
insights on how to scale this QA system to new challenges. Thus, for instance, we
validated the per-class training that gives the best results and also, we know now that
MERT estimation is very sensitive to the training data set.</p>
      <sec id="sec-6-1">
        <title>Acknowledgements</title>
        <p>RACAI participation to CLEF series of QA systems evaluation was possible due to
the SIR-RESDEC national project (no. D1.1.0.0.0.7/18.09.2007) that aims at
developing an open-domain QA system with application to the European Legislation.
This project is approaching the finish line and as such, we will leverage all our CLEF
experience in order to put forward this QA system and make it available on the web.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ion</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ştefănescu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <source>RACAI: Unsupervised WSD Experiments @ SemEval2</source>
          , Task #
          <fpage>17</fpage>
          .
          <source>In Proceedings of the 5th International Workshop on Semantic Evaluations, SemEval-2010</source>
          , pages
          <fpage>411</fpage>
          -
          <lpage>416</lpage>
          , Uppsala, Sweden, July
          <volume>15</volume>
          -16
          <year>2010</year>
          .
          <article-title>ACL 2010</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ion</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ştefănescu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceauşu</surname>
          </string-name>
          , Al.,
          <string-name>
            <surname>Tufiş</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Irimia</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , and
          <string-name>
            <given-names>Barbu</given-names>
            <surname>Mititelu</surname>
          </string-name>
          ,
          <string-name>
            <surname>V.</surname>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>A Trainable Multi-factored QA System</article-title>
          . In Carol Peters et al., editor,
          <source>Working Notes for the CLEF 2009 Workshop</source>
          , pages
          <fpage>14</fpage>
          ,
          <string-name>
            <surname>Corfu</surname>
          </string-name>
          , Greece, September,
          <fpage>30th</fpage>
          - October,
          <year>2nd 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Irimia</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ceauşu</surname>
          </string-name>
          , Al. (
          <year>2010</year>
          ).
          <article-title>Dependency-based translation equivalents for factored machine translation</article-title>
          , In Alexander Gelbukh (ed.) Research In Computer Science,
          <source>Special Issue on NLP and its Applications 46</source>
          , pp.
          <fpage>205</fpage>
          -
          <lpage>216</lpage>
          , ISSN:
          <fpage>1870</fpage>
          -
          <lpage>4069</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Koehn</surname>
            ,
            <given-names>Ph.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>EuroParl: A Parallel Corpus for Statistical Machine Translation</article-title>
          .
          <source>MT Summit</source>
          <year>2005</year>
          . Phuket, Thailand. http://people.csail.mit.edu/koehn/publications/europarl/
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Koehn</surname>
            ,
            <given-names>Ph.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Callison-Burch</surname>
          </string-name>
          , Ch.,
          <string-name>
            <surname>Federico</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bertoldi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cowan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , Moran, Ch.,
          <string-name>
            <surname>Zens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Dyer, Ch.,
          <string-name>
            <surname>Bojar</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Constantin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herbst</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Moses: Open Source Toolkit for Statistical Machine Translation, Annual Meeting of the Association for Computational Linguistics (ACL), demonstration session</article-title>
          , Prague, Czech Republic,
          <year>June 2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Moore</surname>
            ,
            <given-names>R. C.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>Fast and Accurate Sentence Alignment of Bilingual Corpora</article-title>
          . In Machine Translation:
          <article-title>From Research to Real Users (Proceedings, 5th Conference of the Association for Machine Translation in the Americas</article-title>
          , Tiburon, California),
          <source>SpringerVerlag</source>
          , Heidelberg, Germany, pp.
          <fpage>135</fpage>
          -
          <lpage>244</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Och</surname>
            ,
            <given-names>F. J.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Minimal Error Rate Training in Statistical Machine Translation</article-title>
          .
          <source>In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <year>July 2003</year>
          , pp.
          <fpage>160</fpage>
          -
          <lpage>167</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Sanderson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>1997</year>
          ).
          <article-title>Word Sense Disambiguation and Information Retrieval</article-title>
          .
          <source>Technical Report (TR-1997-7)</source>
          , University of Glasgow, Glasgow G12 8QQ, UK,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Steinberger</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pouliquen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Widiger</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ignat</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erjavec</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tufiş</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varga</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages</article-title>
          .
          <source>In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC</source>
          <year>2006</year>
          ), pp.
          <fpage>2142</fpage>
          -
          <lpage>2147</lpage>
          , Genoa, Italy, May
          <year>2006</year>
          . ELRA - European Language Ressources Association.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Tiedemann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces</article-title>
          . In N. Nicolov and
          <string-name>
            <given-names>K.</given-names>
            <surname>Bontcheva</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Angelova</surname>
          </string-name>
          and R. Mitkov (eds.)
          <source>Recent Advances in Natural Language Processing</source>
          (vol V), pages
          <fpage>237</fpage>
          -
          <lpage>248</lpage>
          , John Benjamins, Amsterdam/Philadelphia.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Tufiş</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ion</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bozianu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceauşu</surname>
          </string-name>
          , Al., and
          <string-name>
            <surname>Ştefănescu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2008a</year>
          ).
          <article-title>Romanian WordNet: Current State, New Applications and Prospects</article-title>
          . In Attila Tanacs, Dora Csendes, Veronika Vincze, Christiane Fellbaum,
          <source>Piek Vossen: Proceedings of 4th Global WordNet Conference</source>
          , GWC-2008, University of Szeged, Hungary, January
          <volume>22</volume>
          -25
          <year>2008</year>
          , pp.
          <fpage>441</fpage>
          -
          <lpage>452</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Tufiş</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ion</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceauşu</surname>
          </string-name>
          , Al., and
          <string-name>
            <surname>Ştefănescu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2008b</year>
          ).
          <article-title>RACAI's Linguistic Web Services</article-title>
          .
          <source>In Proceedings of the 6th Language Resources and Evaluation Conference - LREC</source>
          <year>2008</year>
          , Marrakech, Morocco, May
          <year>2008</year>
          . ELRA - European
          <source>Language Ressources Association. ISBN 2-9517408-4-0.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Vossen</surname>
          </string-name>
          , P. (eds) (
          <year>1998</year>
          ).
          <article-title>EuroWordNet: A Multilingual Database with Lexical Semantic Networks</article-title>
          , Kluwer Academic Publishers, Dordrecht.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>