<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Cross-Language French-English Question Answering using the DLT System at CLEF 2006</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Richard F. E. Sutcliffe</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kieran White Darina Slattery</string-name>
          <email>Darina.Slattery@ul.ie</email>
          <email>Kieran.White@ul.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Igal Gabbay</string-name>
          <email>Igal.Gabbay@ul.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Mulcahy</string-name>
          <email>Michael.Mulcahy@ul.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Documents and Linguistic Technology Group Department of Computer Science University of Limerick</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>On Sabbatical from University of Limerick</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Essex</institution>
          ,
          <addr-line>Wivenhoe Park, Colchester CO4 3SQ</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2005</year>
      </pub-date>
      <abstract>
        <p />
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The basic architecture of our factoid system is standard in nature and comprises query type identification, query
analysis and translation, retrieval query formulation, document retrieval, text file parsing, named entity
recognition and answer entity selection. Factoid classification into 69 query types is carried out using keywords.
Associated with each type is a set of one or more Named Entities. Xelda is used to tag the French query for
partof-speech and then shallow parsing is carried out over these in order to recognise thirteen different kinds of
significant phrase. These were determined after a study of the constructions used in French queries together with
their English counterparts. Our observations were that (1) Proper names usually only start with a capital letter
with subsequent words un-capitalised, unlike English; (
        <xref ref-type="bibr" rid="ref1">2</xref>
        ) Adjective-Noun combinations either capitalised or not
can have the status of compounds in French and hence need special treatment; (3) Certain noun-preposition-noun
phrases are also of significance. The phrases are then translated into English by the engine WorldLingo and using
the Grand Dictionnaire Terminologique, the results being combined. Each phrase has a weight assigned to it by
the parser. A Boolean retrieval query is formulated consisting of an AND of all phrases in increasing order of
weight. The corpus is indexed by sentence using Lucene. The Boolean query is submitted to the engine and if
unsuccessful is re-submitted with the first (least significant) term removed. The process continues until the search
succeeds. The documents (i.e. sentences) are retrieved and the NEs corresponding to the identified query type are
marked. Significant terms from the query are also marked. Each NE is scored based on its distance from query
terms and their individual weights. The answer returned is the highest-scoring NE. Temporarily Restricted
Factoids are treated in the same way as Factoids. Definition questions are classified in three ways: organisation,
person or unknown. This year Factoids had to be recognised automatically by an extension of the classifier. An
IR query is formulated using the main term in the original question plus a disjunction of phrases depending on the
identified type. All matching sentences are returned complete. Results this year were as follows: 32/150 (21%) of
Factoids were R, 14/150 (9%) were X, 4/40 (10%) of Definitions were R and 2 List results were R (P@N = 0.2).
Our ranking in Factoids relative to all thirteen runs was Fourth. However, scoring all systems over R&amp;X together
and including Definitions, our ranking would be Second Equal because we had more X scores than any other
system. Last year our score on Factoids was 26/150 (17%) but the difference is probably the easier queries this
year.
      </p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>This article outlines the participation of the Documents and Linguistic Technology (DLT) Group in the Cross
Language French-English Question Answering Task of the Cross Language Evaluation Forum (CLEF).</p>
    </sec>
    <sec id="sec-3">
      <title>2. Architecture of the CLEF 2006 DLT System</title>
      <sec id="sec-3-1">
        <title>2.1 Outline</title>
        <p>The basic architecture of our factoid system is standard in nature and comprises query type identification, query
analysis and translation, retrieval query formulation, document retrieval, text file parsing, named entity
recognition and answer entity selection.</p>
      </sec>
      <sec id="sec-3-2">
        <title>2.2 Query Type Identification</title>
        <p>As last year, simple keyword combinations and patterns are used to classify the query into a fixed number of
types. Currently there are 69 categories plus the default ‘unknown’. In a major change this year, the queries were
not tagged in the input file as Factoid, Definition or List. Instead this information had to be inferred. We altered
the keyword classifier to recognise Factoids using last year’s data for training. We made no attempt to recognise
List questions and simply treated them as Factoids. This partly explains our low list score.</p>
      </sec>
      <sec id="sec-3-3">
        <title>2.3 Query Analysis and Translation</title>
        <p>We start off by tagging the Query for part-of-speech using XeLDA (2004). We then carry out shallow parsing
looking for various types of phrase. Each phrase is then translated using two different methods. One translation
engine and one dictionary are used. The engine is WorldLingo (2004). The dictionary used was the Grand
Dictionnaire Terminologique (GDT, 2004) which is a very comprehensive terminological database for Canadian
French with detailed data for a large number of different domains. The two candidate translations are then
combined – if a GDT translation is found then the WorldLingo translation is ignored. The reason for this is that if
a phrase is in GDT, the translation for it is nearly always correct. In the case where words or phrases are not in
GDT, then the WorldLingo translation is used.</p>
        <p>The types of phrase recognised were determined after a study of the constructions used in French queries together
with their English counterparts. The aim was to group words together into sufficiently large sequences to be
independently meaningful but to avoid the problems of structural translation, split particles etc which tend to
occur in the syntax of a question, and which the engines tend to analyse incorrectly.</p>
        <p>
          The structures used were number, quote, cap_nou_prep_det_seq, all_cap_wd, cap_adj_cap_nou,
cap_adj_low_nou, cap_nou_cap_adj, cap_nou_low_adj, low_nou_low_adj, low_nou_prep_low_nou,
low_adj_low_nou, nou_seq and wd. These were based on our observations that (1) Proper names usually only
start with a capital letter with subsequent words un-capitalised, unlike English; (
          <xref ref-type="bibr" rid="ref1">2</xref>
          ) Adjective-Noun combinations
either capitalised or not can have the status of compounds in French and hence need special treatment; (3) Certain
noun-preposition-noun phrases are also of significance.
        </p>
        <p>As part of the translation and analysis process, weights are assigned to each phrase in an attempt to establish
which parts are more important in the event of query simplification being necessary.</p>
      </sec>
      <sec id="sec-3-4">
        <title>2.4 Retrieval Query Formulation</title>
        <p>The starting point for this stage is a set of possible translations for each of the phrases recognised above. For each
phrase, a Boolean query is created comprising the various alternatives as disjunctions. In addition, alternation is
added at this stage to take account of morphological inflections (e.g 'go'&lt;-&gt;'went', 'company'&lt;-&gt;'companies' etc)
and European English vs. American English spelling ('neighbour'&lt;-&gt;'neighbor', 'labelled'&lt;-&gt;'labeled' etc). The list
of the above components is then ordered by the weight assigned during the previous stage and the ordered
components are then connected with AND operators to make the complete Boolean query.</p>
      </sec>
      <sec id="sec-3-5">
        <title>Question</title>
      </sec>
      <sec id="sec-3-6">
        <title>Type</title>
        <p>who
when
how_many3
what_country</p>
      </sec>
      <sec id="sec-3-7">
        <title>Translation</title>
        <p>0018 'Qui est le principal organisateur du
concours international "Reine du futur" ?'</p>
        <sec id="sec-3-7-1">
          <title>Who is the main organizer of the</title>
          <p>international contest "Queen of the Future"?
0190 'En quelle année le président de
Chypres, Makarios III est-il décédé ?'</p>
        </sec>
        <sec id="sec-3-7-2">
          <title>What year did the president of Cyprus, Makarios III, die? 0043 'Combien de communautés Di Mambro a-t-il crée ?'</title>
        </sec>
        <sec id="sec-3-7-3">
          <title>How many communities did Di Mambro</title>
          <p>found?
0102 'Dans quel pays l'euthanasie est-elle
autorisée si le patient le souhaite et qu'il
souffre de douleurs physiques et mentales
insupportables ?'</p>
        </sec>
        <sec id="sec-3-7-4">
          <title>In which country is euthanasia permitted if</title>
          <p>requested by a patient suffering intolerable
physical or mental pain?
how_much_rate
0016 'Quel pourcentage de personnes
touchées par le virus HIV vit en Afrique ?'</p>
        </sec>
        <sec id="sec-3-7-5">
          <title>What percentage of people infected by HIV</title>
          <p>lives in Africa?
unknown
0048 'Quel contrat a cours de 1995 à 2004 ?'</p>
        </sec>
        <sec id="sec-3-7-6">
          <title>Which contract runs from 1995 to 2004?</title>
        </sec>
      </sec>
      <sec id="sec-3-8">
        <title>2.5 Document Retrieval</title>
        <p>Lucene (2005) was used to index the LA Times and Glasgow Herald collections, with each sentence in the
collection being considered as a separate document for indexing purposes. This followed our observation that in
most cases the search keywords and the correct answer appear in the same sentence. We use the standard query
language.</p>
        <p>In the event that no documents are found, the conjunct in the query (corresponding to one phrase recognised in
the query) with the lowest weight is eliminated and the search is repeated.</p>
      </sec>
      <sec id="sec-3-9">
        <title>2.6 Text File Parsing</title>
      </sec>
      <sec id="sec-3-10">
        <title>2.7 Named Entity Recognition</title>
      </sec>
      <sec id="sec-3-11">
        <title>2.8 Answer Entity Selection</title>
        <p>This stage is straightforward and simply involves retrieving the matching 'documents' (i.e. sentences) from the
corpus and extracting the text from the markup.</p>
        <p>Named Entity (NE) recognition is carried out in the standard way using a mixture of grammars and lists. The
number of NE types was increased to 75 by studying previous CLEF and TREC question sets.
Answer selection was updated this year so that the weight of a candidate answer is the sum of the weights of all
search terms co-occurring with it. Because our system works by sentence, search terms must appear in the same
sentence as the candidate answer. The contribution of a term reduces with the inverse of its distance from the
candidate.</p>
      </sec>
      <sec id="sec-3-12">
        <title>2.9 Temporally Restricted Questions</title>
        <p>Temporally restricted factoids are processed in exactly the same way as normal factoids. Effectively this means
that any temporal restrictions are analysed as normal syntactic phrases within the query, are translated and hence
become weighted query terms. As with all phases, therefore, the weight assigned depends on the syntactic form of
the restriction and not on any estimate of its temporal restricting significance.
Queries are classified as def_organisation, def_person or def_unknown during the query classification stage using
keywords inferred from last year’s data. This is necessary because Definitions are no longer tagged as such in the
query file – a significant departure from last year. The target is identified in the query (usually the name of an
organisation or person). For an organisation query, a standard list of phrases is then added to the search
expression, each suggesting that something of note is being said about the organisation. Example phrases are ‘was
founded’ and ‘manufacturer of’. All sentences including the target term plus at least one significant phrase are
returned. These are concatenated to yield the answer to the question. This approach does work on occasion but
the result is rarely concise and it can therefore result in inordinate number of answers being judged ineXact. For
def_person queries the method is the same, but using a different set of phrases such as ‘brought up’, ‘founded’
etc. If the categoriser is unable to decide between def_organisation and def_person, it assigns def_unknown
which results in both sets of patterns being used.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Runs and Results</title>
      <p>3.1 Runs</p>
      <sec id="sec-4-1">
        <title>This year we submitted just one run.</title>
        <sec id="sec-4-1-1">
          <title>3.2 Results</title>
          <p>The performance can be summarised as follows: 32 out of 150 Factoids were Right (21%) and 14 out of 150 were
ineXact (9%). 4 out of 40 Definitions were Right (10%). Unfortunately the count of ineXact answers is for
Factoids and Definitions combined. For Lists, 2 Right answers were returned, P@N = 0.2. By comparison, last
year our score on Factoids was 26/150 (17%) but the difference is probably that the queries were easier this year.
In terms of our overall position in the French-English task, there were thirteen runs in all and our ranking on
Factoids is position 4, based on a simple count of correct answers. However we had a lot of X scores, more in
fact than any other submitted run in this task. If we combine R and X and score these over Factoids and
Definitions together our position would be Second Equal.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>3.3 Platform</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusions</title>
      <p>We used a Viglen PC running Windows XP and having 1 Gb RAM. The majority of the system is written in
SICStus Prolog 3.11.1 (SICStus, 2004) with Part-of-Speech tagging, Web translation and Local Context Analysis
components being written in Java.</p>
      <p>The overall performance this year was similar to last. Unfortunately, we were able to do very little work on the
system this year. The only real differences in the system were the automatic recognition of Factoids (quite
successful), the non-recognition of Lists (which lowered our score for these significantly) and the use of just
WorldLingo and GDT instead of these plus Reverso. The last change seemed to make very little difference
although we have not yet quantified this.</p>
    </sec>
    <sec id="sec-6">
      <title>5. References</title>
      <sec id="sec-6-1">
        <title>DTSearch (2000). www.dtsearch.com</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>2.10 Definition Questions</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>