<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>INAOE-UPV Joint Participation at CLEF 2005: Experiments in Monolingual Question Answering</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>M. Montes-y-Gómez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>L. Villaseñor-Pineda</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M. Pérez-Coutiño</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>J. M. Gómez-Soriano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>E. Sanchis-Arnal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>P. Rosso</string-name>
          <email>prosso@dsic.upv.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Departamento de Sistemas Informáticos y Computación Universidad Politécnica de Valencia (UPV)</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Laboratorio de Tecnologías del Lenguaje Instituto Nacional de Astrofísica</institution>
          ,
          <addr-line>Óptica y Electrónica (INAOE)</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recent works on question answering are based on complex natural language processing techniques: named entity extractors, parsers, chunkers, etc. While these approaches have proven to be effective they have the disadvantage of being targeted to a particular language. In this paper we present a full datadriven method that uses simple lexical pattern matching and statistical techniques in order to identify the relevant passages as well as the more probable candidate answers for factual and definition questions. The main quality of this method is that it can be applied to different languages without requiring major adaptation changes. Experimental results of the method in Spanish, Italian and French show that the approach can be a practical solution for monolingual and multilingual question answering applications.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The volume of online available documents is growing every day. As a consequence, better information retrieval
methods are required to achieve the needed information. Question Answering (QA) systems are information
retrieval applications whose aim is to provide inexperienced users with a flexible access to the information,
allowing them writing a query in natural language and obtaining not a set of documents which contain the answer,
but the concise answer itself (Vicedo et al, 2003). That is, given a question like: “Where is the Popocatepetl
located?”, a QA system must respond “Mexico”, instead of just returning a list of documents related to the
volcano.</p>
      <p>Recent developments in QA use a variety of linguistic resources to help in understanding the questions and
the documents. The most common linguistic resources include: part-of-speech taggers, parsers, named entity
extractors, dictionaries, and WordNet (Jijkoun et al., 2004; Ageno et al., 2004; Pérez-Coutiño et al., 2004).
Despite of the promising results of these approaches, they have two main inconveniences: (i) the construction of
such linguistic resources is a very complex task; and (ii) these resources are highly binding to a specific
language.</p>
      <p>In this paper we present a QA system that allows answering factual and definition questions. This system is
based on a full data-driven approach (Brill et al., 2001), which requires minimum knowledge about the lexicon
and the syntax of the specified language. Mainly, it is supported on the idea that the questions and their answers
are commonly expressed using the same set of words, and therefore, it simply uses a lexical pattern matching
method to identify relevant document passages and to extract the candidate answers.</p>
      <p>The proposed approach has the advantage to be easily adapted to several different languages, in particular to
moderately inflected languages such as Spanish, English, Italian and French. Unfortunately, this generality has
its price. To obtain a good performance, the approach requires using a redundant target collection, that is, a
collection in which the question answers occurs more than once. On the one hand, this redundancy increases the
probability of finding a passage containing a simple lexical matching between the question and the answers. On
the other hand, it enhances the answer extraction, since correct answers tend to be more frequent than incorrect
responses.</p>
      <p>The presented system also uses a set of heuristics that attempt to capture some regularities of language and
some stylistic conventions of news letters. For instance, it considers that most named entities are written with an
initial uppercase letter, and that most concept definitions are usually expressed using a very small number of
fixed arrangements of noun phrases. This kind of heuristics guides the extraction of the candidate answers from
the relevant passages.</p>
      <p>In the rest of the paper we present the main architecture of our data-driven QA system. We also discuss the
evaluation results on Spanish, Italian and French.</p>
    </sec>
    <sec id="sec-2">
      <title>System Overview</title>
      <p>The figure 1 shows the general architecture of our system. It is divided in two main modules. One of them
focuses on answering factual questions. It considers the tasks of passage indexing, where documents are
preprocessed, and a structured representation of the collection is built; passage retrieval, where the passages with more
probability to contain the answer are recovered from the index; and answer extraction; where candidate answers
are ranked and the final answer recommendation of the system is produced.</p>
      <p>The other module concentrates on answering definition questions. It includes the tasks of definition
extraction; where all possible pairs of acronym-meaning and referent-description are located and indexed; and
definition selection, where the relevant data pair is identified and the final answer of the system is generated.</p>
      <p>The following sections describe each of these modules and their main tasks.</p>
      <p>Passage
Indexing
Definition
Extraction</p>
      <p>Passage
Index
Definition</p>
      <p>Index</p>
      <p>Factual Questions
Question</p>
      <p>Question</p>
      <p>Classification
Definition Questions</p>
      <p>Passage
Retrieval</p>
      <p>Answer
Extraction
Definition
Selection
Docs
Answer
offline</p>
      <p>online
The passage retrieval (PR) method is specially suited for the QA task (Gómez-Soriano et al., 2005). It allows
retrieving the passages with the highest probability to contain the answer, instead of simply recover the passages
sharing a subset of words with the question.</p>
      <p>Given a user question, the PR method finds the passages with the relevant terms (non-stopwords) using a
classical information retrieval technique based on the vector space model. Then, it measures the similarity between
the n-gram sets of the passages and the user question in order to obtain the new weights for the passages. The
weight of a passage is related to the largest n-gram structure of the question that can be found in the passage
itself. The larger the n-gram structure, the greater the weight of the passage. Finally, it returns to the user the
passages with the new weights.
3.1.1</p>
      <sec id="sec-2-1">
        <title>Similarity measure</title>
        <p>
          The similarity between a passage d and a question q is defined by (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ).
        </p>
        <p>
          n
∑ ∑ h(x( j), D j )
sim(d , q) = jn=1 ∀x∈Qj
∑ ∑ h(x( j), Q j )
j=1 ∀x∈Qj
(
          <xref ref-type="bibr" rid="ref1">1</xref>
          )
Where sim(d, q) is a function which measures the similarity of the set of n-grams of the question q with the set of
n-grams of the passage d. Qj is the set of j-grams that are generated from the question q and Dj is the set of
jgrams of the passage d. That is, Q1 will contain the question unigrams whereas D1 will contain the passage
unigrams, Q2 and D2 will contain the question and passage bigrams respectively, and so on until Qn and Dn. In both
cases, n is the number of question terms.
        </p>
        <p>
          The result of (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) is equal to 1 if the longest n-gram of the question is in the set of passage n-grams.
        </p>
        <p>
          The function h(x(j), Dj) measures the relevance of the j-gram x(j) with respect to the set of passage j-grams,
whereas the function h(x(j), Qj) is a factor of normalization1. The function h assigns a weight to every question
ngram as defined in (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ).
        </p>
        <p>
          Where the notation xˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) indicates the k-th unigram included in the j-gram x, and wxˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) specifies the
associated weight to this unigram. This weight gives an incentive to the terms –unigrams– that appear rarely in the
document collection. Moreover, this weight should also discriminate the relevant terms against those (e.g.
stopwords) which often occur in the document collection.
        </p>
        <p>
          The weight of a unigram is calculated by (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ):
⎧ j
⎪∑ wxˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          )
h(x( j), D j ) = ⎨ k=1
⎪⎩ 0
if x( j) ∈ D j
otherwise
wxˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) = 1
log(nxˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) )
1 + log(N )
        </p>
        <p>
          Where nxˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) is the number of passages in which appears the unigram xˆk (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) , and N is the total number of
passages in the collection. We assume that the stopwords occur in every passage (i.e., n takes the value of N). For
instance, if the term appears once in the passage collection, its weight will be equal to 1 (the maximum weight),
whereas if the term is a stopword, then its weight will be the lowest.
3.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Answer Extraction</title>
        <p>This component aims to establish the better answer to the given question. In order to do that, it first determines a
small set of candidate answers, and then, it selects the final unique answer taking into consideration the position
of the candidate answers inside the retrieved passages.</p>
        <p>The algorithm applied to extract the more probable answer from the given set of relevant passages is
described below. For more detail refer to (Del-Castillo et al., 2004).</p>
        <p>1. Extract all the unigrams that satisfy some given typographic criteria. These criteria depend on the
type of expected answer. For instance, if the expected answer is a named entity, then we select the
unigrams starting with an uppercase letter. But if the answer must be a quantity, then we select the
unigrams expressing numbers.
2. Determine all the n-grams assembled from the selected unigrams. These n-grams can only contain
the selected unigrams and some stopwords.
3. Rank the n-grams based on their compensated frequency. The compensated frequency of the n-gram
x(n) is computed as follows:</p>
        <p>Fx(n) =
1 n n-i+1</p>
        <p>∑ ∑
n i=1 j=1</p>
        <p>
          f xˆ j (i)
∑ f y(i)
∀ y∈Gi
(
          <xref ref-type="bibr" rid="ref2">2</xref>
          )
(
          <xref ref-type="bibr" rid="ref3">3</xref>
          )
(
          <xref ref-type="bibr" rid="ref4">4</xref>
          )
where Gi indicates the set of i-grams, y(i) represents the i-gram y, xˆ j (i) is the j-th i-gram included
in x(n), f y(i) specifies the frequency of occurrence of the i-gram y, and Fx(n) indicates the
compensated frequency of x(n).
4.
5.
        </p>
        <p>Select the top five n-grams as candidate answers.</p>
        <p>Compute a ranking score for each candidate answer. This score is defined as the weight of the first
retrieved passage (refer to formula 1) that contains the candidate answer.</p>
        <p>Select as the final respond the candidate answer with the greatest ranking score. In the case that two
or more of the candidate answers have the same ranking score, then we select the one with the
greatest compensated frequency.
1 We introduce the notation x(n) for the sake of simplicity. In this case x(n) indicates the n-gram x of size n.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Answering Definition Questions</title>
      <p>Our system uses an alternative method to answer definition questions. This method makes use of some
regularities of language and some stylistic conventions of news letters to capture the possible answer for a given
definition question. A similar approach was presented in (Ravichandran et al., 2001; Saggion, 2004).</p>
      <p>The process of answering a definition question considers to main tasks. First, the definition extraction, which
detects the text segments that contains the description or meaning of a term (in particular those related with the
name of a person or an organization). Then, the definition selection, where the most relevant description for a
given question term is identified and the final answer of the system is generated.
4.1</p>
      <sec id="sec-3-1">
        <title>Definition Extraction</title>
        <p>The language regularities and the stylistic conventions of news letters are captured by two basic lexical patterns.
These patterns allow constructing two different definition catalogs. The first one includes a list of pairs of
acronym-meaning. The other one consists of a list of referent-description couples.</p>
        <p>
          In order to extract the acronym-meaning pairs we use an extraction pattern based on the use of parentheses.
w1 &lt;meaning&gt; ( &lt;acronym&gt; )
(
          <xref ref-type="bibr" rid="ref5">5</xref>
          )
        </p>
        <p>In this pattern, w1 is a lowercase non stopword, &lt;meaning&gt; is a sequence of words starting with an uppercase
letter (that can also include some stopwords), and &lt;acronym&gt; indicates a single word also starting with an
uppercase letter.</p>
        <p>By means of this pattern we could identify pairs like [PARM – Partido Auténtico de la Revolución Mexicana].
In particular this pair was catch from the following paragraph:
“El Partido Auténtico de la Revolución Mexicana (PARM) nombró hoy, sábado, a Álvaro
Pérez Treviño candidato presidencial de ese organismo para las elecciones federales del 21 de
agosto de 1994”.</p>
        <p>In contrast, the extraction of referent-description pairs is guided by the occurrence of a special kind of
appositive phrases. This information was encapsulated in the following extraction pattern.</p>
        <p>
          w1 w2 &lt;description&gt; , &lt;referent&gt; ,
(
          <xref ref-type="bibr" rid="ref6">6</xref>
          )
        </p>
        <p>Where w1 may represent any word, except for a preposition, w2 is a determiner, &lt;description&gt; is a free
sequence of words, and &lt;referent&gt; indicates a sequence of words starting with an uppercase letter or being in the
stopwords list.</p>
        <p>Applying this extraction pattern over the below paragraph we could find the pair [Alain Lombard - El director
de la Orquesta Nacional de Burdeos].</p>
        <p>“El director de la Orquesta Nacional de Burdeos, Alain Lombard, ha sido despedido por el
Ayuntamiento de esta ciudad, que preside Alain Juppé, tras un informe que denuncia malos
funcionamientos y gastos excesivos”.
4.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Definition Selection</title>
        <p>
          The main quality of the above extraction patterns is their generality. They can be applied to different languages
without requiring major adaptation changes. However, this generality causes the patterns to often extract non
relevant information, i.e., information that does not indicate a relation acronym-meaning or referent-description.
For instance, when using the extraction pattern (
          <xref ref-type="bibr" rid="ref5">5</xref>
          ) to analyze the following news we obtain the incorrect
definition pair [Ernie Els - AFS]. In this case the resultant pair does not express an acronym-meaning relation; instead
it indicates a person-nationality association.
        </p>
        <p>Ernie Els (AFS) se mantiene en cabeza de la lista de ganancias de la "Orden de Mérito" de
golf, con más de 17 millones de pesetas, mientras que el primer español es Miguel Angel
Martín, situado en el puesto decimoséptimo, con 4.696.020.</p>
        <p>Given that the catalogs contains a mixture of correct and incorrect relation pairs, it is necessary to do an
additional process in order to select the most probable answer for a given definition question. The proposed approach
is supported on the idea that, on the one hand, the correct information is more abundant than the incorrect one,
and on the other hand, that the correct information is redundant.</p>
        <p>Thus, the process of definition selection considers the following two criteria:
1. The more frequent definition in the catalog has the highest probability to be the correct answer.
2. The largest and therefore more specific definitions tend to be the more pertinent answers.</p>
        <p>The following example illustrates the process. Assume that the user question is “who is Félix Ormazabal?”,
and that the definition catalog contains the records showed below. Then, the method selects the description
“diputado general de Alava” as the most probable answer. This decision is based on the fact that this answer is the
more frequent description in the catalog related to Félix Ormazabal.</p>
        <p>Félix Ormazabal: Joseba Egibar:
Félix Ormazabal: candidato alavés:
Félix Ormazabal: diputación de este territorio:
Félix Ormazabal: presidente del PNV de Alava y candidato a diputado general:
Félix Ormazabal: nuevo diputado general
Félix Ormazabal: diputado Foral de Alava
Félix Ormazabal: través de su presidente en Alava
Félix Ormazaba : diputado general de Alava
Félix Ormazabal: diputado general de Alava</p>
        <p>Félix Ormazabal: diputado general de Alava
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Evaluation Results</title>
      <p>We participate in the evaluation task on three different languages: Spanish, Italian and French. For each
language we submitted two runs. The first group of them (tova051 runs) implements the system as described in the
previous sections. The second group (tova052 runs) resolves some factual questions as if they were definition
questions. The selected questions were those asking for the name of a personality or for the acronym of an
organization.</p>
      <p>Table 1 shows our global results on the three languages. It is noticed that the Spanish results were slightly
better than the Italian and French ones.</p>
      <p>The following tables detail our results by question types. Table 2 shows the accuracy on factual questions;
table 3 indicates the results on definition questions, and finally, table 4 shows the achieved results on temporal
questions. Our general conclusion is that the method for answering factual questions is language independent.
Unfortunately we can assert the same for our approach to answer definition questions.</p>
      <p>On the other hand, it is important to mention that the temporal questions were treated as if they were factual
questions. Currently we do not have a specific method for answering this kind of questions.</p>
      <p>As we mention in section 4, the definition catalogs have several erroneous registers. Also, they are
incomplete, since they do not include all possible acronym-meaning and referent-description pairs. Nevertheless, they
contain a large amount of registers and constitute a valuable information repository for answering definition
questions. The tables 5 and 6 compare the catalogs that were extracted for each language.</p>
      <p>The above tables reveal an important association between the sizes of the document collection and the
generated catalogs. This is of great relevance since our approach of answer selection is mainly based on the definition
redundancy. However, the tables also seem indicate that the extraction patterns are not totally language
independent. This assertion is based on the fact that we obtain fewer descriptions per referent for the French, even
when the Italian collection is slightly smaller.</p>
      <p>The tables 7 and 8 show our results on answering definition questions related to acronyms as well as to
personality descriptions.
This paper presents a question answering system that allows answering factual and definition questions. This
system is based on a full data-driven approach. The main idea behind the approach is that the questions and their
answers are commonly expressed using the same set of words, and therefore, it simply uses a lexical pattern
matching method to identify relevant document passages and to extract the candidate answers.</p>
      <p>The experiments on Spanish, Italian and French have shown the potential and portability of our approach.
They also indicated that our method for answering factual question, which is based on the matching and counting
of n-grams, is language independent. However, this method greatly depends on the redundancy of the answers in
the target collection. This condition limited the method to a poor accuracy.</p>
      <p>On the contrary, the method for answering definition questions is very precise. Nevertheless, we can not
conclude about it language independence.</p>
      <p>As future work we plan to improve the ranking score for the factual answers. This will help in reducing the
dependence of our method to the data redundancy. We also consider to evaluate the quality of the definition
catalogs in order to conclude something about the language independence of our approach.
Acknowledgements. This work was done under partial support of CONACYT (Project Grant 43990),
SNIMexico, and the Human Language Technologies Laboratory of INAOE. We also like to thanks to the CLEF
organization committee.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ageno</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferrés</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>González</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kanaan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodríguez</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Surdeanu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Turmo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <article-title>TALP-QA System for Spanish at CLEF-2004</article-title>
          . Working Notes for the CLEF 2004 Workshop, Bath, UK,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Brill</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banko</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumais</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <article-title>Data-intensive Question Answering</article-title>
          .
          <source>TREC 2001 Proceedings</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Del-Castillo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montes-</surname>
            y-Gómez,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Villaseñor-Pineda</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <article-title>QA on the web: A preliminary study for Spanish language</article-title>
          .
          <source>Proceedings of the 5th Mexican International Conference on Computer Science (ENC04)</source>
          , Colima, Mexico,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gómez-Soriano</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montes-</surname>
            y-Gómez,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sanchis-Arnal</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            <given-names>P.</given-names>
          </string-name>
          <article-title>A Passage Retrieval System for Multilingual Question Answering</article-title>
          . To appear
          <source>in the proceedings of the 8th International Conference on Text, Speech and Dialog</source>
          ,
          <string-name>
            <surname>TSD</surname>
          </string-name>
          <year>2005</year>
          . Karlovy Vary, Czech Republic,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Jijkoun</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mishne</surname>
            , G., de Rijke,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlobach</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ahn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          . The University of Amsterdam at QA@
          <article-title>CLEF 2004</article-title>
          .
          <article-title>Working Notes for the CLEF</article-title>
          2004 Workshop, Bath, UK,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pérez-Coutiño</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solorio</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <article-title>Montes-y-</article-title>
          <string-name>
            <surname>Gómez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>López-López</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Villaseñor-Pineda</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <article-title>The Use of Lexical Context in Question Answering for Spanish</article-title>
          .
          <source>Working Notes for the CLEF 2004 Workshop</source>
          , Bath, UK,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ravichandran</surname>
            <given-names>D.</given-names>
          </string-name>
          and Hovy E.
          <article-title>Learning Surface Text Patterns for a Question Answering System</article-title>
          .
          <source>In ACL Conference</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Saggion</surname>
            ,
            <given-names>H. Identifying</given-names>
          </string-name>
          <article-title>Definitions in Text Collections for Question Answering</article-title>
          .
          <source>LREC</source>
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Vicedo</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peñas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Massot</surname>
          </string-name>
          , M. Los sistemas de Búsqueda de
          <article-title>Respuestas desde una perspectiva actual</article-title>
          .
          <source>Revista de la Sociedad Española para el Procesamiento del Lenguaje Natural, n.31</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>