<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The PUCRS-PLN Group participation at CLEF 2006</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marco Gonzalez</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>- Prédio</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>- PPGCC</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Porto Alegre</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brazil</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>gonzalez</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>vera} @inf.pucrs.br</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>This paper presents the 2006 participation of the PUCRS-PLN Group in CLEF Monolingual Ad Hoc Task for Portuguese. We participated with the TR+ Model based on nominalization, binary lexical relations (BLR), Boolean queries, and the evidence concept. Our alternative strategy for lexical normalization, the nominalization, is the transformation of a word (adjective, verb, or adverb) into a semantically corresponding noun. BLRs, which identify relationships between nominalized terms, capture phrasal cohesion mechanisms, like those that occur between subject and predicate, subject and object (direct or indirect), noun and adjective or verb and adverb. In our strategy, an index unit may be a single term or a BLR, and we adopt the evidence concept, i.e., the index unit weighting depends on the occurrence of phrasal cohesion mechanisms, besides the frequency of occurrence. We detail here these features, which implement lexical normalization and term dependence in an information retrieval system.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;CLEF</kwd>
        <kwd>ad hoc</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        The PUCRS-PLN Group participated in CLEF 2006 in Monolingual Ad Hoc Portuguese Retrieval Task with the
trevd06 run using manual query construction from topics. This run adopts the TR+ Model [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        TR+ Model is based on nominalization [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ], binary lexical relations (BLRs) [
        <xref ref-type="bibr" rid="ref4 ref6">4, 6</xref>
        ], Boolean queries [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and
the evidence concept [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Nominalization is an alternative strategy used for lexical normalization. BLRs, which
identify relationships between nominalized terms, and Boolean queries are strategies to specify term
dependences. The evidence concept is part of TR+ Model for term weighting using word frequency and phrasal
cohesion mechanisms [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Trevd06 run uses a probabilistic approach for information retrieval.
      </p>
      <p>In our strategy, a index unit (henceforth descriptor) may be a single term (e.g.: “house”) or a relationship
between terms (e.g.: “house of stone”). BLRs represent those relationships (e.g.: “of(house,stone)”). To each
descriptor (a term or a BLR) is assigned a weight, an evidence in TR+ Model. Its evidence shows the importance of the
concept that the term or the BLR describes in the text. Descriptors and their weights constitute the descriptor
space.</p>
      <p>This paper is organized as follows. Section 2 introduces the TR+ Model based on the nominalization process,
the binary lexical relation recognition, a new term weighting schema based on the evidence concept, and the
Boolean query formulation. Section 3 describes the collection, features of indexing files, and difficulties found.
Section 4 shows the results of trevd06, and Section 5 presents final considerations.</p>
      <p>
        In TR+ Model [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], documents and queries in natural language receive the same treatment in order to construct
the descriptor space, in indexing phase, and to start the Boolean query formulation, in searching phase. First, in
preprocessing step there are tokenization (words and punctuations are identified) and morphological tagging
(morphological tags are assigned to each word or punctuation). Then, the nominalization process is performed
for generating nominalized terms and, in the next step, BLRs are extracted.
      </p>
      <p>documents
preprocessing
nominalization
BLR extraction
terms
and
BLRs
classified
document
references
classification</p>
      <p>search
Boolean query
formulation
query in natural language
preprocessing
nominalization</p>
      <p>BLR extraction
indexing phase</p>
      <p>searching phase
In searching phase, we look for nominalized terms and BLRs recognized in the query, in the descriptor space.
The document relevance values are computed according to descriptor weights (evidences) and to predefined
Boolean operations included in the query. Finally, the documents are classified.</p>
      <sec id="sec-1-1">
        <title>3.1 Nominalization process</title>
        <p>
          Nominalization [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] is an alternative strategy used for lexical normalization. It is based on the fact that nouns are
usually the most representative words of the document content [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], and because queries are usually formulated
through noun phrases [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. In our work, nominalization is understood as the transformation of a word (adjective,
verb, or adverb) found in the text, into a semantically corresponding noun which appears in the lexicon.
        </p>
        <p>Nominalization operations, according to TR+ Model, derive abstract and concrete nouns. Abstract nouns refer
to events (e.g., “to meet ® meeting”), qualities (e.g., “good ® goodness”), states (e.g., “free ® freedom”), or other
abstract entities, which can be derived from adjectives, participles, verbs, or adverbs. Concrete nouns, on the
other hand, refer to agents mostly derived from verbs (e.g., “to build ® builder”), or something that is involved or
associated with an entity, mainly derived from adjectives (e.g., “numerical ® number”).</p>
        <p>
          To develop this idea, an automatic nominalization process was implemented and integrated to our indexing
strategy. We developed the tools FORMA and CHAMA that automatically derive nominalized terms from a
Brazilian Portuguese text [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. Four finite automata perform the nominalization process: the synonymy automaton
with 327 entries, the exception automaton with 4,223 exceptions, the adjective pattern automaton with 663
patterns, and the verb pattern automaton with 351 patterns. The rules for the derivation used by these automata were
manually constructed following the descriptions found in the Aurélio Portuguese Dictionary [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>Figure 2 shows the output for the sentence “Controle trabalhista exato adotado em equipe” (“Exact labor control
adopted in team”).</p>
        <p>Controle controle 0 0 _SU
trabalhista trabalhista trabalho 0 _AJ
exato exato exatidao 0 _AJ
adotado adotar adocao adotante _AP
em em 0 0 _PR
equipe equipe 0 0 _SU
. . 0 0 _PN
– the original word, (e.g., “adotado” (“adopted”)),
– the lemma, (e.g., “adotar” (“to adopt”)),
– the abstract noun (e.g., “trabalho” (“work”), “exatidão” (“accuracy”) and “adoção” (“adoption”)), when it exists, or
zero, if there is no nominalization,
– the concrete noun (e.g., “adotante” (“who adopts”)), when it exists, or zero, if there is no nominalization, and
– the part-of-speech tag (in Figure 2: _SU=noun, _AJ=adjective, _AP=participle, _PR=preposition, and
_PN=pontuaction mark).</p>
      </sec>
      <sec id="sec-1-2">
        <title>3.2 Binary Lexical Relations</title>
        <p>
          BLRs [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] identify relationships between nominalized terms. These relationships capture phrasal cohesion
mechanisms [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] like those that occur between subject and predicate, subject and object (direct or indirect), noun and
adjective or verb and adverb. Such mechanisms reveal term dependences.
        </p>
        <p>A BLR has the form id(t1,t2) where id is a relation identifier, and t1 and t2 are its arguments (nominalized
terms).</p>
        <p>
          There are three kinds of BLRs:
- Classification: where id is the equal sign, t1 is a subclass or an instance of t2, and t2 is a class. The
classification BLR example =(dida,goalkeeper) may be extracted from the string the goalkeeper Dida.
- Restriction: where id is a preposition, t1 is a modifier and t2 is its head. The mapping of syntactic
dependencies onto semantic relations [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], concerning the prepositions, is the purpose of the BLR restriction. The
restriction BLR example of(quickness,team) may be extracted from the string the quick team.
- Association: where id is an event, t1 is a subject and t2 is a direct or indirect object. An association may be
prepositioned or not. The prepositioned association BLR example travel.across(tourist,europe) may be extracted
from the string the tourist traveled across Europe. The association BLR example training(coach,athlete) may be
extracted from the string the coach trained the athlete.
        </p>
        <p>
          We developed a tool named RELLEX that automatically extracts BLRs from a Brazilian Portuguese text. For
more details about BLRs and BLR extraction rules see [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Those rules, and the nominalization process, are
resources used to extract a unique BLR derived from different syntactic structures with the same semantics.
        </p>
      </sec>
      <sec id="sec-1-3">
        <title>3.3 Evidence concept and descriptor weighting</title>
        <p>
          Evidence is information that gives a strong reason for believing something or that proves something; evidences
are signs, indications; something is evident if it is obvious [
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
          ]. The evidence concept is crucial for TR+ Model
that adopts descriptor weighting based on this concept, i.e., the weighting is not only based on the descriptor
occurrence frequency. The descriptor representativeness depends, besides the frequency of occurrence, on the
occurrence of phrasal cohesion mechanisms.
        </p>
        <p>The evidence (evdt,d) of a term t in a document d is:
evdt,d =
f t,d +
2
r
f r,t,d
where:
ft,d is the occurrence frequency of t in d, and
fr,t,d is the number of BLRs in d where t is an argument.</p>
        <p>On the other hand, the evidence evdr,d of a BLR r in a document d is:</p>
        <p>
          evd r,d = f r,d (evdt1,d + evdt 2,d )
where:
fr,d is the occurrence frequency of r in d, and
evdt1,d and evdt2,d are the evidences of t1 and t2, respectively, and t1 and t2 are arguments of r.
We adopted the Okapi BM25 formula [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] without IDF factor (which did not contribute for improvements for
TR+ Model). So, the weight Wi,d based on the evidence concept for a descriptor i (a nominalized term or a BLR)
in a document d is given by:
(1)
(2)
k1 ((1- b) + b
        </p>
        <p>DLd ) + evdi,d
AVDL
where:
k1 and b are parameters whose values are 1.2 and 0.75 respectively;
DLd is the length of d and AVDL is the average document length in the collection; and
evdi,d is the descriptor evidence (evdt,d for a term t or evdr,d for a BLR r).</p>
        <p>In TR+ Model, query descriptors have their weight computed by the same formula used for documents. The
relevance value RVd,q of a document d for a query q is given by:
(3)
(4)
RVd ,q =
i</p>
        <p>Wi,d Wi,q
si
where:
i is a term or a BLR;
Wi,d is the weight for descriptor i in d;
Wi,q is the weight for descriptor i in q; and
si = 2, if i is a BLR with the same arguments but different relation identifiers in d and q, or
si = 1, if i is a term or a BLR with the same arguments and identifiers.</p>
        <p>The document classification depends on the relevance values and the Boolean query formulation.</p>
      </sec>
      <sec id="sec-1-4">
        <title>3.4 Boolean query and grouped classification of documents</title>
        <p>A query q formulated by the user, in TR+ Model, is recognized as a text like a text document. A Boolean query
qb, automatically derived from a query q, is formulated according to the following grammar (in EBNF
formalism):
&lt;qb&gt; ® [ &lt;BLRDisj&gt; OR ] &lt;TermConj&gt;
&lt;BLRDisj&gt; ® &lt;r&gt; [ OR &lt;BLRDisj&gt; ]
&lt;TermConj&gt; ® (&lt;TermDisj&gt; [ AND &lt;TermConj&gt; ])
&lt;TermDisj&gt; ® (h1(&lt;w&gt;) OR h2(&lt;w&gt;)) ½ (h1(&lt;w&gt;)) ½ (h2(&lt;w&gt;)) ½
&lt;r&gt; ® BLR
&lt;w&gt; ® adjective ½ adverb ½ noun ½ verb
The elements OR and AND are respectively disjunction and conjunction Boolean operators. Let the string “restored
painting” is a query q, then a corresponding Boolean query qb is:</p>
        <p>“of(restoration, painting) ” OR ( (“restoration” OR “restorer” ) AND (“painting”) ) )
In the next step, the retrieved documents are classified in two groups:
– Group I: more relevant documents that fulfill the Boolean query conditions; and
– Group II: less relevant documents that do not completely fulfill the Boolean query conditions, but contain at
least one query term.</p>
        <p>In each of these groups, the documents are ranked in decreasing order of relevance value according to equation
(4).</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>4 Collections and indexing files</title>
      <p>We submitted only one run for the Portuguese Monolingual Ad Hoc Task at CLEF 2006: the trevd06. This run
adopts the TR+ Model, i.e., uses terms and relationships in evidence.</p>
      <p>Trevd06 uses manual query construction from topics, including title, description, and narrative. For each topic,
a query in natural language was entered. Such text constituted the input of our system according to the strategy of
TR+ Model (see Figure 1).</p>
      <p>term
vocabulary
{ t\0 }
preposition
vocabulary</p>
      <p>{ p\0 }
term
index
{ rt rc rr ra rpa }
restriction
index
{ { T’ r } -1 } restriction
id index</p>
      <p>{ { P r } -1 }
association
index</p>
      <p>{ { T’ r } -1 }
prep. assoc.
index
{ { T’ r } -1 }
association
id index</p>
      <p>{ { E r } -1 }
prep. assoc.
id index
{ { E P r } -1 }</p>
      <p>PT collections at CLEF 2006 were Publico95, Publico94, Folha95, and Folha94. Table 1 shows the amounts
of descriptors (terms and BLRs) extracted from each collection.
The Tth record in “term index” file has five offset positions for the term t: rt for the “inverted file for terms”, rc
for the “classification index” file, rr for the “restriction index” file, ra for the “association index” file, and rpa for
“prepositioned association index” file. The default value is -1 if there is no corresponding offset position in those
files.</p>
      <p>The corresponding record for term t in “inverted file for terms” starts with m, indicating the size of the list of
document IDs { D }m where term t occurs with weight w. The inverted list for a term t consists of a sequence
m w { D }m, which is repeated until a m value of 0 is found.</p>
      <p>In “classification index” file, the corresponding record for term t has T’ (a term ID of t’) and the offset position
r in the “inverted file for classifications” for the classification =(t,t’). The sequence T’ r for term t (indicating
classifications where t is the first argument) is repeated until a T’ value of -1 is found. The same strategy is used for
the “restriction index”, “association index”, and “prepositioned association index” files. In these cases, r is an
offset position in “id index” files.</p>
      <p>The corresponding record for the restriction p(t,t’) in “restriction id index” file has P (a preposition ID of p)
and the offset position r in the “inverted file for restrictions”. The corresponding record for the association e(t,t’)
in “association id index” file has E (a term ID of e) and the offset position r in the “inverted file for associations”.
The corresponding record for the prepositioned association e.p(t,t’) in “prepositioned association id index” file
has E (a term ID of e), P (a preposition ID of p), and the offset position r in the “inverted file for prepositioned
associations”.</p>
      <p>For the BLRs, the corresponding record in inverted files has D (a document ID of d), where the BLR occurs
with weight w. The inverted list for a BLR consists of a sequence D w, which is repeated until a D value of -1 is
found.
inverted file for terms</p>
      <p>{ { m w { D }m } 0 }
classification
index
{ { T’ r } -1 } inverted file for classifications</p>
      <p>{ { D w } -1}
inverted file for restrictions</p>
      <p>{ { D w } -1}
inverted file for associations</p>
      <p>{ { D w } -1}
inverted file for prep. assoc.</p>
      <p>{ { D w } -1}</p>
      <sec id="sec-2-1">
        <title>3.4 Difficulties</title>
        <p>This is our first participation in CLEF. Our main goal was to obtain hands-on experience in the Ad Hoc
Monolingual Track on text such as the PT collections. Our prior experience was indexing and searching smaller text
collections using Brazilian Portuguese only.</p>
        <p>The changes applied to a search engine for such task are not simple adjustments and the decision to participate
was taken late. Some misplaces were verified during the indexation phase of trevd06. Our estimation is that at
least 20% of the terms were not indexed due to programming mistakes.</p>
        <p>The differences between Brazilian Portuguese and European one are another source of errors because our
system was designed for the former but not for the European Portuguese. The following example explains this
problem. While Figure 2 shows the output of our nominalization tool for a sentence in Brazilian Portuguese, Figure 4
shows the output for the same sentence (“Controlo laboral exacto adoptado em equipa”) in European Portuguese.</p>
        <p>Controlo controlar controle controlador _VB
laboral laboral laboralidade 0 _AJ
exacto exacto 0 0 _SU
adoptado adoptar adoptacao adoptador _AP
em em 0 0 _PR
equipa equipar equipamento equipador _VB
You should notice that the noun “Controlo” here is tagged as a verb (_VB), the adjective “exacto” as a noun (_SU),
and the noun “equipa” as a verb. Nouns, like “laboralidade” and “adoptação”, are generated erroneously due to
lexical differences between the two languages. On the other hand, nouns, like “controle”, “controlador”,
“equipamento”, and the wrong noun “equipador”, are generated due to incorrect tagging. These mistakes affected the
indexing of terms and BLRs.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4 Results</title>
      <p>- Average Precision: 18,04%
- R-Precision: 22,85%
- Precision at 5 docs: 40,0%
- Precision at 10 docs: 35,0%
Indeed, under the conditions of participation in Ad Hoc Monolingual Portuguese Track at CLEF2006, we could
consider that the results were reasonable and the experience with European Portuguese and largest colletions was
valid.</p>
      <p>There are two immediate works concerning this experience:
– the correction of the indexing errors and the analysis of the impact on retrieving results, and
– the adaptation of our tagging and nominalization tools concerning European Portuguese.</p>
      <p>We must decide on two work directions: (i) to use specialized tools for each language type or (ii) to create a
generic tool for text pre-processing. An alternative that will be considered is to transform variations of words
(like “exato” and “exacto” (“exact”)) into a common form before the indexing phase.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Gamallo</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Agustini</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lopes</surname>
            ,
            <given-names>G</given-names>
          </string-name>
          ; Lima, V. L. S. de.
          <source>Mapping Syntactic Dependencies onto Semantic Relations. ECAI'02, Workshop on Natural Language Processing and Machine Learning for Ontology Engineering</source>
          , Lyon, France,
          <year>2002</year>
          . p.
          <fpage>15</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Crowther</surname>
            ,
            <given-names>J</given-names>
          </string-name>
          . (ed.).
          <article-title>Oxford Advanced Learner's Dictionary of Current English</article-title>
          . New York: Oxford University Press,
          <year>1995</year>
          .
          <volume>1</volume>
          ,430 p.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Ferreira</surname>
            ,
            <given-names>A. B. H.</given-names>
          </string-name>
          <string-name>
            <surname>Dicionário Aurélio Eletrônico - Século XXI</surname>
          </string-name>
          .
          <article-title>Nova Fronteira S.A</article-title>
          ., Rio de Janeiro,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lima</surname>
          </string-name>
          , V. L. S. de; Lima, J. V. de.
          <source>Binary Lexical Relations for Text Representation in Information Retrieval. 10th Int. Conf on Applications of NL to Inf. Systems</source>
          , NLDB,
          <year>2005</year>
          . Springer-Verlag,
          <source>LNCS 3513</source>
          ,
          <year>2005</year>
          . p.
          <fpage>21</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Termos e Relacionamentos em Evidência na Recuperação de Informação</article-title>
          .
          <source>PhD thesis</source>
          , Instituto de Informática, UFRGS,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lima</surname>
            , V. L. S. de; Lima,
            <given-names>J. V.</given-names>
          </string-name>
          <year>de</year>
          .
          <source>7th Comp. Ling. and Intel. Text Processing - CICling</source>
          ,
          <year>2006</year>
          . Springer-Verlag,
          <source>LNCS 3878</source>
          ,
          <year>2006</year>
          . p.
          <fpage>394</fpage>
          -
          <lpage>405</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lima</surname>
            , V. L. S. de; Lima,
            <given-names>J. V.</given-names>
          </string-name>
          <year>de</year>
          .
          <source>7o Encontro para Proc. Comp. da Língua Portuguesa Escrita e Falada - PROPOR</source>
          ,
          <year>2006</year>
          . Springer-Verlag,
          <source>LNCS 3960</source>
          ,
          <year>2006</year>
          . p.
          <fpage>100</fpage>
          -
          <lpage>109</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Mira</given-names>
            <surname>Mateus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. H.</given-names>
            ;
            <surname>Brito</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            ;
            <surname>Duarte</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          ; Faria,
          <string-name>
            <surname>I. H.</surname>
          </string-name>
          <article-title>Gramática da Língua Portuguesa</article-title>
          . Lisboa: Ed. Caminho,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Perini</surname>
            ,
            <given-names>Mário A</given-names>
          </string-name>
          .
          <article-title>Para uma Nova Gramática do Português</article-title>
          . São Paulo: Ed. Ática,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S. E.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Walker</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <article-title>Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval</article-title>
          .
          <source>17th Annual International ACM SIGIR conference on research and development in IR</source>
          ,
          <year>1994</year>
          . Proceedings, p.
          <fpage>232</fpage>
          -
          <lpage>241</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Ziviani</surname>
            ,
            <given-names>N. Text</given-names>
          </string-name>
          <string-name>
            <surname>Operations</surname>
            . In: Baeza-Yates,
            <given-names>R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ribeiro-Neto</surname>
            ,
            <given-names>B. Modern Information</given-names>
          </string-name>
          <string-name>
            <surname>Retrieval</surname>
          </string-name>
          . New York : ACM Press,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>