<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Ontological Representation of Documents and Queries for Information Retrieval Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mauro Dragoni</string-name>
          <email>mauro.dragoni@unimi.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Célia da Costa Pereira</string-name>
          <email>celia.pereira@unimi.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea G.B. Tettamanzi</string-name>
          <email>andrea.tettamanzi@unimi.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Università degli Studi di Milano</institution>
          ,
          <addr-line>Dipartimento di Tecnologie, dell'Informazione, Via Bramante 65, I-26013, Crema (CR)</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Università degli Studi di Milano</institution>
          ,
          <addr-line>Dipartimento di Tecnologie, dell'Informazione, Via Bramante 65, I-26013, Crema (CR)</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Università degli Studi di Milano</institution>
          ,
          <addr-line>Dipartimento di Tecnologie, dell'Informazione, Via Bramante 65, I-26013, Crema (CR)</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <fpage>27</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>This paper presents a vector space model approach, for representing documents and queries, using concepts instead of terms and WordNet as a light ontology. This way, information overlap is reduced with respect to the classic semantic expansion techniques. Experiments undertaken on MuchMore benchmark showed the e®ectiveness of the approach.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        This paper presents an ontology-based approach for a
conceptual representation of documents. Such an approach is
inspired by a recently proposed idea presented in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], and
uses an adapted version of that method to standardize the
representation of documents and queries. The proposed
approach is somehow similar to the classic query
expansion technique. However additional considerations have been
taken into account and some improvements have been
applied as explained below.
      </p>
      <p>
        Query expansion is an approach used in Information
Retrieval (IR) in order to improve the system's performance.
It consists of the expansion of the content of the query by
adding the terms that are semantical correlated with the
original terms of the query [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Several works demonstrated
the enhanced performance of IR systems that implement
query expansion approaches [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. However, the query
expansion approach has to be used carefully because, as
demonstrated in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], expansion might degrade the
performance of some individual queries. This is due to the fact
that an incorrect choice of terms and concepts for the
expansion task might harm the retrieval process by drifting it
away from the optimal correct answer.
      </p>
      <p>
        Document expansion applied to IR has been recently
proposed in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In that work a sub-tree approach has been
implemented to represent concepts in documents and queries.
However, when using a tree structure there is a redundancy
of information because more general concepts may be
represented implicitly by using only the leaf concepts they
subsume. The smart idea behind the representation of
documents by using concepts is that documents and queries may
be represented in the same way. This way, the risk of
omitting some related terms (as it may happen in the classical
query expansion technique), is reduced. However, it is
necessary to use a language resource that permits to cover a
higher number of terms in order to avoid information loss.
      </p>
      <p>
        This paper presents a new representation for documents
and queries. The proposed approach exploits the structure
of the well-known machine readable dictionary WordNet in
order to reduce the redundancy of information generally
contained in a concept-based document representation. The
second improvement is the reduction of the computational
time needed to compare documents and queries represented
by using concepts. This representation has been applied
to the ad-hoc retrieval problem. The approach has been
evaluated on the MuchMore1 Collection [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and the results
demonstrate its viability.
      </p>
      <p>In Section 2 an overview of the environment in which
ontology has been used is presented. Section 3 presents the
tools used for this work. Section 4 illustrates the proposed
approach to represent information, while Section 5 compares
this approach with other two well-known approaches used in
conceptual representation of documents. In Section 6 the
results obtained from the test campaign are discussed. Finally,
Section 7 concludes.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORKS</title>
      <p>
        An increasing number of recent information retrieval
systems make use of ontologies to help the users clarify their
information needs and come up with semantic
representations of documents. Many ontology-based information
retrieval systems and models have been proposed in the last
decade. An interesting review on IR techniques based on
ontologies is presented in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], while in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] the author
studies the application of ontologies to a large-scale IR system
for web purposes. A model for the exploitation of
ontologybased knowledge bases is presented in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The aim of this
model is to improve search over large document
repositories. The model includes an ontology-based scheme for the
annotation of documents, and a retrieval model based on an
adaptation of the classic vector-space model [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Another
information retrieval system based on ontologies is presented
in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. The authors propose an information retrieval system
which has landmark information database that has
hierarchical structures and semantic meanings of the features and
1http://muchmore.dfki.de
characteristics of the landmarks.
      </p>
      <p>
        The implementation of ontology models has been also
investigated by using fuzzy models [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>In IR, the user's input queries usually are not detailed
enough, so the satisfactory query results can not be brought
back. Query expansion of IR can help to solve this problem.
However, the common query expansion in IR cannot get
steady retrieval results. Ontologies play a key role in query
expansion research. A common use of ontologies in query
expansion is to enrich the resources with some well-de¯ned
meaning to enhance the search capabilities of existing web
searching systems.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] the authors propose and implement query
expansion method which combines domain ontology with the
frequency of terms. Ontology is used to describe domain
knowledge; logic reasoner and the frequency of terms are used to
choose ¯tting expansion words. This way, higher recall and
precision can be gotten as user's query results.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] the authors present an approach to expand queries
that consists in searching terms from the topic query in an
ontology in order to add similar terms.
3.
      </p>
    </sec>
    <sec id="sec-3">
      <title>PRELIMINARIES</title>
      <p>The roadmap to prove the viability of a concept-based
representation of documents and queries consists in two main
tasks:
- to choose a method that permits to represent all
documents terms by using the same set of concepts;
- to implement an approach that permits to index and to
evaluate each concept, in both documents and queries,
with the appropriate weight.</p>
      <p>To represent documents, the method described in
Section 4 has been used, combined with the use of the WordNet
machine-readable dictionary. From the WordNet database,
the set of terms that do not have hyponymy has been
extracted, each term is named \base concept". A vector, named
\base vector", has been created and, to each component of
the vector, a base concept has been assigned. This way, each
term is represented by using the base vector of the WordNet
ontology.</p>
      <p>The representation described above has been implemented
on top of the Apache Lucene open-source API. 2</p>
      <p>In the pre-indexing phase, each document has been
converted in its ontological representation. After the
calculation of the importance of each concept in a document, only
concepts with a degree of importance higher than a ¯xed
cut-value have been maintained, while the others have been
discarded. The cut-value used in these experiments is 0.01.
This choice has a drawback, namely that an approximation
of representing information is introduced due to the discard
of some minor concepts. However, we have experimentally
veri¯ed that this approximation does not a®ect the ¯nal
results.</p>
      <p>During the evaluation activity, queries have been also
converted into the ontological representation. This way, weights
have to be assigned to each concept to evaluate all concepts
with the right proportion. One of the features of Lucene is
the possibility of assigning a payload to each term of the
2See URL http://lucene.apache.org/.
query. Therefore, to each element present in the
conceptbased representation of the query, its concept weight has
been used as boost value.
4.</p>
    </sec>
    <sec id="sec-4">
      <title>DOCUMENT REPRESENTATION</title>
      <p>Conventional IR approaches represent documents as
vectors of term weights. Such representations use a vector with
one component for every signi¯cant term that occurs in the
document. This has several limitations, for example:
1. di®erent vector positions may be allocated to the
synonyms of the same term; this way there is an
information loss because the importance of a determinate
concept is distributed among di®erent vector
components;
2. the size of a document vector have to be at least equal
to the total number of words of the language used to
write the document;
3. every time a new set of terms is introduced (which is a
high-probability event), all document vectors must be
reconstructed; the size of a repository thus grows not
only as a function of the number of documents that
it contains, but also of the size of the representation
vectors.</p>
      <p>
        To overcome these weaknesses of term-based representations,
an ontology-based representation has been used [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        An ontology-based representation has been recently
proposed in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] which exploits the hierarchical is-a relation
among concepts, i.e., the meanings of words. For example,
to describe with a term-based representation documents
containing the three words: \animal", \dog", and \cat" a vector
of three elements is needed; with an ontology-based
representation, since \animal" subsumes both \dog" and \cat", it
is possible to use a vector with only two elements, related to
the \dog" and \cat" concepts, that can also implicitly
contain the information given by the presence of the \animal"
concept. Moreover, by de¯ning an ontology base, which is a
set of independent concepts that covers the whole ontology,
an ontology-based representation allows the system to use
¯xed-size document vectors, consisting of one component
per base concept.
      </p>
      <p>
        Calculating term importance is a signi¯cant and
fundamental aspect for representing documents in conventional
information retrieval approaches. It is usually determined
through term frequency-inverse document frequency
(TFIDF). When using an ontology-based representation, such
usual de¯nition of term-frequency cannot be applied because
one does not operate by keywords, but by concepts. This
is the reason why it has been adopted the document
representation based on concepts proposed in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], which is a
concept-based adaptation of TF-IDF.
      </p>
      <p>
        In this paper, an adaptation of the approach proposed in
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is presented. The original approach was proposed for
domain speci¯c ontologies and does not always consider all
the possible concepts in the considered ontology, in the sense
that it assumes a cut at a given speci¯city level. Instead,
the proposed approach has been adapted for more general
purpose ontologies and it takes into account all independent
concepts contained in the considered ontology. This way,
information associated to each concept is more precise and
the problem of choosing the suitable level to apply the cut
is overcome.
      </p>
      <p>The quantity of information given by the presence of
concept z in a document depends on the depth of z in the
ontology graph, on how many times it appears in the document,
and how many times it occurs in the whole document
repository. These two frequencies also depend on the number of
concepts which subsume or are subsumed by z. Let us
consider a concept x which is a descendant of another concept y
which has q children including x. Concept y is a descendant
of a concept z which has k children including y. Concept
x is a leaf of the graph representing the used ontology. For
instance, considering a document containing only \xy", the
occurrence of x in the document is 1 + (1=q). In the
document \xyz", the occurrence of x is 1 + (1=q(1 + 1=k)). As
it is possible to see, the number of occurrences of a leaf is
proportional to the number of children which all of its
ancestors have. Explicit and implicit concepts are taken into
account by using the following formulas:
N (c) = occ(c) +</p>
      <p>X
depth(c)</p>
      <p>X
c2Path(c;:::;&gt;) i=2</p>
      <p>occ(ci)
Qij=2 jjchildren(cj )jj</p>
      <p>;
(1)
where N (c) is the number of occurrences, both explicit
and implicit, of concept c and occ(c) is the number of
lexicalizations of c occurring in the document. The value N (c)
is the weight associated with the concept c.</p>
      <p>Given the ontology base I = b1; : : : ; bn, where the bis are
the base concepts, the quantity of information, info(bi),
pertaining to base concept bi in a document is:
info(bi) = Ndoc(bi) ;</p>
      <p>Nrep(bi)
(2)
where Ndoc(bi) is the number of explicit and implicit
occurrences of bi in the document, and Nrep(bi) is the total
number of its explicit and implicit occurrences in the whole
document repository. This way, every component of the
representation vector gives a value of the importance relation
between a document and the relevant base concept.</p>
      <p>A concrete example can be explained starting from the
light ontology represented in Figures 1 and 2, and by
considering a document D1 containing concepts \xxyyyz".</p>
      <p>In this case the ontology base is:</p>
      <p>I = fa; b; c; d; xg
and, for each concept in the ontology, the vectors Ndoc
are:
z = (0:25; 0:25; 0:25; 0:125; 0:125)
a = (1:0; 0:0; 0:0; 0:0; 0:0)
b = (0:0; 1:0; 0:0; 0:0; 0:0)
c = (0:0; 0:0; 1:0; 0:0; 0:0)
y = (0:0; 0:0; 0:0; 0:5; 0:5)
d = (0:0; 0:0; 0:0; 1:0; 0:0)
x = (0:0; 0:0; 0:0; 0:0; 1:0) ,
so the document vector associated to D1 is:
D1 = (2¤x¹)+(3¤y¹)+z¹ = (0:25; 0:25; 0:25; 1:625; 3:625): (3)
In Section 5, a comparison between the proposed
representation and other two classic concept-based representation
is discussed.
5.</p>
    </sec>
    <sec id="sec-5">
      <title>REPRESENTATION COMPARISON</title>
      <p>
        In Section 4 the approach used to represent information
was described. This section shows the improvements
obtained by applying the proposed approach and it illustrates
a comparison between the proposed approach and other two
approaches commonly used in conceptual document
representation. The expansion technique is generally used to
enrich information content of queries. However, in the last
years some authors applied the expansion technique also to
represent documents [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Like in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we propose an
approach that uses WordNet to extract concepts from terms.
      </p>
      <p>The two main improvements obtained by the application
of the ontology-based approach are illustrated below.</p>
      <sec id="sec-5-1">
        <title>Information Redundancy.</title>
        <p>Approaches that apply the expansion of documents and
queries, use correlated concepts to expand the original terms
of documents and queries. A problem with expansion is
that information is redundant and there is not a real
improvement of the representation of the document (or query)
content. With the proposed representation this redundancy
is eliminated because only independent concepts are taken
into account to represent documents and queries. Another
positive aspect is that the size of the vector representing
document content by using concepts is generally lower than the
size of the vector representing document content by using
terms.</p>
        <p>
          An example of technique that shows this drawback is
presented in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. In this work the authors propose an indexing
technique that takes into account WordNet synsets instead
of terms. For each term in documents, the synsets
associated to that terms are extracted and then used as token
for the indexing task. This way, the computational time
needed to perform a query is not increased, however, there is
a signi¯cant overlap of information because di®erent synsets
might be semantically correlated. An example is given by
the terms \animal" and \pet", these terms have two di®erent
synsets, however, observing the WordNet lattice, the term
\pet" is linked with an \is-a" relation with the term \animal".
Therefore, in a scenario in which a document contains both
terms, the same conceptual information is repeated. This is
clear because, even if the terms \animal" and \pet" are not
represented by using the same synset, they are semantically
correlated because \pet" is a sub-concept of \animal". This
way, when a document contains both terms, the presence of
the term \animal" has to contribute to the importance of the
concept \pet" instead of to be represented with a di®erent
token.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Computational Time.</title>
        <p>
          When IR approaches are applied in a real-world
environment, the computational time needed to evaluate the match
between documents and the submitted query has to be
considered. It is known that systems using the vector space
model have higher e±ciency. Conceptual-based approaches,
such as the one presented in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], generally implement a
nonvectorial data structure which needs a higher computational
time with respect to a vector space model representation.
The approach proposed in this paper overcomes this issue
because the document content is represented by using a
vector and therefore, the computational time needed to
compute document score is comparable to the computational
time needed by using the vector space model.
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>EXPERIMENTS</title>
      <p>
        In this section, the impact of the ontology document and
query representation is evaluated. The evaluation method
follows the TREC protocol [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. For each query the ¯rst
1000 retrieved documents have been considered and the
precision of the system has been calculated at di®erent points:
5, 10, 15, and 30 documents retrieved. Moreover, the
precision/recall graph has been calculated
      </p>
      <p>The experimental campaign has been performed by
using the MuchMore collection that consists of 7823 abstracts
of medical papers and 25 queries with their relevance
judgments. One of the particular features of this collection is
that there are a lot of medical terms. This way, a term-based
representation is more advantaged with respect to semantic
representation, because speci¯c terms present in documents
(for example \Arthroscopic") are very discriminant. Indeed,
by using a semantic expansion some problems may occur
because, generally, the MRD and thesaurus used to expand
terms do not contain some domain-speci¯c terms.</p>
      <p>
        The precision/recall graph showed in Figure 3 illustrates
the comparison between the proposed approach (gray curve
with circle marks), the classical term-based representation
(black curve), and the synset representation method [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]
(light gray curve with square marks). As expected, for all
recall values, the proposed approach obtained better results
than the term-based representation. The best gain of the
concept-based representation is at recall levels 0:0, 0:2, and
0:4. While for recall values between 0:6 and 1:0, the
conceptbased precision curve lies with the other two curves.
      </p>
      <p>A possible explanation for this scenario is that for
documents that are well related to a particular topic the adopted</p>
      <p>Recall
Term-Based</p>
      <p>Synsets</p>
      <p>Onto-Based
ontology representation is able to improve the representation
of the documents contents. However, for documents that are
partially related to a topic or that contains many
ambiguous terms, the proposed approach is not able to maintain
an high precision of the results. At the end of this section
some improvements that may be responsible of this fact are
discussed.</p>
      <p>In Table 1 the three di®erent representations are compared
for the Precision@X and MAP values. The results show
that the proposed approach obtains better results for the all
precision levels and also for the MAP value.</p>
      <p>Systems</p>
      <p>
        P5
Term-Based 0.544
Synset-Indexing [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] 0.648
Concept-Based 0.744
This happens because verbs and adjectives are
structured in a di®erent way than nouns. The hyperonymy
and hyponymy relations (that make MRD comparable
with ontologies) are not de¯ned for verbs and
adjectives, therefore another approach will be studied and
implemented to overcome this drawback.
- Term ambiguity: the concept-based representation has the
problem of introducing an error given by not using a
word sense disambiguation algorithm. Using such a
method, concepts associated to incorrect senses would
be discarded or weighted less. Therefore, the
conceptbased representation of each word would be ¯ner, with
the consequence of representing the information
contained in a document with more precision.
      </p>
      <p>Improving the actual model with the above features, would
certainly yield signi¯cantly better results in the next
experiments campaign. This positive view is motivated by the fact
that, in spite of these issues, the preliminary goal of
outperforming the precision of the term-based representation has
been accomplished.</p>
    </sec>
    <sec id="sec-7">
      <title>CONCLUSION</title>
      <p>In this paper we have discussed an approach to index
documents and to represent queries for information retrieval
purposes which exploits a conceptual representation based
on ontologies.</p>
      <p>Experiments have been performed on the MuchMore
Collection to validate the approach with respect to problems
like term-synonymity in documents.</p>
      <p>
        Preliminary experimental results show that the proposed
representation improves the ranking of the documents.
Investigation on results highlights that further improvement
could be obtained by integrating WSD techniques like the
one discussed in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] to avoid the error introduced by
considering incorrect word senses, and with a better usage and
interpretation of WordNet to overcome the loss of
information caused by the absence of proper nouns, verbs, and
adjectives.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Azzini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dragoni</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>da Costa Pereira, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Tettamanzi</surname>
          </string-name>
          .
          <article-title>Evolving neural networks for word sense disambiguation</article-title>
          .
          <source>In Proc. of HIS '08</source>
          ,
          <string-name>
            <surname>Barcelona</surname>
          </string-name>
          , Spain,
          <source>September 10-12</source>
          , pages
          <fpage>332</fpage>
          {
          <fpage>337</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Baziz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Boughanem</surname>
          </string-name>
          , G. Pasi, and
          <string-name>
            <given-names>H.</given-names>
            <surname>Prade</surname>
          </string-name>
          .
          <article-title>An information retrieval driven by ontology: from query to document expansion</article-title>
          . In D. Evans,
          <string-name>
            <given-names>S.</given-names>
            <surname>Furui</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Soul¶</surname>
          </string-name>
          e-Dupuy, editors,
          <source>RIAO. CID</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B.</given-names>
            <surname>Billerbeck</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zobel</surname>
          </string-name>
          .
          <article-title>Techniques for e±cient query expansion</article-title>
          . In A. Apostolico and M. Melucci, editors,
          <source>SPIRE</source>
          , volume
          <volume>3246</volume>
          of Lecture Notes in Computer Science, pages
          <volume>30</volume>
          {
          <fpage>42</fpage>
          . Springer,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Boughanem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Dkaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mothe</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Soul¶</surname>
          </string-name>
          e-Dupuy.
          <article-title>Mercure at trec7</article-title>
          .
          <source>In TREC</source>
          , pages
          <volume>355</volume>
          {
          <fpage>360</fpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Cai</surname>
          </string-name>
          , C. van
          <string-name>
            <surname>Rijsbergen</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Jose</surname>
          </string-name>
          .
          <article-title>Automatic query expansion based on divergence</article-title>
          .
          <source>In CIKM</source>
          , pages
          <volume>419</volume>
          {
          <fpage>426</fpage>
          . ACM,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Calegari</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Sanchez</surname>
          </string-name>
          .
          <article-title>A fuzzy ontology-approach to improve semantic information retrieval</article-title>
          . In F. Bobillo, P. da
          <string-name>
            <surname>Costa</surname>
            , C. d'Amato,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Fanizzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Fung</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Lukasiewicz</surname>
            , T. Martin,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Nickles</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Peng</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Pool</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Smrz</surname>
          </string-name>
          , and P. Vojt¶as, editors,
          <source>URSW</source>
          , volume
          <volume>327</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          , M. Ferna¶ndez, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Vallet</surname>
          </string-name>
          .
          <article-title>An adaptation of the vector-space model for ontology-based information retrieval</article-title>
          .
          <source>IEEE Trans. Knowl</source>
          . Data Eng.,
          <volume>19</volume>
          (
          <issue>2</issue>
          ):
          <volume>261</volume>
          {
          <fpage>272</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Cronen-Townsend</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <article-title>A framework for selective query expansion</article-title>
          . In D. Grossman,
          <string-name>
            <given-names>L.</given-names>
            <surname>Gravano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Herzog</surname>
          </string-name>
          , and D. Evans, editors,
          <source>CIKM</source>
          , pages
          <volume>236</volume>
          {
          <fpage>237</fpage>
          . ACM,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C. da Costa</given-names>
            <surname>Pereira</surname>
          </string-name>
          and
          <string-name>
            <given-names>A. G. B.</given-names>
            <surname>Tettamanzi</surname>
          </string-name>
          .
          <article-title>Soft computing in ontologies and semantic Web, chapter An ontology-based method for user model acquisition</article-title>
          , pages
          <volume>211</volume>
          {
          <fpage>227</fpage>
          .
          <article-title>Studies in fuzziness and soft computing</article-title>
          . Ed. Zongmin Ma, Springer, Berlin,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>M. D¶</surname>
          </string-name>
          <article-title>³az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. G.</given-names>
          </string-name>
          <string-name>
            <surname>Cumbreras</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Mart¶³n-</article-title>
          <string-name>
            <surname>Valdivia</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          <article-title>R¶aez</article-title>
          , and L.
          <string-name>
            <surname>Uren</surname>
          </string-name>
          <article-title>~a-Lo¶pez. Integrating mesh ontology to improve medical information retrieval</article-title>
          .
          <source>In CLEF</source>
          , volume
          <volume>5152</volume>
          of Lecture Notes in Computer Science, pages
          <volume>601</volume>
          {
          <fpage>606</fpage>
          . Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>O.</given-names>
            <surname>Dridi</surname>
          </string-name>
          .
          <article-title>Ontology-based information retrieval: Overview and new proposition</article-title>
          . In O.
          <string-name>
            <surname>Pastor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Flory</surname>
          </string-name>
          , and J.-L. Cavarero, editors,
          <source>RCIS</source>
          , pages
          <volume>421</volume>
          {
          <fpage>426</fpage>
          . IEEE,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>E.</given-names>
            <surname>Efthimiadis</surname>
          </string-name>
          .
          <article-title>Query expansion</article-title>
          . In M. Williams, editor,
          <source>Annual review of information science and technology</source>
          , pages Vol.
          <volume>31</volume>
          , pp. 121
          <string-name>
            <given-names>U</given-names>
            }187. Information Today Inc,
            <surname>Medford</surname>
          </string-name>
          <string-name>
            <surname>NJ</surname>
          </string-name>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gonzalo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Verdejo</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Chugur</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Cigarra</surname>
          </string-name>
          <article-title>¶n. Indexing with wordnet synsets can improve text retrieval</article-title>
          .
          <source>CoRR, cmp-lg/9808002</source>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hattori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hiramatsu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Okadome</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Parsia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Sirin</surname>
          </string-name>
          .
          <article-title>Ichigen-san: An ontology-based information retrieval system</article-title>
          . In X. Zhou,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kitsuregawa</surname>
          </string-name>
          , and Y. Zhang, editors,
          <source>APWeb</source>
          , volume
          <volume>3841</volume>
          of Lecture Notes in Computer Science, pages
          <volume>1197</volume>
          {
          <fpage>1200</fpage>
          . Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>G.</given-names>
            <surname>Salton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Wong</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Yang</surname>
          </string-name>
          .
          <article-title>A vector space model for automatic indexing</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>18</volume>
          (
          <issue>11</issue>
          ):
          <volume>613</volume>
          {
          <fpage>620</fpage>
          ,
          <year>1975</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>S. Tomassen.</surname>
          </string-name>
          <article-title>Research on ontology-driven information retrieval</article-title>
          . In R. Meersman,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tari</surname>
          </string-name>
          , and P. Herrero, editors,
          <source>OTM Workshops (2)</source>
          , volume
          <volume>4278</volume>
          of Lecture Notes in Computer Science, pages
          <volume>1460</volume>
          {
          <fpage>1468</fpage>
          . Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>E.</given-names>
            <surname>Voorhees</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Harman</surname>
          </string-name>
          .
          <article-title>Overview of the sixth text retrieval conference (trec-6)</article-title>
          .
          <source>In TREC</source>
          , pages
          <volume>1</volume>
          {
          <fpage>24</fpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>F.</given-names>
            <surname>Wu</surname>
          </string-name>
          , G. Wu, and
          <string-name>
            <given-names>X.</given-names>
            <surname>Fu</surname>
          </string-name>
          .
          <article-title>Design and implementation of ontology-based query expansion for information retrieval</article-title>
          . In L.
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Tjoa</surname>
          </string-name>
          , and S. Chaudhry, editors,
          <source>CONFENIS (1)</source>
          , volume
          <volume>254</volume>
          <source>of IFIP</source>
          , pages
          <volume>293</volume>
          {
          <fpage>298</fpage>
          . Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <article-title>Query expansion using local and global document analysis</article-title>
          . In H.-P. Frei,
          <string-name>
            <given-names>D.</given-names>
            <surname>Harman</surname>
          </string-name>
          , P. SchaÄuble, and R. Wilkinson, editors,
          <source>SIGIR</source>
          , pages
          <volume>4</volume>
          {
          <fpage>11</fpage>
          . ACM,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>