<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using Multiple Related Ontologies in a Fuzzy Information Retrieval Model</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maria Angelica A. Leite</string-name>
          <email>angelica@cnptia.embrapa.br</email>
          <email>leite@dca.fee.unicamp.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ivan L. M. Ricarte</string-name>
          <email>ricarte@dca.fee.unicamp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Embrapa Agriculture</institution>
          <addr-line>Informatics PO Box: 6041 - ZIP: 13083-970 - Campinas - SP -</addr-line>
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Electrical and Computer Engineering, University of Campinas</institution>
          <addr-line>PO Box 6101, Postal Code: 13083-970 - Campinas, SP</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>With the Semantic Web progress many independently developed distinct domain ontologies have to be shared and reused by a variety of applications. The use of ontologies in information retrieval applications allows the retrieval of semantically related documents to an initial users' query. This work presents a fuzzy information retrieval model for improving the document retrieval process considering a knowledge base composed of multiple domain ontologies that are fuzzy related. Each ontology can be represented independently as well as their relationships. This knowledge organization is used in a novel method to expand the user initial query and to index the documents in the collection. Experimental results show that the proposed model presents better overall performance when compared with another fuzzy-based approach for information retrieval.</p>
      </abstract>
      <kwd-group>
        <kwd>Fuzzy information retrieval</kwd>
        <kwd>ontology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        With the grown availability of information many research has been made in order
to provide intelligent ways to easy the information access. One point is to treat
not only the lexical information features but also to consider its semantics, that
is, the meaning attached to it. Within this approach the usage of ontologies to
organize the knowledge and to express semantic meaning has gaining attention.
An information retrieval system stores and index documents such that when
users express their information need in a query the system retrieves the related
documents associating a score to each one. The higher the score the greater the
document relevance [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Usually documents are retrieved when they contain the
index terms specified in the queries. However, this approach will neglect other
relevant documents that do not contain the index terms specified in the user’s
queries.
      </p>
      <p>
        When working with specific domain knowledge this problem can be overcome
by incorporating a knowledge base which depicts the relationships between index
terms into the existing information retrieval systems. Knowledge bases can be
manually developed by domain experts or automatically constructed from the
knowledge in the document collection [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. To deal with vagueness typical of
human knowledge, the fuzzy set theory can be used to manipulate the knowledge
in the bases. It deals with the uncertainty that may be present in document and
query representations as well as in their relationships. Knowledge bases in
information retrieval cover a wide range of topics of which query expansion is one. A
recent approach is to use ontologies to infer new terms to be added to the queries
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Usually information retrieval systems use just one conceptual structure to
model the knowledge and compose the knowledge base. But the knowledge
indexing a document collection can be expressed in multiple distinct domains. In
some contexts these domains concepts are related by causal, spatial or similarity
relationships. Each domain can be represented as a conceptual structure like
a lightweight ontology. Lightweight ontologies include concepts, concepts
taxonomies, relationships between concepts and properties that describes concepts
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The relationships between domain’s concepts can be translated to
relationships between the lightweight ontologies concepts producing a knowledge base
composed of multiple related lightweight ontologies. Consider the territorial
division and the climate domains. These are distinct domains but the relation of
a territorial division and a climate classification can be done through
observation of geographic and climatic maps. The geographic domains are, in general,
organized in an hierarchical way and can be represented by domain ontologies.
Figure 1 shows the Ko¨ppen climate3 distribution over brazilian territory4 and
Fig. 2 shows the ontologies referring to brazilian territory and climate domains
respectively. The idea is to relate the ontologies by establishing fuzzy
relationships between territory and climate concepts based on spatial distribution in the
map. The dashed lines illustrates this kind of relationship.
      </p>
      <p>
        We present an information retrieval model which is supported by fuzzy
related lightweight ontologies each one representing a distinct knowledge domain.
The model provides means to represent each ontology independently as well as
their relationships. This way existent ontologies can be reused in the model.
Based on the knowledge from the ontologies the system carries an automatic
fuzzy query expansion. The documents are indexed by the concepts in the
ontologies allowing the retrieval by their meaning. The documents do not need to be
indexed by each ontology concepts as in a faceted approach. Given a query with
concepts from an initial domain new semantically related documents indexed
by other domain ontologies can be retrieved based on the ontologies
relationships. The results obtained with the proposed information retrieval model are
compared with the results obtained using just the user’s entered keywords and
with the results obtained by another fuzzy information retrieval system [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ].
The proposed expansion method is also employed in expanding queries for the
3 http://en.wikipedia.org/wiki/Koppen climate classification
4 http://campeche.inf.furb.br/sisga/educacao/ensino/mapaClima.php
Apache Lucene [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] search engine. The results show an enhance in precision for
the same recall measures. The ontologies are considered as crisp ones where
relationships between their own concepts assume values in the set {0, 1} denoting the
existence (1) or absence (0) of the relationship between them. The relationships
between distinct ontologies concepts are calculated as fuzzy ones.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Works Based on Knowledge</title>
      <p>
        Some information retrieval models that encode knowledge among terms in
order to improve performance are presented. In context sensitive semantic query
expansion [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] a semantic encyclopedia is used as a means to provide semantics
to user’s query. The user queries and the index entries are represented with
the use of semantic entities, defined in the encyclopedia. The query expansion
process takes into account the query context, which is a fuzzy set of semantic
entities. Simulation examples show that the expansion method successfully
performed in direction the query context specifies. By combining lexico-syntactic
and statistical learning approaches a fuzzy domain ontology mining algorithm is
proposed for supporting ontology engineering [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The work uses one ontology
as knowledge base and presents studies confirming that the use of a fuzzy
domain ontology leads to significant improvement in information retrieval. By using
a geographical ontology the proposed information retrieval system performs a
query expansion of queries with geographical context. A query parser captures
geonames and spatial relationships, and maps geographical features and feature
types into concepts of a geographical ontology [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>
        The multi-relationship fuzzy concept network information retrieval model [
        <xref ref-type="bibr" rid="ref6 ref7">6,
7</xref>
        ] considers the knowledge encoded as a fuzzy conceptual network. In the network
each node can be related to another one by three relation types Vr : C×C → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ]
where C is the concept set and r ∈ {P, G, S} denotes the fuzzy positive
association (P) , fuzzy generalization association (G) and fuzzy specialization (S)
association. These relations are constructed automatically based on word
cooccurrence at syntactic level in the documents. The implicit relationships
between concepts are inferred calculating the transitive closure for the relations
∗
resulting relations Vr . The documents are associated to concepts by a fuzzy
relation U : D × C → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ] where D is the document set. Using the transitive
closure relations Vr∗ the system infers new concepts to be associated to
documents resulting the expanded document descriptor relations Ur∗ = U ⊗ Vr . The
∗
query q is composed with concepts from the concept network. When a query is
executed the system calculates the degree of satisfaction, DSr (di), that
document di ∈ D satisfies the user’s query q using the expanded document descriptor
relations. The degree of satisfaction that a document satisfies the user’s query by
different fuzzy relations are aggregated to obtain the overall satisfaction for the
query. The aggregation assigns a score to each document and they are presented
in decreasing order to user.
      </p>
      <p>
        The fuzzy ontological relational model [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] considers a knowledge base as a
fuzzy ontology with concepts representing the categories and the keywords of a
domain. When the user enters a query, composed of concepts, the system
performs its expansion and may add new concepts based on the ontology knowledge.
After expansion the similarity between the query and the documents is
calculated by fuzzy operations. In general, works use just one conceptual structure to
encode the knowledge as the ones presented is this section. The proposed model
allows the knowledge to be expressed in distinct but related lightweight
ontologies and offers a way to represent each ontology independently as well as their
relationships. This knowledge organization is then employed in a novel method
to expand queries.
3
3.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Information Retrieval model</title>
      <sec id="sec-3-1">
        <title>Knowledge, Document and Query Representations</title>
        <p>
          An ontology is a concept set Dk = {ck1, ck2, · · · cky} where 1 ≤ k ≤ K, K is
the domains number and y = kDkk is the concepts number in each domain. The
concepts inside an ontology are organized as a taxonomy and are related by fuzzy
specialization association (S) and fuzzy generalization association (G). The fuzzy
generalization association is the inverse of the fuzzy specialization association.
A concept is regarded as a fuzzy generalization of another concept if it consists
of that concept or it includes that concept in a partitive sense. A concept is
regarded as a fuzzy specialization of another concept if it is part of that concept
or it is a kind of that concept. Concepts pertaining to distinct ontologies are
related by fuzzy positive association (P). The fuzzy positive association denotes
concepts related by a spatial, causal or similarity relation in some contexts.
Definition 1. Consider two distinct concept domains sets Di and Dj .
1. Fuzzy positive association is a fuzzy relation: (RPij : Di × Dj → [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ]) not
symmetric, not reflexive and not transitive.
2. Fuzzy generalization association is a fuzzy relation: (RGi : Di × Di → [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ])
not symmetric, not reflexive and transitive.
3. Fuzzy specialization association is a fuzzy relation: (RSi : Di × Di → [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ])
not symmetric, not reflexive and transitive.
        </p>
        <p>The implicit relationships between concepts from the same domain are given
by the transitive closure of the fuzzy generalization and fuzzy specialization
associations. The transitive closure of the associations RiG and RiS where 1 ≤
i ≤ K, results the relations RGi and R∗Si respectively.</p>
        <p>
          ∗
Definition 2. The transitive closure R∗ of a fuzzy relation R can be determined
by an iterative algorithm that consists of the following steps:
1. Compute R´= R ∪ [wet(R ◦ R)] where wet ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ], t ∈ {G, S};
2. If R´ 6= R, rename R = R´ and go to step 1; otherwise R∗ = R´ and the
algorithm terminates.
        </p>
        <p>
          The (R◦R) means the composition between two fuzzy relations. The composition
between two fuzzy relations [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] P : X × Y and Q : Y × Z is the fuzzy relation
R : X × Z as in (1).
        </p>
        <p>R(x, z) = (P ◦ Q)(x, z) = my∈aYx min [P(x, y), Q(y, z)] .
(1)</p>
        <p>The weight wet, with empirical values 0 &lt; wet &lt; 1, penalizes the association
strength between distant concepts in the ontology. As the distance between
concepts increase their association values decrease. Concepts with higher strength
value are considered to have stronger meaning association. In order to discard
concepts associations with lower strength value a boundary b establishes the
minimum value such that the corresponding association is to be considered.</p>
        <p>
          The documents dl are represented by the DOC set where 1 ≤ l ≤ kDOCk. A
fuzzy relation Uj (dl, cjy) = uly ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ], where 1 ≤ l ≤ kDOCk, 1 ≤ j ≤ K and
1 ≤ y ≤ kDj k indicates the association degree between the concept cjy ∈ Dj
and the document dl ∈ DOC. The relations Uj , 1 ≤ j ≤ K are represented
as matrices p × m where p = kDOCk and m = kDj k. The Uj fuzzy relation
indicates the relevance of the concept to represent the document content. Its
values are calculated following a tf-idf schema [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>A query is expressed with concepts from distinct domains connected by
logical operators. The query is transformed into the conjunctive normal form and
is represented by sub-queries connected by the AND logical operator. Each
subquery is composed by a set of concepts connected by the OR logical operator.
Given the domains D1 = {c11, c12, c13} and D2 = {c21, c22, c23, c24} a valid
query in this format would be q = (c11 ∨ c22) ∧ (c13 ∨ c24). Once the query is
in the conjunctive normal form each sub-query is performed independently and
retrieves a document set. The intersection of the document sets is the final result
of the query. Therefore, in the sequence, only aspects related to the sub-query are
presented. The documents are associated to the domain concepts using distinct
relations. To consider this the sub-queries are partitioned to take the concepts
from each domain separately. Each partition is a set with dimension equal to the
associated domain concepts number and is composed by values that indicates
the presence (1) or absence (0) of the concept in the query. A sub-query q is
partitioned in qi sets where 1 ≤ i ≤ K and K is the domains number. In the
previous example the sub-query q = (c11 ∨ c22) is partitioned as q1 = [1 0 0]
and q2 = [0 1 0 0].
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Query Expansion</title>
        <p>Query expansion is performed in two phases. In the first one each partition qi,
from the initial sub-query q, is expanded to consider the relations between the
domain Di associated to the partition and the other domains from the
knowledge base. For each partition qi new K − 1 sets are generated each one containing
concepts from the others domains Dj , j 6= i, 1 ≤ i, j ≤ K associated to
concepts present in qi. This process generates a new expanded query denoted qent.
The first expansion is translated in (2). The variable i refers to the domain of
the partition qi and the variable j refers to the remaining domains from the
knowledge base.</p>
        <p>K K
qent = [ [
i=1 j=1
qi P j = i
wP qi ◦ Rij j 6= i
.</p>
        <p>(2)</p>
        <p>
          To expand the query to consider other domains the fuzzy positive association
RiPj between concepts from the domains Di and Dj is used. The model allows
to associate a weight wP ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ], that defines the influence the fuzzy positive
association will have in the expansion. Each expansion generates a new set with
domain Dj concepts. The values in the sets denote the degree the concepts from
the domain Dj are related to the concepts from the partition qi. After expansion
among domains the second phase is performed. This phase expands the sub-query
qent considering the knowledge inside the ontologies. This expansion generates
the final transposed expanded query qexpT. Equation (3) presents the expansion.
The association type is given by r ∈ {S, G}.
        </p>
        <p>qentiTj</p>
        <p>K K 
qexpT = [ [ max  wrwrRrRj r∗◦j q◦eqnetniTjtiTj j = i .</p>
        <p>i=1 j=1 ∗ j 6= i</p>
        <p>
          Considering the knowledge inside the domains each transposed partition
qentiTj, 1 ≤ i, j ≤ K is expanded to take into account the fuzzy generalization
and fuzzy specialization associations between the concepts from their domain
Dj. The model allows to associate a value wr ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ], r ∈ {S, G} that defines
a weight to the association type. This way the expansion can be adjusted to
consider more one association type than the other.
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Documents Relevance</title>
        <p>The documents relevance is given by the similarity function between the
documents representation and the expanded fuzzy sub-query qexpT. The similarity is
calculated by the product between the relations Uj with each partition qexpiTj,
as in (4), resulting the retrieved documents set V.</p>
        <p>K K
V = [ [
i=1 j=1</p>
        <p>Uj × qexpiTj .</p>
        <p>(3)
(4)</p>
        <p>
          Each relation Uj associates the collection documents to the Dj domain
concepts, where 1 ≤ j ≤ K. The set qexpiTj represents the expansion of the concepts
from the partition qi to the Dj domain where 1 ≤ i, j ≤ K. It is constituted from
the Dj domain concepts and its values indicates the degree the concepts from
domain Dj are associated to the concepts in partition qi. The arithmetic
product Uj × qexpiTj indicates the documents associated to the Dj domain that are
related to the qi partition. The S symbol designates union and denotes the max
operator. The arithmetic product adjusts the associations of the documents to
the concepts (expressed in the relations Uj) by the strength of the relationships
between concepts present in qexpiTj. The V (vl) set represents all the documents
in the collection and its value vl ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ] indicates the degree a document dl,
1 ≤ l ≤ kDOCk is similar to the initial user query. Documents with V (vl) &gt; 0
are presented to the user in decreasing order.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Model Evaluation</title>
      <p>The model evaluation uses a document collection sample referring to the
agrometeorology domain in Brazil, a query set, a lightweight ontology referring to
the geographical brazilian territory and a lightweight ontology referring to the
climate distribution over the brazilian territory. Both ontologies are manually
constructed.
4.1</p>
      <sec id="sec-4-1">
        <title>The Ontologies Construction</title>
        <p>To construct the ontologies the brazilian map from Fig. 1 is considered. For
both ontologies the fuzzy generalization association and the fuzzy specialization
association relates the spatial relationship between the entities they refer to.
As ontologies are considered as crisp ones the fuzzy generalization and the fuzzy
specialization relationships assume values in the set {0, 1} denoting the existence
(1) or absence (0) of the relationship between concepts. The positive relationships
between distinct ontologies concepts are calculated as fuzzy ones.</p>
        <p>The first ontology refers to the brazilian territory, say domain D1, with
three levels. The root node is labeled ’Brazil’, the descendant nodes are
labeled with brazilian regions and each region node has the respective brazilian
state nodes as descendants. For the brazilian territory ontology the North
Region is part of Brazil country so the fuzzy specialization association value is</p>
        <p>S
R1 (Brazil, North Region) = 1.0. This means that Brazil concept is specialized
by North Region concept. As the fuzzy generalization association is the inverse of
G
the fuzzy specialization association then R1 (North Region, Brazil) = 1.0. This
means that North Region concept is generalized by Brazil concept. Figure 2
shows a sample of the brazilian territory ontology.</p>
        <p>The second ontology refers to the climate distribution over the brazilian
territory, say domain D2. The root node is labeled ’Climate’, the root descendant
nodes are labeled with brazilian zonal climates and each zonal climate has the
respective associated Ko¨ppen climate nodes as descendants. Figure 2 shows a
sample of the brazilian climate ontology.</p>
        <p>The relationship between ontologies is established by the distribution of
climate over brazilian territory as observed in the map. The relationship is settled
in two levels. The first one is between brazilian regions and zonal climates and
the second one is between brazilian states and Ko¨ppen climates. The dashed
lines in Fig. 2 illustrate both relationships levels. The first level of fuzzy positive
relationship is between brazilian regions and zonal climates. The value of
relationships is given by mapping scanning. For example, the tropical zonal climate
occurs in North Region. The amount of tropical climate in Brazil is 59, 811 pixels.
The amount of tropical climate in North Region is 30, 616 pixels. So the fuzzy
positive association between North Region and tropical climate is given by
relation value R1P2 (North Region, Tropical) = 0.51. This means that North
Region concept implies Tropical climate concept by fuzzy positive association with
strength 0.51. On the other hand the North Region extent is 43, 737 pixels so
the fuzzy positive association between tropical climate and North Region is
R2P1 (Tropical, North Region) = 0.70. This means that Tropical concept implies
North region concept by fuzzy positive association with strength value 0.70.</p>
        <p>The second level of fuzzy positive relationship is between brazilian states
and Ko¨ppen climates. The total amount of Cfb Ko¨ppen climate in Brazil is
1, 781 pixels. The amount of Cfb Ko¨ppen climate in Santa Catarina state is
693 pixels. So the fuzzy positive association between Santa Catarina State and
Cfb Ko¨ppen climate is R1P2 (Santa Catarina, Cfb) = 0.39. This means that Santa
Catarina concept implies Cfb Ko¨ppen climate concept by fuzzy positive
association with strength value equal to 0.39. On the other hand the Santa
Catarina state extent is 900 pixels so the fuzzy positive association between Cfb
Ko¨ppen climate and Santa Catarina state is R2P1 (Cfb, Santa Catarina) = 0.77.
This means that Cfb Ko¨ppen climate concept implies Santa Catarina state
concept by fuzzy positive association with strength value equal to 0.77. In the
information retrieval process if a user constructs a query composed with Cfb concept
the query expansion process, based on the R2P1 relation, points that Santa
Catarina concept is related with Cfb concept with strength 0.77 and this concept is
added to query. Considering that the document collection is about
agrometeorology domain this indicates that documents indexed with Santa Catarina concept
can be possibly a relevant answer, even if the documents are not indexed with the
Cfb concept itself. As Cfb ko¨ppen climate covers a fraction of 0.77 from Santa
Catarina state then documents related to Santa Catarina concept can contain
aspects related to Cfb Ko¨ppen climate even this concept itself is not present in
the documents.</p>
        <p>
          The Alignment Format Level 0 [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] allows to represent fuzzy relations
between lightweight ontologies concepts. It is used to represent the fuzzy positive
association between concepts from distinct ontologies. An extract from the fuzzy
        </p>
        <p>P
positive association R12 between the brazilian territory ontology, D1, and the
brazilian climate ontology, D2, is presented in the following.
The proposed model performance is compared with a similar approach, that is,
the multi-relationship fuzzy concept network information retrieval model
presented in Sect. 2. The experiment also tests the use of the query expansion
method in the Apache Lucene text search engine. The Apache Lucene allows
boosting a search concept increasing the relevance of documents indexed by the
concept.</p>
        <p>
          The document collection is composed of a sample of 128 documents selected
from a base of 17,780 documents from the agrometeorology domain. This sample
considers documents containing just one of the concepts from the ontologies as
well as a combination of concepts from both ontologies. The queries set contains
35 queries considering just one concept from each ontology or two concepts from
both ontologies connected with AND or OR boolean operators. For each query
the relevant documents from the sample document collection are selected by a
domain expert. Several experiments were ran considering many combinations
of the weights wet, t ∈ {S, G} and wr, r ∈ {S, G, P}. After many tests all the
models showed a behavior tendency concerning the precision and recall measures.
Recall is the fraction of the relevant documents which are retrieved over all
relevant documents in collection related to query q and precision is the fraction
of the retrieved documents which are relevant to a query q over all documents
in the answer set [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>As the addition of general concepts tends to add more noise in the search
results then a lower weight value is assigned to fuzzy generalization association
like wG = 0.3. A higher value is assigned to fuzzy specialization association like
wS = 0.7. Following the same reasoning for transitive closure calculation, the
tests showed that best results are achieved with lower values assigned to weight
weG and higher ones to weight weS like weG = 0.2 and weS = 0.8. The fuzzy
positive association is tested with four different weights like wP = 0.0, wP = 0.1,
wP = 0.5, and wP = 1.0.</p>
        <p>Figure 3 presents the performance results for the models showing precision
x recall curves. In these curves the best results occur when the precision values
maintains high as the recall values increases. This indicates that most of relevant
documents are retrieved and are presented in the top of the answer set. In the
performed experiment only the best result is recorded for each model to keep the
graphic understandable. The proposed model is represented by MO curves, the
multi-relationship fuzzy concept network model by the CN curve and the Apache
Lucene by LUC curves. In the curves legend, the KW means the use of just the
entered keywords (without performing query expansion) and the numbers refer to
the corresponding wP value when query expansion is considered. All the models
showed a behavior tendency considering the wP variations.</p>
        <p>As the proposed model and the Apache Lucene use the same query
expansion method their performances have the same tendency. When considering just
the user entered keywords the precision for lower recall values is high but it
decreases fast as the recall values increase. When query expansion is performed the
precision is high for low recall values and maintains around 45% for high recall
values. The fuzzy concept network model presents high precision values for low
recall values but as the recall values are higher it maintains the precision values
around 25%. Comparing the three models results, the proposed model exhibits
better performance (MO 0.1 curve) when knowledge is considered in the query
expansion process.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>This work presents an approach for improving document retrieval process
considering a knowledge base composed of multiple related domain ontologies. Contrary
to other approaches that consider the knowledge base composed of just one
ontology, the proposed model explores knowledge expressed in multiple ontologies
that, in some contexts, can be related to each other by causal, spatial or
similarity relationships. To deal with the uncertainty and vagueness present in the
knowledge the fuzzy set theory is used to express the relations between concepts
of distinct ontologies. This knowledge is used in a novel method to expand the
user query and to index the documents in the collection. Experimental results
show that the proposed model achieves better performance when compared with
other fuzzy information retrieval approach. When using the expansion method
with the Apache Lucene search engine the results are also improved.</p>
      <p>
        The knowledge organization and representation as ontologies is a growing
area. Many independent developed crisp ontologies, representing distinct
domains, are being proposed. The presented model offers an way where these
ontologies can be reused. Instead of developing one large ontology that encodes
multidisciplinary knowledge an alternative is to encode this knowledge as distinct
domain ontologies relating them in a next step. This allows distinct knowledge
groups to work in a independent way or to reuse already existent domain
ontologies. If the ontologies represent domains that can be related in some context
then just the positive fuzzy associations between the ontologies concepts have to
be constructed in order to reuse them in the model. An experiment considering
the ontologies themselves as fuzzy structures is presented in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ribeiro-Neto</surname>
            ,
            <given-names>B.A.</given-names>
          </string-name>
          :
          <article-title>Modern Information Retrieval</article-title>
          . ACM Press / Addison-Wesley (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ogawa</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morita</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobayashi</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A fuzzy document retrieval system using the keyword connection matrix and a learning method</article-title>
          .
          <source>Fuzzy Sets and Systems</source>
          <volume>39</volume>
          (
          <issue>2</issue>
          ) (
          <year>1991</year>
          )
          <fpage>163</fpage>
          -
          <lpage>179</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Widyantoro</surname>
            ,
            <given-names>D.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yen</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A fuzzy ontology-based abstract search engine and its user studies</article-title>
          .
          <source>In: Proceedings of the IEEE International Conference on Fuzzy Systems</source>
          , Washington, DC, USA, IEEE Computer Society (
          <year>2001</year>
          )
          <fpage>1291</fpage>
          -
          <lpage>1294</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bhogal</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Macfarlane</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>A review of ontology based query expansion</article-title>
          .
          <source>Information Processing and Management</source>
          <volume>43</volume>
          (
          <issue>4</issue>
          ) (
          <year>2007</year>
          )
          <fpage>866</fpage>
          -
          <lpage>886</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gomez-P´erez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>Fern´andez-</article-title>
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corcho</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          : Ontological Engineering. Springer-Verlag (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horng</surname>
            ,
            <given-names>Y.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>C.H.</given-names>
          </string-name>
          :
          <article-title>Fuzzy information retrieval based on multirelationship fuzzy concept networks</article-title>
          .
          <source>Fuzzy Sets and Systems</source>
          <volume>140</volume>
          (
          <issue>1</issue>
          ) (
          <year>2003</year>
          )
          <fpage>183</fpage>
          -
          <lpage>205</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Horng</surname>
            ,
            <given-names>Y.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>C.H.</given-names>
          </string-name>
          :
          <article-title>Automatically constructing multi-relationship fuzzy concept networks for document retrieval</article-title>
          .
          <source>Applied Artificial Intelligence</source>
          ,
          <volume>17</volume>
          (
          <issue>1</issue>
          ) (
          <year>2003</year>
          )
          <fpage>303</fpage>
          -
          <lpage>328</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>8. Apache: Lucene overview http://lucene.apache.org/java/docs/index.html.</mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Akrivas</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallace</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andreou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kollias</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Context-sensitive semantic query expansion</article-title>
          .
          <source>In: Proceedings of the 2002 IEEE International Conference on Artificial Intelligence Systems (ICAIS'02)</source>
          , Washington, DC, USA, IEEE Computer Society (
          <year>2002</year>
          )
          <fpage>109</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lau</surname>
            ,
            <given-names>R.Y.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Mining fuzzy domain ontology from textual databases</article-title>
          .
          <source>In: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence</source>
          , Washington, DC, USA, IEEE Computer Society (
          <year>2007</year>
          )
          <fpage>156</fpage>
          -
          <lpage>162</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Martins</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>M.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freitas</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Afonso</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          :
          <article-title>Handling locations in search engine queries</article-title>
          .
          <source>In: GIR</source>
          . (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Pereira</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ricarte</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomide</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Fuzzy relational ontological model in information search systems</article-title>
          . In: Elie Sanchez. (Org.).
          <source>Fuzzy Logic and The Semantic Web</source>
          , Amsterdan,
          <string-name>
            <surname>Elsevier B. V.</surname>
          </string-name>
          (
          <year>2006</year>
          )
          <fpage>395</fpage>
          -
          <lpage>412</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Pedrycz</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomide</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>An introduction to fuzzy sets : Analysis and Design</article-title>
          . MIT Press, Cambridge, Massachusetts (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>An api for ontology alignment</article-title>
          . In: International Semantic Web Conference. (
          <year>2004</year>
          )
          <fpage>698</fpage>
          -
          <lpage>712</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Leite</surname>
            ,
            <given-names>M.A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ricarte</surname>
            ,
            <given-names>I.L.M.:</given-names>
          </string-name>
          <article-title>A framework for information retrieval based on fuzzy relations and multiple ontologies</article-title>
          .
          <source>In: Iberamia 2008: Proceedings of the 11th Ibero-American Conference on Artificial Intelligence</source>
          , Berlin - Heidelberg, SpringerVerlag (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>