<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Storing and Querying Fuzzy Knowledge in the Semantic Web</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nick Simou</string-name>
          <email>nsimou@image.ntua.gr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giorgos Stoilos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vassilis Tzouvaras</string-name>
          <email>tzouvaras@image.ntua.gr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giorgos Stamou</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefanos Kollias</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Electrical and Computer Engineering, National Technical University of Athens</institution>
          ,
          <addr-line>Zographou 15780</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The great evolution of ontologies during the last decade, bred the need for storage and querying for the Semantic Web. For that purpose, many RDF tools capable of storing a knowledge base, and also performing queries on it, were constructed. Recently, fuzzy extensions to description logics have gained considerable attention especially for the purposes of handling vague information in many applications. In this paper we investigate on the issue of using classical RDF storing systems in order to provide persistent storing and querying over large-scale fuzzy information. To accomplish this we first propose a novel way for serializing fuzzy information into RDF triples thus classical storing systems can be used without any extensions. Additionally, we extend the existing query languages of RDF stores in order to support expressive fuzzy queries proposed in the literature. These extensions are implemented through the FiRE fuzzy reasoning engine, which is a fuzzy DL reasoner for fuzzy-SHIN . Finally, the proposed architecture is evaluated using an industrial application scenario about casting for TV commercials and spots.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Ontologies, through the OWL language [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], are expected to play a significant
role in the Semantic Web. OWL is mainly based on Description Logics (DLs)
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], a popular family of knowledge representation languages. However, despite
their rich expressiveness, they are insufficient to deal with vague and uncertain
information which is commonly found in many real-world applications such as
multimedia content, medical informatics etc. For that purpose a variety of DLs
capable of handling imprecise information in many flavors, like probabilistic [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]
and fuzzy [
        <xref ref-type="bibr" rid="ref14 ref15">15, 14</xref>
        ] have been proposed.
      </p>
      <p>Fuzzy ontologies are envisioned to be very useful in the Semantic Web.
Similar to crisp ontologies, they can serve as basic semantic infrastructure, providing
shared understanding of certain domains across different applications.
Furthermore, the need for handling fuzzy and uncertain information is crucial to the
Web. This is because information and data along the Web may often be uncertain
or imperfect.</p>
      <p>
        Therefore sophisticated uncertainty representation and reasoning are
necessary for the alignment and integration of Web data from different sources. This
requirement for uncertainty representation has led W3C to set up the
Uncertainty Reasoning for the World Wide Web XG1. Recently, fuzzy DL reasoners
such as fuzzyDL2 and FiRE3 that can handle imprecise information have been
implemented. Despite these implementations of expressive fuzzy DLs there is still
no other work on persistent storage and querying, besides the work of Straccia
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and Pan [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which are based on fuzzy DL-lite and can be considered as
closely related to databases, but on the other hand they don’t use RDF triple
store technologies.
      </p>
      <p>The main contributions of this paper are the following:
1. It presents a novel framework for persistent storage and querying of
expressive fuzzy knowledge bases,
2. It presents the first ever integration of fuzzy DL reasoners with RDF triple
stores, and
3. It provides experimental evaluation of the proposed architecture using a
real-world industrial strength use-case scenario.</p>
      <p>
        The rest of the paper is organized as follows. Firstly, in section 2 a short
theoretical description of the fuzzy DL f-SHIN [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is made. In section 3 the
proposed triples syntax accommodating the fuzzy element used for storing a
fuzzy knowledge base in RDF-Stores, is presented. Additionally, the syntax and
the semantics of expressive queries that have been proposed in the literature [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
to exploit fuzziness are briefly presented. In the following section (4) the fuzzy
reasoning engine FiRE which is based on the fuzzy DL f-SHIN and the way
in which it was integrated with the RDF-Store Sesame are presented. In the
last section (5) the applicability of the proposed architecture is demonstrated,
presenting a use case based on a database of human models. This database was
used by a production company for the purposes of casting for TV commercials
and spots. Some entries of the database were first fuzzified and then using an
expressive knowledge base, abundant implicit knowledge was extracted. The
extracted knowledge was stored to a Sesame repository, and various expressive
queries were performed in order to benchmark the proposed architecture.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <p>
        The Fuzzy DL fKD-SHIN
In this section we briefly present the notation of DL f-SHIN which is a fuzzy
extension of DL SHIN [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Similar to crisp description logic languages, a fuzzy
description logic language consist of an alphabet of distinct concepts names (C),
role names (R) and individual names (I), together with a set of constructors to
1 http://www.w3.org/2005/Incubator/urw3/
2 http://gaia.isti.cnr.it/~straccia/software/fuzzyDL/fuzzyDL.html
3 http://www.image.ece.ntua.gr/~nsimou/FiRE/
construct concept and role descriptions. If R is a role then R− is also a role,
namely the inverse of R. f-SHIN -concepts are inductively defined as follows,
1. If C ∈ C, then C is a f-SHIN -concept,
2. If C and D are concepts, R is a role, S is a simple role and n ∈ N, then
(¬C), (C " D), (C # D), (∀R.C), (∃R.C), (≥ nS) and (≤ nS) are also
f-SHIN -concepts.
      </p>
      <p>
        In contrast to crisp DLs, the semantics of fuzzy DLs are provided by a fuzzy
interpretation [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. A fuzzy interpretation is a pair I = (ΔI , ·I ) where ΔI is a
non-empty set of objects and ·I is a fuzzy interpretation function, which maps
an individual name a to elements of aI ∈ ΔI and a concept name A (role name
R) to a membership function AI : ΔI → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ] (RI : ΔI × ΔI → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ]).
      </p>
      <p>
        By using fuzzy set theoretic operations the fuzzy interpretation function can
be extended to give semantics to complex concepts, roles and axioms [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. FiRE
uses the standard fuzzy operators of 1 − x for fuzzy negation and max, min for
fuzzy union and intersection, respectively.
      </p>
      <p>A f-SHIN knowledge base Σ is a triple (T , R, A), where T is a fuzzy T Box,
R is a fuzzy RBox and A is a fuzzy ABox. T Box is a finite set of fuzzy concept
axioms which are of the form C - D called fuzzy concept inclusion axioms and
C ≡ D called fuzzy concept equivalence axioms, where C, D are concepts, saying
that C is a sub-concept or C is equivalent of D, respectively. Similarly, RBox is
a finite set of fuzzy role axioms of the form Trans(R) called fuzzy transitive role
axioms and R - S called fuzzy role inclusion axioms saying that R is transitive
and R is a sub-role of S respectively. Finally, ABox is a finite set of fuzzy
assertions of the form (a : C#$n), ((a, b) : R#$n), where #$ stands for ≥, &gt;, ≤ or
&lt;, or a / =. b, for a, b ∈ I. Intuitively, a fuzzy assertion of the form (a : C ≥ n)
means that the membership degree of a to the concept C is at least equal to n.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>Storing and Querying a Fuzzy Knowledge Base</title>
      <sec id="sec-3-1">
        <title>Fuzzy OWL Syntax in triples</title>
        <p>
          In order to use the existing RDF storing systems to store fuzzy knowledge
without enforcing any extensions we have to provide a way of serializing fuzzy
knowledge into RDF triples. Some work has already been done in this issue. In [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] the
authors use RDF reification, in order to store membership degrees, while the
authors in [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] use datatypes. Our goal is to neither use reification nor datatypes.
On the one hand, it is well-known that reification has weak, ill-defined model
theoretic semantics and its support by RDF tools is doubtful while on the other
hand, we do not want to use a concrete feature like datatypes to represent
abstract information such as fuzzy assertions. For those reasons we propose a more
clarified way based on the use of blank nodes. First, we define three new
entities, namely frdf:membership, frdf:degree and frdf:ineqType as types (i.e.
rdf:type) of rdf:Property.
        </p>
        <p>Our syntax becomes obvious in the following example. Suppose that we want
to represent the assertion ((paul : T all) ≥ n). The RDF triples representing this
information are the following:
paul
:paulmembTall
:paulmembTall
:paulmembTall
frdf:membership
rdf:type
frdf:degree
frdf:ineqType
:paulmembTall .</p>
        <p>Tall .
“n^^xsd:float” .
“=” .
where :paulmembPaul is a blank node used to represent the fuzzy assertion of
paul with the concept Tall.</p>
        <p>On the other hand, mapping fuzzy role assertions is more tricky since RDF
does not allow the use of blank nodes in the predicate position. Thus, we have
to use new properties for each assertion. Thus, the assertion ((paul, f rank) :
F riendOf ≥ n) is mapped to
paul
frdf:paulFriendOffrank
frdf:paulFriendOffrank
frdf:paulFriendOffrank
frdf:paulFriendOffrank
rdf:type
frdf:degree
frdf:ineqType
frank .</p>
        <p>FriendOf .
“n^^xsd:float” .
“=” .
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Extensions to Query Languages</title>
        <p>
          One of the main advantages of persistent storage systems, like relational databases
and RDF storing systems, is their ability to support conjunctive queries.
Conjunctive queries generalize the classical inference problem of realization of
Description Logics [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], i.e. get me all individuals of a given concept C, by allowing
for the combination (conjunction) of concepts and roles. Formally, a conjunctive
query is of the following form:
        </p>
        <p>q(X) ← ∃Y .conj(X, Y )
or simply q(X) ← conj(X, Y ), where q(X) is called the head, conj(X, Y ) is
called the body, X are called the distinguished variables, Y are existentially
quantified variables called the non-distinguished variables, and conj(X, Y ) is a
conjunction of atoms of the form A(v), R(v1, v2), where A, R are respectively
concept and role names, v, v1 and v2 are individual variables in X and Y or
individuals from the ontology.</p>
        <p>
          Since in our case we extend classical assertions to fuzzy assertions, new
methods of querying such fuzzy information are possible. More precisely, in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] the
authors extend ordinary conjunctive queries to a family of significantly more
expressive query languages, which are borrowed from the fields of fuzzy
information retrieval [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. These languages exploit the membership degrees of fuzzy
assertions by introducing weights or thresholds in query atoms. In particular,
the authors first define conjunctive threshold queries(CTQs) as:
        </p>
        <p>
          n
q(X) ← ∃Y. !(atomi(X, Y ) ≥ ki),
i=1
(1)
(2)
where ki ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ], 1 ≤ i ≤ n, atomi(X, Y ) represents either a fuzzy-DL concept or
role and all ki ∈ (0, 1] are thresholds. Intuitively, an evaluation [X 1→ S] (where
S is a set of individuals) is a solution if atomiI (X, Y )[X"→S,Y "→S!] ≥ ki for some
S and for 1 ≤ i ≤ n. As it is obvious answers of CTQs is a matter of true or
false, in other words an evaluation either is or is not a solution to a query. The
authors also propose General Fuzzy Conjunctive Queries (GFCQs) that further
exploit fuzziness and support degrees in query results. The syntax of a GFCQ
is the following:
        </p>
        <p>
          n
q(X) ← ∃Y. ! (atomi(X, Y ) : ki),
i=1
(3)
where atomi(X, Y ) and ki are as above. As it is shown in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], this syntax is
general enough to allow various choices of semantics, which emerge by interpreting
differently the association of the degree of each fuzzy-DL atom (atomi(X, Y ))
with the degree associated weight (ki). For example if this association is
interpreted by a fuzzy implication (J ) [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] then we obtain fuzzy threshold queries:
d =
        </p>
        <p>sup
S!∈ΔI×...×ΔI</p>
        <p>{tin=1 J (ki, atomiI (v¯)[X"→S,Y "→S!])}.</p>
        <p>
          Similarly we can use fuzzy aggregation functions [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] or fuzzy weighted t-norms
[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Variations of semantics of GFCQs can be effectively used to model
importance of query atoms, preferences, and many more. The interested reader is
referred to [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] for more details on the semantics of GFCQs.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Implementation with FiRE and Sesame</title>
      <p>
        FiRE is a JAVA implementation of a fuzzy reasoning engine for imperfect
knowledge currently supporting f-SHIN . It can be found at http://www.image.ece.
ntua.gr/~nsimou/FiRE/ together with installation instructions and examples.
Its syntax is based the Knowledge Representation System Specification [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
proposal which has been extended to fit uncertain knowledge. In this section the
inference services of FiRE are presented and the way in which it was integrated
with RDF Store Sesame 4 in order to support CTQs and GFCQs is
demonstrated.
4.1
      </p>
      <sec id="sec-4-1">
        <title>Inference services</title>
        <p>Crisp DL reasoners offer reasoning services such as deciding satisfiability,
subsumption and entailment of concepts and axioms w.r.t. an ontology. In other
words, these tools are capable of answering queries like “Can the concept C have
any instances in models of the ontology T ?” (satisfiability of C), “Is the concept
D more general than the concept C in models of the ontology T ?” (subsumption
C - D) or does axiom Ψ logically follows from the ontology (entailment of Ψ ).
4 http://www.openrdf.org/</p>
        <p>
          These reasoning services are also available by f-SHIN together with greatest
lower bound queries which are specific to fuzzy assertions. FiRE uses the tableau
algorithm of f-SHIN , presented by Stoilos et al [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], in order to decide the key
inference problems of a fuzzy ontology. In the case of fuzzy DL, satisfiability
queries are of the form “Can the concept C have any instances with degree of
participation #$n in models of the ontology T ?”. Furthermore, it is in our interest
to compute the best lower and upper truth-value bounds of a fuzzy assertion.
The term greatest lower bound of a fuzzy assertion w.r.t. Σ was defined in [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
Roughly speaking, greatest lower bound are queries like “What is the greatest
degree n that our ontology entails an individual a to participate in a concept
C?”. Entailment queries ask whether our knowledge base logically entails the
membership of an individual to a specific concept to a certain degree.
        </p>
        <p>
          Finally, FiRE allows the user to make greatest lower bound queries (GLB).
GLB queries are evaluated by FiRE performing entailment queries of the
individual participating in concept of interest for all the degrees contained in the
ABox, using the binary search algorithm in order to reduce the degrees search
space [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Furthermore a user can perform global GLB for a fuzzy knowledge
base. Global GLB service of FiRE, creates a file containing the greatest lower
bound degree of all the concepts of Σ participating in all the individuals of Σ.
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Sesame Integration with FiRE</title>
        <p>FiRE was enhanced by the functionalities of the RDF-Store Sesame (Sesame 2
beta 6). The RDF Store is used as a back-end for storing and querying RDF
triples in a sufficient and convenient way. In this architecture the reasoner is the
front-end which the user can use in order to store and query a fuzzy knowledge
base. Additionally, a user is able to access data from a repository, apply any of the
available reasoning services on this data and then store the implicit knowledge
extracted from them back in the repository.</p>
        <p>
          Another important benefit from this integration is the use of the query
language SPARQL [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] in the implementation of the fuzzy queries of section 3.
These queries are performed using the Queries inference tab of FiRE, and in the
case of generalized fuzzy conjunctive queries, users can choose among semantics,
such as fuzzy threshold queries, fuzzy aggregation and fuzzy weighted queries.
Example 1. A threshold query that reveals their syntax in FiRE follows:
x,y &lt;- Tall(x) &gt;= 0.8 ^ has-friend(x,y) &gt;= 0.4 ^ Short(y) &gt;= 0.7
Queries consist of two parts: the first one specifies the individuals that will
be evaluated while the second one states the condition that has to be fulfilled
for the individuals. This query asks for individuals x and y, x has to participate
in concept Tall to at least the given degree, it also has to be the subject of a
has-friend assertion with participation greater than 0.4, having as a role-filler
individual y which has to participate in concept Short to at least the given
degrees.
Example 2. We can issue a GFCQ by using the symbol “:” followed by the
importance of participation for each condition statement instead of inequality
types. Hence we can get all female models and rank those who have long hair
higher than those who are good-looking:
x &lt;- Female(x) : 1 ^ Goodlooking(x) : 0.6
        </p>
        <p>^ has-hairLength(x,y) : 1 ^ Long(y) : 0.8</p>
        <p>In case of CTQs, a query is firstly converted from the FiRE conjunctive query
syntax to SPARQL query language. Based on the fuzzy OWL syntax in triples
that we have defined in section 3.1 the query of Example 1 is as follows in
SPARQL. The query results are evaluated by Sesame engine and visualized by
FiRE.</p>
        <p>SELECT ?x WHERE {
?x ns5:membership ?Node1 .
?Node1 rdf:type ?Concept1 .
?Node1 ns5:ineqType ?IneqType1 .
?Node1 ns5:degree ?Degree1 .</p>
        <p>FILTER regex (?Concept1 , "CONCEPTS#Tall")
FILTER regex (?IneqType1 ,"&gt;")
FILTER (?Degree1 &gt;= "0.8^^xsd:float")
?BlankRole2 ns5:ineqType ?IneqType2 .
?BlankRole2 ns5:degree ?Degree2 .
?BlankRole2 rdf:type ?Role2 .
?x BlankRole2 ?y .</p>
        <p>FILTER regex (?Role2 , "ROLES#has-friend")
FILTER regex (?IneqType1 ,"&gt;")
FILTER (?Degree2 &gt;= "1.0^^xsd:float")
...</p>
        <p>}</p>
        <p>In case of general fuzzy conjuctive queries the operation is different. The
SPARQL query is constructed in a way that retrieves the participation degrees
of every Role or Concept used in the atoms criterions, for the results that satisfy
all of the atoms. The participation degrees retrieved for each query atom are
then used together with the degree associated weight by FiRE for the ranking
procedure of the results according to the selected semantics.</p>
        <p>
          It is worth mentioning that the proposed architecture obviously does not
provide a complete query answering system for f-SHIN since queries are issued
against the stored assertions of the RDF repository. Hence queries that include,
for example, transitive or inverse roles are not correctly evaluated. On the one
hand, query answering for fuzzy-DLs is still an open problem even for
inexpressive fuzzy-DLs while on the other hand, even for classical DLs it is known
that the algorithms are highly complex [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and no practically scalable system
is known. However, in order to limit the effects of incompleteness, the f-SHIN
expressibility is used by FiRE for the extraction of implicit knowledge that is
stored in the repository performing GLB tests.
5
5.1
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation</title>
      <sec id="sec-5-1">
        <title>Models use case</title>
        <p>In this section we present the use case of human models utilized for the evaluation
of our proposal. The data were taken from a production company database
containing 2140 human models. The database contained information on each
model regarding their height, age, body type, fitness type, tooth condition,
eyecondition and color, hair quality, style, color and length, ending with the hands’
condition. Apart from the above, there were some additional, special-appearance
characteristics for certain models such as good-looking, sexy, smile, sporty, etc.
introduced by the casting producer. Finally for a minority of models, a
castingvideo was stored in the database. The main objective of the production company
was to pick a model, based on the above features, who would be suitable for a
certain commercial spot. Furthermore, depending on the spot type, inquiries
about models with some profession-like characteristics (like teacher, chef, mafia
etc.) were also of interest.</p>
        <p>Despite the fact that the database information on each model was rich
enough, there was great difficulty in querying models of appropriate
characteristics. The main reason for that was that the database information was not
semantically organized. The various tables of a database made the searching for
combined characteristics antiliturgical. Additionally, retrieval of models based
on threshold criteria for their age or height was most of the times inaccurate
since this kind of information is clearly fuzzy. Hence, selection was restricted
among models that had videotaped castings or those that had worked with the
producers in previous spots, thus not taking advantage of their database
information.</p>
        <p>In order to eliminate these limitations we have implemented a fuzzy
knowledge base using f-SHIN . For the generation of the fuzzy ABox the
characteristics given by numerical values being the height and age, were fuzzified defining
new concepts, while the remaining characteristics were used as crisp assertions.
Therefore, the fuzzification process of age was made by setting fuzzy partitions
depending on age by defining the concepts Kid, Teen, Baby, 20s, 30s, 40s, 50s,
60s and Old. Hence, as can be observed from the age fuzzification graph, a model
who is 29 years old participates in both concepts 20s and 30s with degrees 0.35
and 0.65 respectively. Similarly for the fuzzification process of height, the
concepts Very Short, Short, Normal Height, Tall and Very Tall were defined. In the
case of the height characteristic, the fuzzy partition used for female models was
different from the one used for males, since the average height of females is lower
than that of males. The fuzzification graphs of age and men’s height are shown
in Figure 1. An example of the produced assertions is shown in Example 3.
Example 3. An excerpt of ABox for the model michalis1539 is
(michalis1539 : 20s ≥ 0.66), (michalis1539 : 30s ≥ 0.33)
(michalis1539 : N ormal Height ≥ 0.5)
(michalis1539 : T all ≥ 0.5), (michalis1539 : GoodLooking ≥ 1)
((michalis1539, good) : has − toothCondition ≥ 1), (good : Good ≥ 1)
In order to permit knowledge-based retrieval of human models we have
implemented an expressive terminology for a fuzzy knowledge base. The alphabet of
concepts used for the fuzzy knowledge base consists of the features described
above while some characteristics like hair length, hair condition etc. were
represented by the use of roles.</p>
        <p>The effective extraction of implicit knowledge from the explicit one requires
an expressive terminology capable of defining higher concepts. In our case the
higher domain concepts defined for human models lie into five categories: age,
height, family, some special categories and the professions. Hence, the
profession Scientist has been defined as male, between their 50s or 60s, with classic
appearance who also wears glasses. In a similar way we have defined 33 domain
concepts; an excerpt of the T Box can be found in Table 1.</p>
        <p>At this point we must mention the fact that the proposed fuzzy knowledge
base does not fully utilize the expressivity of f-SHIN . This restriction is due to
the application domain (i.e transitive and inverse roles or number restrictions
are not applicable in this domain), but nevertheless it is more expressive that
an fuzzy DL-Lite ontology.
All the experiments were conducted under Windows XP on a Pentium 2.40 GHz
computer with 2. GB of RAM.</p>
        <p>The described fuzzy knowledge base was used in the evaluation of our
approach. Implicit knowledge was extracted using the greatest lower bound service
of FiRE, asking for the degree of participation of all individuals, in all the defined
domain concepts. The average number of assertions per individual was 13 while
the defined concepts were 33, that together with the 2140 individuals (i.e entries
of the database) resulted to 29460 explicit assertions and the extraction of 2430
implicit. These results, together with concept and role axioms, were stored to
a Sesame repository using the proposed fuzzy OWL triples syntax to form a
repository of 529.926 triples.</p>
        <p>The average time for the GLB reasoning process and the conversion of explicit
and implicit knowledge to fuzzy OWL syntax in triples was 1112 milliseconds.
The time required for uploading the knowledge to a Sesame repository depends
on the type of repository (Memory or Native) and also on repository’s size. Based
on our experiments, we have observed that the upload time is polynomial to the
size of the repository but without significant differences. Therefore, the average
minimum upload time to an almost empty repository (0-10.000 triples) is 213
milliseconds while the average maximum upload time to a full repository (over
500.000 triples) is 700 milliseconds.</p>
        <p>FiRE and Sesame were also examined in the use of expressive fuzzy queries.
The performance in this case mainly depended on the complexity of the query
but also on the type and size of the repository. Queries using role names in
combination with large repositories can dramatically slow down the response. Table
2 illustrates the response times in milliseconds using both types of repositories
and different repository sizes. Repository sizes was set by adjusting the number
of assertions. As it can be observed, very expressive queries seeking for young
female models with beautiful legs and eyes as well as long hair, a popular demand
in commercial spots, can be easily performed. It is worth mentioning that these
queries consist of higher domain concepts defined in our fuzzy knowledge base.</p>
        <p>Since our system is not a sound and complete query answering system for
f-SHIN , the GLB service performed before uploading the triples is employed in
order to use as much of the expressivity of the language as possible producing
new implied assertions.</p>
        <p>Furthermore, the results regarding query answering time are also very
encouraging, at least for the specific application. Although, compared to crisp querying,
over crisp knowledge bases, our method might require several more seconds to
be answered (mainly due to post processing steps for GFCQs or due to very
lengthy SPARQL queries for CTQs) this time is significantly less, compared to
T = {MiddleAged ≡ 40s " 50s,</p>
        <p>TallChild ≡ Child # (Short " Normal Height),</p>
        <p>Father ≡ Male # (30s " MiddleAged),</p>
        <p>Legs ≡ Female # (Normal Height " Tall)</p>
        <p>#(Normal " Perfect) # (Fit " PerfectFitness),
Teacher ≡ (30s " MiddleAged) # Elegant # Classic,
Scientist ≡ Male # Classic # (50s " 60s)</p>
        <p>#Serious # ∃has − eyeCondition.Glasses }
Table 1. An excerpt of the Knowledge Base (T Box).
the time spent by producers on casting (usually counted in days), since they
usually have to browse through a very large number of videos and images before
they decide.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>Due to the fact that imperfect information is spread along the web, the effective
management of imperfect knowledge is very important for the substantial
evolution of the Semantic Web. In this paper, we have proposed an architecture that
can be used for storing and querying fuzzy knowledge bases for the semantic web.
Our proposal which is based on DL f-SHIN , consists of the RDF triples
syntax accommodating the fuzzy element, the fuzzy reasoning engine FiRE and its
integration with RDF Store Sesame which permits very expressive fuzzy queries.</p>
      <p>The proposed architecture was evaluated using an industrial application
scenario about casting for TV commercials and spots. The obtained results are very
promising from the querying perspective. From the initial 29460 explicit
assertions made by database instances for models, 2430 new implicit assertions where
extracted and both uploaded in the Sesame repository. In this way expressive
semantic queries like “Find me young female models with beautiful legs and eyes
as well as long hair”, that might have proved very difficult or even impossible
using the producing company’s database, are applicable through FiRE. This
reveals both the strength of knowledge-based applications, and technologies for
managing fuzzy knowledge, since a wealth of the information of the databases,
like age, height, as well as many high level concepts of the specific application,
like “beautiful eyes”, “perfect fitness” and “scientist look” are inherently fuzzy.</p>
      <p>As far as future directions are concerned, we intend to further investigate on
different ways of performing queries using expressive fuzzy description logics.
Finally, it would be of great interest to attempt a comparison between the proposed
architecture and approaches using fuzzy DL-lite ontologies and approximation.</p>
      <p>Query
x ← Scientist(x)
x ← Father(x) ≥ 1 ∧ Teacher(x) ≥ 0.8
∧Normal Height(x) ≥ 0.5.</p>
      <p>x ← Scientist(x) : 0.8
x ← Father(x) : 0.6 ∧ Teacher(x) : 0.7
∧Normal Height(x) : 0.8.</p>
      <p>Native Memory
100.000 250.000 500.000 100.000 250.000 500.000
1042 2461 3335 894 2364 3332</p>
      <p>This work is supported by the FP6 Network of Excellence EU project X-Media
(FP6026978) and K-space (IST-2005-027026).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <article-title>Description-logic knowledge representation system specification from the KRSS group of the ARPA knowledge sharing effort</article-title>
          . http://dl.kr.org/krss-spec.ps.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.F.</given-names>
            <surname>Patel-Schneider</surname>
          </string-name>
          .
          <article-title>The Description Logic Handbook: Theory, implementation and applications</article-title>
          . Cambridge University Press,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>B.</given-names>
            <surname>Glimm</surname>
          </string-name>
          , I.Horrocks,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lutz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>Conjunctive query answering for SHIQ</article-title>
          .
          <source>Technical report</source>
          , University of Manchester,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>A.</given-names>
            <surname>Chortaras</surname>
          </string-name>
          , Giorgos Stamou, and
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Stafylopatis</surname>
          </string-name>
          .
          <article-title>Adaptation of weighted fuzzy programs</article-title>
          .
          <source>In Proc. of the International Conference on Artificial Neural Networks (ICANN</source>
          <year>2006</year>
          ), pages
          <fpage>45</fpage>
          -
          <lpage>54</lpage>
          . Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>V.</given-names>
            <surname>Cross</surname>
          </string-name>
          .
          <article-title>Fuzzy information retrieval</article-title>
          .
          <source>Journal of Intelligent Information Systems</source>
          ,
          <volume>3</volume>
          :
          <fpage>29</fpage>
          -
          <lpage>56</lpage>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>G.</given-names>
            <surname>Stoilos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Stamou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tzouvaras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          .
          <article-title>Reasoning with very expressive fuzzy description logics</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          ,
          <volume>30</volume>
          (
          <issue>5</issue>
          ):
          <fpage>273</fpage>
          -
          <lpage>320</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Tobies</surname>
          </string-name>
          .
          <article-title>Reasoning with Individuals for the Description Logic SHIQ</article-title>
          . In David MacAllester, editor,
          <source>CADE-2000</source>
          ,
          <article-title>number 1831 in LNAI</article-title>
          , pages
          <fpage>482</fpage>
          -
          <lpage>496</lpage>
          . Springer-Verlag,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>G. J.</given-names>
            <surname>Klir</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Yuan</surname>
          </string-name>
          .
          <article-title>Fuzzy Sets and Fuzzy Logic: Theory and Applications</article-title>
          . Prentice-Hall,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>M.</given-names>
            <surname>Mazzieri</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.F.</given-names>
            <surname>Dragoni</surname>
          </string-name>
          .
          <article-title>A fuzzy semantics for semantic web languages</article-title>
          .
          <source>In ISWC-URSW</source>
          , pages
          <fpage>12</fpage>
          -
          <lpage>22</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>J.Z. Pan</surname>
          </string-name>
          , G. Stamou, G. Stoilos, and E. Thomas.
          <article-title>Expressive querying over fuzzy DL-Lite ontologies</article-title>
          .
          <source>In Proceedings of the International Workshop on Description Logics (DL</source>
          <year>2007</year>
          ),
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>P.F.</given-names>
            <surname>Patel-Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hayes</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I.Horrocks.</surname>
          </string-name>
          <article-title>Owl web ontology language semantics and abstract syntax</article-title>
          .
          <source>Technical report, World Wide Web Consortium</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. E.
          <string-name>
            <surname>Prud</surname>
          </string-name>
          <article-title>'hommeaux and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Seaborne</surname>
          </string-name>
          .
          <source>SPARQL query language for RDF</source>
          ,
          <year>2006</year>
          . W3C Working Draft, http://www.w3.org/TR/rdf-sparql-query/.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>R.</given-names>
            <surname>Giugno</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Lukasiewicz. P-SHOQ(D):</surname>
          </string-name>
          <article-title>A probabilistic extension of SHOQ(D) for probabilistic ontologies in the semantic web</article-title>
          .
          <source>In JELIA '02: Proceedings of the European Conference on Logics in Artificial Intelligence</source>
          , pages
          <fpage>86</fpage>
          -
          <lpage>97</lpage>
          , London, UK,
          <year>2002</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. G. Stoilos,
          <string-name>
            <given-names>G.</given-names>
            <surname>Stamou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tzouvaras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Horrocks. Fuzzy OWL</surname>
          </string-name>
          :
          <article-title>Uncertainty and the Semantic Web</article-title>
          .
          <source>In Proc. of the OWL-ED</source>
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>U.</given-names>
            <surname>Straccia</surname>
          </string-name>
          .
          <article-title>Reasoning within fuzzy description logics</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          ,
          <volume>14</volume>
          :
          <fpage>137</fpage>
          -
          <lpage>166</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. U.Straccia and
          <string-name>
            <surname>G.Visco.</surname>
          </string-name>
          <article-title>DLMedia: an ontology mediated multimedia information retrieval system</article-title>
          .
          <source>In Proceeedings of the International Workshop on Description Logics (DL 07)</source>
          , volume
          <volume>250</volume>
          ,
          <string-name>
            <surname>Insbruck</surname>
          </string-name>
          , Austria,
          <year>2007</year>
          . CEUR.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. V.
          <article-title>Vanekov´a</article-title>
          , J. Bella, P. Gursky´, and
          <string-name>
            <given-names>T.</given-names>
            <surname>Horv</surname>
          </string-name>
          <article-title>´ath. Fuzzy RDF in the semantic web: Deduction and induction</article-title>
          .
          <source>In Proceedings of Workshop on Data Analysis (WDA</source>
          <year>2005</year>
          ), pages
          <fpage>16</fpage>
          -
          <lpage>29</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>