<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Imaging Words - Wording Images</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adrian Popescu</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gregory Grefenstette</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cristophe Millet</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pierre-Alain Moëllic</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Patrick Hède</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>-The rapid growth of the Internet information sources has led to organizing proposals, such as the Semantic Web initiative, with its ontological level providing a formal structuring for this disparate data. But given the amount of information to be treated even in a restricted domain, manual organization becomes rapidly unmanageable, and automatic methodologies for ontology building are required. Here we describe techniques for the automatic construction of a image ontology based on multimedia data (text and images) for a specific class of objects, manmade tools. Our approach combines modification of existing lexical resources and search engine querying in order to obtain raw images. These images are then clustered into representative concepts for the ontology. Our automated approach can be applied to any subset of physical objects.</p>
      </abstract>
      <kwd-group>
        <kwd>Image</kwd>
        <kwd>Ontology</kwd>
        <kwd>OWL</kwd>
        <kwd>Semantic WordNet</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
      <p>
        Aconstruction of large scale ontologies is a costly effort
s proven by initiatives like CYC [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the manual
and it is unrealistic to think that this approach can solve
current needs for knowledge organization. This is especially
true for highly dynamic resources like the WWW, where the
increase in knowledge resources follows an exponential curve.
The Semantic Web, with its description of content in
ontologies has been presented as a potential solution to the
information structuring problems. But, as underlined in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], a
vicious circle is created as the Semantic Web is dependent on
the existence of metadata and these last rely themselves on the
existence of a well populated Semantic Web. A way to cope
with this problem is the development of automatic or
semiautomatic methodologies for the ontology construction.
Interesting results for automatic lexical ontology building are
reported in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>In this paper, we describe our technique for automatically
filling multimedia ontologies, grounding each concept in text
and images. After a transposition of parts of WordNet [Miller]
into OWL (Ontology Web Language) in order to create a
taxonomical base, we have lexical information associated to
concepts. For the image part of the grounding, we query the
Web to gather pictures corresponding to objects in the
taxonomy that are then clustered and filtered.</p>
      <p>We structured the rest of this paper as follows: we discuss a
translation of WordNet to OWL, we describe our image
gathering and clustering tool and, before concluding, some
All authors are with Commisariat à l’Energie Atomique – LIST, France.
preliminary results of our method for image ontology
construction.</p>
      <p>
        Our current work deals with the automatic construction of a
grounded ontology. In order to automatically build such a
formal structure, we need an associated taxonomy. There are
two main possibilities that are offered to us: learning
taxonomy from concepts found on the Web [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] or using one
from an existing resource. We have chosen the second variant
and used WordNet [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] as source for our taxonomy. Thus, we
preserve the automatic character of our methodology and are
able to exploit the richness of a resource that was manually
constructed by lexicographers. We are aware of the criticisms
raised by the transformation of WordNet into a formal
ontology [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], but with the implementation choices we have
made, we try to minimize their effects. There is notably the
fact that our method only addresses picturable objects, which
are ontologically less controversial than high level concepts.
The approach we propose is domain independent. It depends
uniquely of the knowledge contained in the resource we
parsed. For exemplification purposes only, the examples
furnished here are subconcepts of tool in WordNet.
      </p>
      <p>The envisioned application, construction of a structured
image catalogue, determined us to parse only parts of the
information contained in WordNet to OWL. We transformed
the sets of synonyms (synsets) in OWL classes, preserving the
sense separation. Thus, knife from the lexical hierarchy
becomes knife__1 in the ontology, while garden tool, lawn
tool is transformed to garden_tool__1. Lawn tool is saved as
an RDFS comment as another member of the garden_tool__1
class. We equally parsed the terms definitions in the ontology.</p>
      <p>
        Image clusters are associated exclusively to leave concepts
in the OWL ontology. The rationale for this decision is that,
with the use of hyponymy relation, we can propose image sets
for all concepts in the ontology. Moreover, the leave terms
generally are specialized concepts that point towards precise
entities [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which are less ambiguous both in language and
the associated picture representation.
      </p>
    </sec>
    <sec id="sec-2">
      <title>III. IMAGE CLUSTERING MODULE</title>
      <p>We propose a second structuring axis in our image
catalogue. The use of an ontology allows inter-class
organization, while an image clustering tool provides means
for intra-class structure. A clustering process was run for each
leaf concept in the ontology. This process consists of two
steps: image indexing and clustering following visual
similarity.</p>
      <sec id="sec-2-1">
        <title>A. Image indexing</title>
        <p>
          We deal with pictures from broad domains and we need a
general image indexing technique. Using an approach based
on border/interior pixel classification [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. We construct two
histograms for each image, one for pixels on the image
borders and pixels in interior regions. This indexing algorithm
is fast, simple and provides information about colors in the
image and, equally important, about sizes of image regions
having a constant color (possibly objects). It leads to the
construction of a vector containing 128 elements for each
picture. We use the Riemann distance as similarity measure
between two images. Distances are calculated between all
pairs of images.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>B. Image clustering</title>
        <p>
          The indexed images are clustered using a k-SNN (Shared
Nearest Neighbors) algorithm [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. For each image, a
neighborhood of k images is considered in the algorithm. The
similarity of two images is assessed with respect to the degree
of overlapping of their neighborhoods. Next, pictures that are
most similar to their neighbors are considered as topic images
and clusters are structured around them. A useful feature of
the algorithm is that it does not impose the classification of all
indexed images. Pictures considered weakly related to topics
remain unclustered. This last feature is important in our
application as we work in a noisy environment (there are a lot
of images on the Web that are not annotated in direct relation
to their visual content). We thus hope to isolate images that
are irrelevant for the desired object and build highly coherent
clusters of images containing it. Given that the classification is
entirely automatic, there is noise that subsists in the clusters,
but the obtained results seem more coherent than the set of
images initially retrieved, though we have not yet performed
extensive evaluation.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>IV. PRELIMINARY RESULTS</title>
      <p>
        We already stated that our purpose here is to build a
structured image catalogue using images from the Web.
Instead of querying for images for all concepts in the
ontology, we perform this operation for leaves only and, via
hyponymy, propose picture sets for all other concepts in the
hierarchy. This results in an structured presentation of results,
while taking advantage of the fact that the image sets
associated to leaves are less noisy (they correspond to well
defined entities in the world[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]). An example of the obtained
results is presented for knife in two situations. We use Google
Image for the pictures in fig. 1 and our method (ontology for
inter-class structure and clustering for intra-class
organization).
      </p>
      <p>We observe that the images in fig. 2 illustrate better the
notion of knife and are ontologically and visually organized,
which is not the case for figure 1. Extensive evaluations are
needed in order to assess if the proposed method performs
better than existing ones in image retrieval tasks.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Cimiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hotho</surname>
          </string-name>
          ,and
          <string-name>
            <given-names>S.</given-names>
            <surname>Staab</surname>
          </string-name>
          , “
          <article-title>Comparing Conceptual, Divisive and Agglomerative Clustering for Learning Taxonomies from Text”</article-title>
          ,
          <source>in Proc. of ECAI</source>
          <year>2004</year>
          , Valencia, Spain,
          <year>2004</year>
          , pp.
          <fpage>435</fpage>
          -
          <lpage>439</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] CYC, www</article-title>
          .cyc.com
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ertoz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Steibach</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Kumar</surname>
          </string-name>
          , “
          <article-title>Finding Topics in Collections of Documents: A Shared Nearest Neighbor Approach”</article-title>
          , In: Wu,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Sheklar</surname>
          </string-name>
          , S.(eds.):Clustering and
          <string-name>
            <given-names>Information</given-names>
            <surname>Retrieval</surname>
          </string-name>
          , Kluwer,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gangemi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Navigli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Velardi</surname>
          </string-name>
          , “
          <article-title>The OntoWordNet Project: Extension and Axiomatisation of Conceptual Relations in WordNet”</article-title>
          ,
          <source>in Proc of. CoopIS/DOA/ODBASE</source>
          <year>2003</year>
          , Catania, Sicily, Italy,
          <year>2003</year>
          , pp.
          <fpage>689</fpage>
          -
          <lpage>706</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Miller</surname>
          </string-name>
          , “
          <article-title>Nouns in WordNet: a Lexical Inheritance System</article-title>
          .
          <source>International Journal of Lexicography”</source>
          ,
          <volume>3</volume>
          ,
          <issue>4</issue>
          ,
          <year>1990</year>
          , pp.
          <fpage>245</fpage>
          -
          <lpage>264</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Rosch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mervis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. B.</given-names>
            <surname>Gray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , P. Boyes - Braem, “Basic Objects in Natural Categories”,
          <source>Cognitive Psychology</source>
          ,
          <volume>8</volume>
          ,
          <year>1976</year>
          , pp.
          <fpage>382</fpage>
          -
          <lpage>439</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R. O.</given-names>
            <surname>Stehling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.A.</given-names>
            <surname>Nascimento</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.X.</given-names>
            <surname>Falcao</surname>
          </string-name>
          , “
          <article-title>A Compact and Efficient Image Retrieval Approach Based on Border/Interior Pixel Classification”</article-title>
          ,
          <source>inProc. of the Eleventh International Conference on Information and Knowledge Management. Mc Lean</source>
          , Virginia, USA. ACM Press,
          <year>2002</year>
          , pp.
          <fpage>102</fpage>
          -
          <lpage>109</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>