<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <abstract>
        <p>In this paper we discuss a tool for annotation and search in a collection of art images. Multiple existing Thesaurus may be used to support this process, including the Art and Architecture Thesaurus, WordNet, ULAN and Iconclass. We discuss knowledge-engineering aspects such as the annotation methodology, and the use of taxonomic dynamic search. The annotation and search process are illustrated with an application scenario.</p>
      </abstract>
      <kwd-group>
        <kwd>Images recovery by content</kwd>
        <kwd>dynamic taxonomies</kwd>
        <kwd>CBIR</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Historically, curators have retrieved images by first manually annotating them with
keywords. Given a query, these annotations are used to retrieve appropriate pictures.
Underlying this approach is the belief that the words associated (manually) with a
picture essentially capture the semantics of the picture and any retrieval based on
these keywords will, therefore, retrieve relevant pictures. Since manual image
annotation is expensive, there has been great interest in producing automatic ways to
retrieve images based on content.</p>
      <p>Queries based on image concepts like color or texture, have been proposed for
retrieving images by content, but most users find it difficult to query using such visual
attributes. Most people would prefer to pose text queries and find images relevant to
those queries. For example, one should be able to pose a query like find me sad
people. This is difficult, if not impossible, with many of the current image retrieval
systems. This paper aims to address the problem of art image retrieval (AIR) and
introduce a methodology to intent to reduce the semantic gap in image annotation of
art images from a large art image collection.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Content based image retrieval (CBIR)</title>
      <p>The explosive growth of digital media and its usage on the web has opened both great
challenges and opportunities to computer science researchers. One such example is
the lively research area content-based image retrieval (CBIR), which aims to
effectively and efficiently locate a few relevant images in a large database according
to a user’s query concept.</p>
      <p>
        This paper will focus on another related, but different topic: Art image retrieval
(AIR), which is of great interests to us because of its application potentials and
technical challenges. In recent years, many museums, galleries and commercial
companies have put their art images on the web for online exhibitions or selling. The
WWW technology has created dramatic flexibilities in helping people freely share,
exchange, enjoy and purchase paintings. To facilitate the search of paintings, one of
the main standards for the cataloging of cultural objects is the one proposed by the
Getty Research Institute, namely, Categories for the Description of Works of Art
(CDWA) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Besides being a very complete and accepted model, from our very own
point of view, it does not comply with all of our possible descriptions of complex
objects that are required in the particular case of Industrial heritage objects and New
Media Art.
      </p>
      <p>
        A second model considered was the Cataloging Cultural Objects model (CCO). A
guide to describe cultural works and their images was published as a draft in February
2005, and its final version was published in 2006 [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. These two models
categorized all the art images by styles (e.g. classical art, impressionism) or other
information such as names of artists. However, cataloging-based search might not be
sufficient in capturing users’ personal interests.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2.1 Properties of Art Image Retrieval</title>
      <p>From a technical point of view, AIR encounters many challenges that are also
common in CBIR, like high-dimension feature space, nonlinear distributions,
insufficient training examples and the gap between low-level features and high-level
contents.</p>
      <p>Thus previous research on CBIR has provided a good basis for our work on AIR.
Moreover, there are some other unique characteristics coming with AIR. In the
following work we will discuss some major issues.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Information Retrieval of Artistic Objects</title>
      <p>One of the objectives that our approach tries to achieve is the search and information
retrieval through an analysis of a textual description carried out for different pieces
from the database of the artistic objects.</p>
      <p>In art works databases, information retrieval could be done by the cataloging
descriptors: author, title, or pictorial tendency. However, in many cases, users would
like to retrieve some or a group of works through the objects painted, photographed or
drawn. In other words, if a user were interested in retrieving pieces of work that
include women, as response, he would expect for pieces in which feminine figures
appear: Girls, young women or adult women. The type of search mentioned implies
the use of facets that allows proper retrieval of items searched. The types of query
required should then allow retrieval of information, either through the
alreadyexisting facets in the system or defined at the time of execution through search
equations in which the user is able to include a series of words that may indicate the
elements that define a concept, which must be found in the description of the piece, so
that it can be retrieved (simple concept). For example,</p>
      <p>Urban images = vehicles + buildings + people
In both cases, the defined concepts may become permanent upon the user’s request.
By doing so, some feedback can be obtained from the user. On the other hand, as an
object can be a part belonging to more than one concept, it is the intention to be able
to categorize the order of importance of the object regarding the concepts related to it.</p>
      <p>To support the structure and efficient retrieval of the artistic objects, and given the
complexity in the multiplicity of the possible interpretations and contents that can be
linked to the objects, we recommend the use of a faceted classification process.</p>
      <p>
        Under the previous premise, faceted taxonomies can take advantage of the ways in
which the metadata act. Metadata are the collection of structured information about
any type of object or any part of that object, e.g. the name (or title) of a work art, the
author, age, image, textual description (denotation), its interpretation (connotation), or
its gender. All these are some kind of metadata associated with a cultural object.
In our approach, we have implemented the faced classification, and thus, the metadata
model proposed in [
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ] was extended with the metadata of its gender, connotation,
and denotation, so as to include its textual description as a part of the requests of the
objects. This fact determines the need of a methodology that allows us to generate the
taxonomic structure of the facets on which the different terms included in the textual
description of the objects are going to be classified.
      </p>
      <p>
        In this natural way, every element of the metadata structures can be incorporated
as a concept of the faceted taxonomy and can be retrievable through a search engine.
The importance of taxonomies is based on the possibility of their use as the triplet
{classification diagram, semantic interpretation, knowledge map} [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] in the
organization and later retrieval of possibly stored objects in a database. As a
consequence, we have the opportunity to access to a request of an object under any of
the dimensions in which they were classified. If we consider the example previously
given, we could retrieve a cultural object by its gender, connotation, denotation, or
simply surf the different facets, because they are orthogonal among themselves.
      </p>
      <p>
        Sacco [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] introduced the concept of dynamic taxonomy, along with the notion to
bear the incorporation of facets that require themselves an independent taxonomy for
their description. Dynamic Taxonomies [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] (DT, also recently known as Faceted
Search Systems) are a general knowledge management model based on a
multidimensional classification of heterogeneous data objects, and are used to
explore/browse complex information bases on a guided, yet unconstrained, way
through a visual interface. The model is primarily concerned with user-centered
access, and object classification is not addressed in the base model. As our interest is
the retrieval of the works of art based on the textual description of their images, we
will use the fundamentals of the dynamic taxonomies, but will extend the domain of
data modeling established by Sacco, including textual objects within the metadata.
      </p>
      <p>In the literature, we can find two interesting approaches to the treatment of textual
objects. The first one presupposes the existence of several topics on which a
taxonomy is to be developed, and in this way, the job of the developed algorithms is
to extract the terms and concepts linked to the topics in question from the documents.
In this approach, each of the several derived taxonomies requires a group of key
terms, linked to the concepts based on which the classification of the documents of
the target objective will be carried out.</p>
      <p>
        The second approach, in contrast, focuses on determining the facets and defining
their taxonomy, departing from the textual analysis of the documents of the
collection. In this case, some techniques for text analysis are used. In particular, in
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], an algorithm is described, used by the Flemish project of the University of
Berkeley for the automatic generation of facets about the corpus in the English
language, using WordNet.
      </p>
    </sec>
    <sec id="sec-5">
      <title>4. Faceted Indexing and Classification</title>
      <p>
        The purpose of indexing is to identify the concepts that represent the content and
translation of documents into a way that can be computably managed [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The use of
a faceted classification will assist us in analyzing the text, and will allow us to ensure
that we are entering all relevant items that should be indexed.
      </p>
      <p>We would now present a brief description of the three facets, genre, denotation,
and connotation, which we had already mentioned in the previous paragraphs. Using
these three facets, we would extend the classification of the artistic objects.</p>
      <p>Genre (Artistic subject): Each of the different categories or kinds that can be
used to order the works depending on common features, such as shape and
content. For example, the portraits and self-portraits, landscapes, religion,
mythology, etc.</p>
      <p>Denotation (Content): Concepts that are shown in an artistic work.</p>
      <p>Connotation (Interpretation): Concepts reflected or transmitted by the artistic
work. For example, the word rain has the connotative meaning of sadness or
melancholy.</p>
      <p>From these three facets, only the facet Genre was defined by an expert user. The
other two facets were obtained automatically.</p>
      <p>
        To develop an information system and a motor search for the retrieved objects in a
collection of 500 works of art using their textual description in Spanish [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], our
methodology uses some techniques and algorithms used in Information Retrieval (IR),
to generate, in a semi automatic way, a faceted classification that allows us to link a
resort of a document to the different facets. By doing so, we could obtain a controlled
vocabulary extracted from the processing of different texts of the target collection.
Subsequently, with the help of the controlled vocabulary and a thesaurus, the concepts
were determined. These concepts were structured hierarchically with the aid of the
thesaurus; in particular, we used the following: Human Emotions Thesaurus (HET)1,
online thesaurus of the human emotions developed by clinical researchers of the
University of British Columbia and the Tesauro Europeo de la Educación2, a
thesaurus structured in Spanish created in 2003 by the Commission of the European
Communities, which contains 3516 terms and 2100 relations.
      </p>
      <p>
        Finally, the objects are indexed [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] with the faceted taxonomy. This allowed us
to generate a faceted classification system. The objects, the faceted taxonomy, and
1Human Emotions Thesaurus.
http://www.slais.ubc.ca/COURSES/libr512/04-05wt2/thesauri/het/
2 Tesauro Europeo de la Educación. http://redined.r020.com.ar/es/
indexes are stored in a database and integrated to the system that enables the user to
explore and, at the same time, refine the search through a navigation tree. In Figure 1,
the process is shown through an activity diagram.
      </p>
      <p>Figure 1. Activity diagram of the dynamic taxonomy generation process</p>
    </sec>
    <sec id="sec-6">
      <title>5. User Interface Design</title>
      <p>
        The exploration by the user will be realized by means of a navigation based search
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] (taxonomic), which allows to guide the user so that he can interact and tune his
query. In addition, it allows to structure, present, and access to the information in
more than one dimension, which benefits the user, because the user can easily and
intuitively locate and explore the information by means of the different approaches
provided by the facets. The navigation based search allows representing the
characteristic of dynamics taxonomy visually. In Figure 2, we have shown an
example of an object classified under two facets.
One of the core principles of dynamic taxonomies is to restrict the available filtering
options in the given focus to only those that will lead to a nonempty result set. Hence,
the user can never run into a situation with zero results. This is opposed to the process
in a typical advanced search situation, where first, a complex Boolean query is
constructed, which is then evaluated on demand. This, however, can result in empty
result sets, often without further indication on which part of the query could be
relaxed to retrieve some results. The exclusion of potentially frustrating situations by
design is often referred to as poka-yoke principle (see Figure 3).
In Figure 5, we have summarized our approach for automatic annotation of the
images of the pieces of an Art Collection based on their textual description, and in
this way, can described an object in more detail. With this, every artwork object
should index not only with the information contained in textual descriptions, but
should also have the hierarchical information of the concepts that lie in the dynamic
taxonomy.
      </p>
    </sec>
    <sec id="sec-7">
      <title>6. Results and Future Work</title>
      <p>We proposed a methodology, using IR techniques, to extract concepts and structures
and facets to manage information and enable further research and exploration.</p>
      <p>Taxonomies were constructed in the following aspects: gender, connotation and
denotation, which can serve as a standard for other cases in which the use of this type
of hierarchy is required.</p>
      <p>We made and implemented algorithms that allow the retrieval of images by
relying on their contents and thereby contributing to solving the semantic labeling of
images, which is one of the goals of the CBIR methodology. The concept extraction
process and indexing between the facets and the works was designed and
subsequently implemented in AWK. This module can be used on any set of textual
descriptors. With regard to the construction of dynamic taxonomies and navigation,
the following operations were implemented: Zoom-In, Zoom-Out, Pivot, Shift, and
Query by Example (QBE).</p>
      <p>Dynamic taxonomies generated, unlike those reported in the literature, are not on a
closed domain. An example is the taxonomy denotation, in which there is an open
domain, associated with a thesaurus for expanding the domain of the facets obtained
from the analysis of unstructured text.</p>
      <p>Among the extensions and improvements that we have referred to are that of
incorporating: The Art &amp; Architecture Thesaurus ® developed by the Getty Research
Institute (GRI), and translated into Spanish by the Documentation Center of
Patrimonial Assets (under the Directorate of Libraries, Archives and Museums
Dibam), the Thesaurus Spanish Historical Heritage, the Multilingual Thesaurus of
Nations, and the Extended Multilingual Wordnet (University Alicante).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <article-title>Categories for the Description of Works of Art (CDWA), J</article-title>
          . Paul Getty Trust, College Art Association, http://www.getty.edu/research/conducting_research/standards/cdwa/index.
          <source>html (accessed 1 august</source>
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Baca</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Harpring</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lanzi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>McRae</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Whiteside</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          ., Eds. Cataloging Cultural Objects:
          <article-title>A Guide to Describing Cultural Works</article-title>
          and Their Images; American Library Association: Chicago,
          <year>2006</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Gilliland-Swetland</surname>
          </string-name>
          , Anne J. “Setting the Stage: Defining Metadata” in Introduction to Metadata: Pathways to Digital Information, Murtha Baca, ed. (Los Angeles: Getty Information Institute,
          <year>2000</year>
          ),
          <article-title>second edition</article-title>
          . 12 pp. Available at http://www.getty.edu/research/conducting_research/standards/intrometadata/ (
          <source>Third edition Summer</source>
          <year>2006</year>
          , accessed 1 august
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Concepción Pérez de Celis Herrero</surname>
          </string-name>
          , Gustavo Cossio Aguilar, Sistema de Gestión de Exposiciones: TESEO ,
          <source>VII Reunión del Arte Contemporáneo Grupo Español del International Institute of Conservation</source>
          , Museo Nacional Centro de Arte Reina Sofia - VEGAP ,
          <year>2006</year>
          , ISBN 84-8026-315-6.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>C. Pérez de Celis Herrero</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Cossio</surname>
            <given-names>Aguilar</given-names>
          </string-name>
          , R. Eguézabal Blanco ; Sistema di Catalogazione per il Museo Nacional de los Ferrocarrriles Mexicanos
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. 12th.
          <article-title>International Congress of the International Committee for the Conservation of the Industrial Heritage</article-title>
          , Roma, Italia 2006
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Lambe</given-names>
            <surname>Patrick</surname>
          </string-name>
          ,
          <article-title>Organising Knowledge: Taxonomies, Knowledge and Organisational Effectiveness</article-title>
          .
          <source>Chandos Publishing</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Sacco</surname>
            ,
            <given-names>G. M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Dynamic</given-names>
            <surname>Taxonomies</surname>
          </string-name>
          :
          <article-title>A Model for Large Information Bases</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>12</volume>
          ,
          <issue>2</issue>
          , p.
          <fpage>468</fpage>
          -
          <lpage>479</lpage>
          ,
          <year>2000</year>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Dynamic</given-names>
            <surname>Taxonomies</surname>
          </string-name>
          and Faceted Search Theory, Practice, and
          <source>Experience; The Information Retrieval Series</source>
          , Vol.
          <volume>25</volume>
          Sacco, Giovanni Maria; Tzitzikas, Yannis (Eds.)
          <year>2009</year>
          , XVII, 340 p.,
          <string-name>
            <surname>Hardcover</surname>
            <given-names>ISBN</given-names>
          </string-name>
          :
          <fpage>978</fpage>
          -3-
          <fpage>642</fpage>
          -02358-3
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Stoica</surname>
            <given-names>Emilia</given-names>
          </string-name>
          , Hearst Marti y Richardson Megan.
          <article-title>Automating Creation of Hierarchical Faceted Metadata Structures</article-title>
          .
          <source>In the proceedings of NAACL-HLT</source>
          ,
          <string-name>
            <surname>Rochester</surname>
            <given-names>NY</given-names>
          </string-name>
          ,
          <year>Abril 2007</year>
          . Available at: http://flamenco.berkeley.edu/papers/castanet.pdf.
          <source>(accessed 1 august</source>
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. Tolosa Gabriel y Bordignon Fernando.
          <article-title>Introducción a la Recuperación de Información - Conceptos, modelos y algoritmos básicos</article-title>
          . Universidad Nacional de Luján, Argentina.Available at: http://www.tyr.unlu.edu.ar/tallerIR/2008/docs/Introduccion-RI-v9f.
          <source>pdf (accessed 1 august</source>
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>El ABC del Arte del siglo</surname>
            <given-names>XX</given-names>
          </string-name>
          ,
          <source>Primera edición en español</source>
          <year>1999</year>
          , Editorial Phaidon Press Limited.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Leise</given-names>
            <surname>Fred</surname>
          </string-name>
          .
          <article-title>Using Faceted Classification to Assist Indexing</article-title>
          .
          <source>Contextual Analysis</source>
          ,
          <year>2003</year>
          . Disponible en: http://www.contextualanalysis.com/pub_usingfacets.
          <source>php (accessed 1 august</source>
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>