<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yih-Chen Chang</string-name>
          <email>ycchang@nlg.csie.ntu.edu.tw</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hsin-Hsi Chen</string-name>
          <email>hhchen@csie.ntu.edu.tw</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Information Engineering National Taiwan University Taipei</institution>
          ,
          <country country="TW">Taiwan</country>
        </aff>
      </contrib-group>
      <fpage>2</fpage>
      <lpage>7</lpage>
      <abstract>
        <p>Two kinds of intermedia are explored in ImageCLEFphoto2006. The approach of using a word-image ontology maps images to fundamental concepts in an ontology and measures the similarity of two images by using the kind-of relationships of the ontology. The approach of using an annotated image corpus maps images to texts describing the concepts in the images, and the similarity of two images is measured over the text counterparts using BM25. The official runs show that the visual query and the intermedia are useful. Comparing the runs using the textual query only with the runs merging the textual and visual queries, the latter improved the performance of the former by 71%~119%. Even when the example images were removed from the image collection beforehand, the performance still improved by about 21%~43%.</p>
      </abstract>
      <kwd-group>
        <kwd>Cross language image retrieval</kwd>
        <kwd>cross-media translation</kwd>
        <kwd>image annotation</kwd>
        <kwd>ontology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>In recent years, many methods (Clough, Sanderson, &amp; Müller, 2005; Clough, Müller, Deselaers, Grubinger, Lehmann, Jensen, &amp; Hersh, 2006) have been proposed to explore visual information to improve the performance of cross-language image retrieval. The challenging issue is the semantic difference between visual and textual information. For example, using the visual information “red circle” may retrieve noise images containing “red flower”, “red balloon”, “red ball”, and so on, rather than the desired ones containing “sun”. The semantic difference between the visual concept “red circle” and the textual symbol “sun” is called a semantic gap. Some approaches conducted text- and content-based image retrieval separately and then merged the results of the two runs (Besançon et al., 2005; Jones et al., 2005; Lin, Chang &amp; Chen, forthcoming). Content-based image retrieval may suffer from the semantic gap problem and report noise images, which may have negative effects on the final performance. Some other approaches (Lin, Chang &amp; Chen, forthcoming) learned the relationships between visual and textual information and used the relationships for media transformation; the final retrieval performance then depends on the relationship mining. In this paper, we use two intermedia approaches to deal with the semantic gap. The main characteristic of these approaches is that human knowledge is embedded in the intermedia and can be used to compute the semantic similarity of images. A word-image ontology and an annotated image corpus are explored and compared. Section 2 specifies how to build and use the word-image ontology. Section 3 deals with the uses of the annotated corpus. Sections 4 and 5 show and discuss the official runs in ImageCLEFphoto2006.</p>
    </sec>
    <sec id="sec-2">
      <title>Building the Ontology</title>
      <p>A word-image ontology is a word ontology in which related images are aligned with each node. Building such an
ontology manually is time consuming. In ImageCLEFphoto2006, the image collection has 20,000 color
images, of which 15,998 contain English captions in the &lt;TITLE&gt; and &lt;DESCRIPTION&gt; fields. The
vocabulary includes more than 8,000 different words; thus an ontology with only hundreds of words is not enough.</p>
      <p>Instead of creating a new ontology from the beginning, we extend WordNet, the well-known word
ontology, to a word-image ontology. In WordNet, different senses and relations are defined for each word. For
simplicity, we only consider the first two senses and kind-of relations in the ontology. Because our experiments
in ImageCLEF2004 (Lin, Chang &amp; Chen, 2006) showed that verbs and adjectives are less appropriate to be
represented as visual features, we only used nouns here.</p>
      <p>
        Before aligning images and words, we selected the nouns appearing in both WordNet and the image collection based
on their TF-IDF scores. For each selected noun, we used Google image search to retrieve 60 images from the
web. Two problems may arise with the returned images: (1) some of them may be unrelated, and (2) the related
images may not be pure enough, i.e., the focus may be in the background or there may be other things in the
images. Zinger (2005) tried to deal with this problem by using visual features to cluster the retrieved images and
filtering out those images not belonging to any cluster. Unlike Zinger (2005), we employed textual features. For
each retrieved image, Google returns a short snippet. We filter out those images whose snippets do not
exactly match the query terms. The basic idea is: “the more things a snippet mentions, the more complex the
image is.” Finally, we get a word-image ontology with 11,723 images in 2,346 nodes.
      </p>
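      <p>For illustration only, the following minimal sketch shows how the snippet-based filter could be realized, assuming a list of (image URL, snippet) pairs has already been fetched for a query noun; the function names and the exact matching rule are our assumptions, not details given in the paper.</p>
      <preformat>
import re

def snippet_matches(snippet, query_noun):
    # Our reading of "exactly match": after tokenization, the snippet mentions
    # essentially nothing but the query noun (a complex snippet suggests a complex image).
    tokens = re.findall(r"[a-z]+", snippet.lower())
    return bool(tokens) and set(tokens) &lt;= {query_noun.lower()}

def filter_images(results, query_noun):
    # results: list of (image_url, snippet) pairs returned by the image search
    return [url for url, snippet in results if snippet_matches(snippet, query_noun)]
      </preformat>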
    </sec>
    <sec id="sec-3">
      <title>Using the Ontology</title>
    </sec>
    <sec id="sec-4">
      <title>2.2.1 Similarity Scoring</title>
      <p>Each image contains several fundamental concepts specified in the word-image ontology. The similarity of two
images is measured by the sum of the similarities of their fundamental concepts. In this paper we use the kind-of
relations to compute the semantic distance between fundamental concepts A and B in the word-image ontology.
First, we find the least common ancestor (LCA) of A and B. The distance between A and B is the length of the
path from A through the LCA to B: the more nodes that must be traversed from A to B, the larger the distance is.
In addition to the path length, we also consider the weighting of the links in a path, as follows.</p>
      <p>
        (1) When computing the semantic distance between a node A and its child B, we consider the number of children
of A. The more children A has, the larger the distance between A and B is. In the extreme case where A has only one
child B, the distance between A and B is 0. Let #children(A) denote the number of children of A, and
base(A) denote the basic distance between A and its children. We define base(A) to be log(#children(A)).
      </p>
      <p>
        (2) When computing the semantic distance between a node A and its siblings, we consider the level at which A is located.
Assume B is a child of A. If A and B have the same number of siblings, then the distance between A and its
siblings is larger than that between B and its siblings. Let level(A) be the depth of node A, and assume the level
of the root is 0. The distance between node A and its child, denoted by dist(A), is defined to be C^level(A) × base(A).
      </p>
      <sec id="sec-4-1">
        <title>Here C is a constant between 0 and 1. In this paper, C is set to 0.9.</title>
        <p>If the shortest path between two different nodes N0 and Nm is N0, N1, …,NLCA, …, Nm-1, Nm, we define the
distance between N0 and Nm to be:</p>
        <p>m-1
dist(N 0 , N m ) dist(N LCA ) dist(N i )</p>
        <p>i1
The larger the distance of two nodes is, the less similar the nodes are.</p>
      </sec>
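      <p>As an illustration, the following is a minimal sketch of the weighted path distance over the kind-of hierarchy; it assumes a simple in-memory tree of nodes (the Node class and helper names are ours) and is not the authors' implementation.</p>
      <preformat>
import math

C = 0.9  # level-discount constant

class Node:
    def __init__(self, name, parent=None):
        self.name, self.parent, self.children = name, parent, []
        if parent is not None:
            parent.children.append(self)

def level(n):   # depth of the node; the root has level 0
    return 0 if n.parent is None else 1 + level(n.parent)

def base(n):    # basic distance between a node and its children: log(#children)
    return math.log(len(n.children)) if n.children else 0.0

def dist(n):    # distance between a node and its child: C^level(n) * base(n)
    return (C ** level(n)) * base(n)

def path_to_root(n):
    path = [n]
    while n.parent is not None:
        n = n.parent
        path.append(n)
    return path

def semantic_distance(a, b):
    # dist(N0, Nm) = dist(N_LCA) + sum of dist(Ni) over the inner nodes N1, ..., Nm-1
    if a is b:
        return 0.0
    pa, pb = path_to_root(a), path_to_root(b)
    lca = next(n for n in pa if n in pb)                    # least common ancestor
    inner = pa[1:pa.index(lca) + 1] + pb[1:pb.index(lca)]   # N1, ..., N_LCA, ..., Nm-1
    return dist(lca) + sum(dist(n) for n in inner)
      </preformat>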
    </sec>
    <sec id="sec-5">
      <title>2.2.2 Mapping into the Ontology</title>
      <p>Before computing the semantic distance between two given images, we need to map the two images into nodes of
the ontology. In other words, we have to find the fundamental concepts of which the two images consist. A CBIR
system is adopted: it regards an image as a visual query and retrieves the top k fundamental images (i.e.,
fundamental concepts) in the word-image ontology. In such a case, we have two sets of nodes S1={n11, n12,
n13, …, n1k} and S2={n21, n22, n23, …, n2k}, which correspond to the two images, respectively. We define the
following formula to compute the semantic distance:</p>
      <p>SemanticDistance(S1, S2) = Σ_{i=1}^{k} min_{j=1,…,k} dist(n1i, n2j)</p>
      <p>Given a query with m example images, we regard each example image Q as an independent visual query
and compute the semantic distance between Q and the images in the collection. Note that we determine which
concepts an image in the collection is composed of before retrieval. After m image retrievals, each image in the
collection has been assigned m scores based on the above formula. We choose the best score for each image and
sort all the images to create a ranked list. Finally, the top 1000 images in the ranked list are reported.</p>
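      <p>The following sketch shows how the set-level distance and the final ranking described above could be computed; semantic_distance is the node distance defined in Section 2.2.1, and cbir_top_k is a hypothetical function that returns the top k fundamental concepts (ontology nodes) for an image.</p>
      <preformat>
def set_distance(s1, s2):
    # SemanticDistance(S1, S2) = sum over n1 in S1 of the minimum dist(n1, n2) over n2 in S2
    return sum(min(semantic_distance(n1, n2) for n2 in s2) for n1 in s1)

def rank_collection(example_images, collection, cbir_top_k, k=10, top_n=1000):
    # Map every collection image to its fundamental concepts once, before retrieval.
    mapped = {img: cbir_top_k(img, k) for img in collection}
    best = {}
    for q in example_images:                 # each example image is an independent visual query
        s1 = cbir_top_k(q, k)
        for img, s2 in mapped.items():
            score = set_distance(s1, s2)
            best[img] = min(best.get(img, float("inf")), score)   # keep the best score per image
    return sorted(collection, key=lambda img: best[img])[:top_n]  # report the top 1000 images
      </preformat>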
    </sec>
    <sec id="sec-6">
      <title>An Approach of Using an Annotated Image Corpus</title>
      <p>An annotated image corpus is a collection of images along with their text annotations. The text annotation
specifies the concepts and their relationships in the images. To measure the similarity of two images, we have to
know how many common concepts there are in the two images. An annotated image corpus can be regarded as
a reference corpus. For the two images to be compared, we submit each of them as a CBIR query to the reference corpus. The
corresponding text annotations of the retrieved images are postulated to contain the concepts embedded in the
two images, and the similarity of the text annotations indirectly measures the similarity of the two images.</p>
      <p>The image collection in ImageCLEFphoto2006 can itself be considered a reference annotated image corpus.
Using the image collection itself as the intermedia has some advantages: it is not necessary to map the images in the image
collection into the intermedia, and the domain can be more closely restricted to peregrine pictures. In the
experiments, the &lt;DESCRIPTION&gt;, &lt;NOTE&gt;, and &lt;LOCATION&gt; fields in English form the annotated corpus.</p>
      <p>To use the annotated image corpus as the intermedia to compute the similarity between the example images and
the images in the image collection, we need to map these images into the intermedia. Since we use the image collection
itself as the intermedia, we only need to map the example images in this work. An example image is treated as a
visual query and submitted to a CBIR system to retrieve images in the intermedia. The text
counterparts of the top k returned images form a long text query, which is submitted to an Okapi system to
retrieve images in the image collection. The BM25 formula measures the similarity between the example images and the
images in the image collection.</p>
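      <p>A rough sketch of this pipeline is given below. It assumes a cbir_search function that returns the top k images of the reference corpus for a visual query, an annotations dictionary holding the English &lt;DESCRIPTION&gt;, &lt;NOTE&gt;, and &lt;LOCATION&gt; text of each corpus image, and a bm25_search function over an Okapi index of the image collection; all of these names are illustrative.</p>
      <preformat>
def visual_to_text_query(example_image, cbir_search, annotations, k=10):
    # Turn a visual query into a long text query via the annotated reference corpus.
    top_images = cbir_search(example_image, k)      # CBIR over the reference corpus
    return " ".join(annotations[img] for img in top_images)

def retrieve_with_visual_query(example_image, cbir_search, annotations, bm25_search, top_n=1000):
    text_query = visual_to_text_query(example_image, cbir_search, annotations)
    return bm25_search(text_query, top_n)           # Okapi BM25 over the image collection
      </preformat>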
    </sec>
    <sec id="sec-7">
      <title>Experiments</title>
      <p>
        In the formal runs, we submitted 25 cross-lingual runs for eight different query languages. All the queries in
different source languages were translated into the target language (i.e., English) using the SYSTRAN system. We
considered several issues, including (1) using different intermedia approaches (i.e., the word-image ontology and
the annotated image corpus), and (2) using or not using the visual query. In addition, we also submitted 4
monolingual runs which compared (1) the annotation in English and in German, and (2) using or not using the visual
query and intermedia. Finally, we submitted a run using the visual query and intermedia only. The details of our runs
are described as follows:
(1) 8 cross-lingual, text query only runs:
NTU-PT-EN-AUTO-NOFB-TXT, NTU-RU-EN-AUTO-NOFB-TXT,
NTU-ES-EN-AUTO-NOFB-TXT, NTU-ZHT-EN-AUTO-NOFB-TXT,
NTU-FR-EN-AUTO-NOFB-TXT, NTU-JA-EN-AUTO-NOFB-TXT,
NTU-IT-EN-AUTO-NOFB-TXT, and NTU-ZHS-EN-AUTO-NOFB-TXT.
      </p>
      <p>
        These runs are regarded as baselines and are compared with the runs using both textual and visual
information.
(2) 2 monolingual, text query only runs:
NTU-EN-EN-AUTO-NOFB-TXT, and NTU-DE-DE-AUTO-NOFB-TXT.
      </p>
      <p>
        These two runs serve as baselines to compare with the cross-lingual runs using the text query only, and to
compare with the runs using both textual and visual information.
(3) 1 visual query only run with the approach of using an annotated image corpus:
NTU-AUTO-FB-TXTIMG-WEprf.
      </p>
      <p>
        This run is merged with the runs using the textual query only, and is also a baseline to compare
with the runs using both visual and textual queries.
(4) 8 cross-lingual runs, using both textual and visual queries with the annotated image corpus approach:
NTU-PT-EN-AUTO-FB-TXTIMG-T-WEprf, NTU-RU-EN-AUTO-FB-TXTIMG-T-WEprf,
NTU-ES-EN-AUTO-FB-TXTIMG-T-WEprf, NTU-FR-EN-AUTO-FB-TXTIMG-T-WEprf,
NTU-ZHS-EN-AUTO-FB-TXTIMG-T-WEprf, NTU-JA-EN-AUTO-FB-TXTIMG-T-WEprf,
NTU-ZHT-EN-AUTO-FB-TXTIMG-T-Weprf, and NTU-IT-EN-AUTO-FB-TXTIMG-T-Weprf.
These runs merge the textual query only runs in (1) and the visual query only run in (3) with equal
weight.
(5) 8 cross-lingual runs, using both textual and visual queries with the approach of using the word-image
ontology:
NTU-PT-EN-AUTO-NOFB-TXTIMG-T-IOntology,
NTU-RU-EN-AUTO-NOFB-TXTIMG-T-IOntology,
NTU-ES-EN-AUTO-NOFB-TXTIMG-T-IOntology,
NTU-FR-EN-AUTO-NOFB-TXTIMG-T-IOntology,
NTU-ZHS-EN-AUTO-NOFB-TXTIMG-T-IOntology,
NTU-JA-EN-AUTO-NOFB-TXTIMG-T-IOntology,
NTU-ZHT-EN-AUTO-NOFB-TXTIMG-T-IOntology, and
NTU-IT-EN-AUTO-NOFB-TXTIMG-T-IOntology.
      </p>
      <p>
        These runs merge the textual query only runs in (1) and the visual query runs with weights 0.9 and 0.1, respectively.
(6) 2 monolingual runs, using both textual and visual queries with the annotated image corpus approach:
NTU-EN-EN-AUTO-FB-TXTIMG, and NTU-DE-DE-AUTO-FB-TXTIMG.
These two runs use both textual and visual queries. The monolingual runs in (2) and the visual run
in (3) are merged with equal weight.
      </p>
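      <p>For reference, the sketch below illustrates the linear merging used to combine a text run and a visual run (equal weights in (4) and (6), weights 0.9 and 0.1 in (5)). The runs are assumed to be given as image-to-score dictionaries with comparable score ranges; any score normalization applied before merging is our assumption and is not specified here.</p>
      <preformat>
def merge_runs(text_scores, visual_scores, w_text=0.5, w_visual=0.5, top_n=1000):
    # Linearly combine two runs and return the top_n image ids by merged score.
    merged = {}
    for img in set(text_scores) | set(visual_scores):
        merged[img] = (w_text * text_scores.get(img, 0.0)
                       + w_visual * visual_scores.get(img, 0.0))
    return sorted(merged, key=merged.get, reverse=True)[:top_n]
      </preformat>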
    </sec>
    <sec id="sec-8">
      <title>Results and Discussions</title>
      <sec id="sec-8-1">
        <title>NTU-JA-EN-AUTO-NOFB-TXTIMG-T-IOntology</title>
        <p>Since the example images in this task are in the image collection, the CBIR system always correctly maps
the example images into themselves at mapping step. We made some extra experiments to examine the
performance of our intermedia approach. In the experiments, we took out the example images from the image
collection when mapping example images into intermedia. Table 2 shows the experiment results. Although the
performance of Table 2 is lower than that of Table 1, the runs using the approach of annotated image corpus are
still better than the runs using textual query only.</p>
      </sec>
      <sec id="sec-8-2">
        <title>NTU-PT-EN-AUTO-NOFB-TXT</title>
      </sec>
      <sec id="sec-8-3">
        <title>NTU-RU-EN-AUTO-NOFB-TXT</title>
      </sec>
      <sec id="sec-8-4">
        <title>NTU-ES-EN-AUTO-NOFB-TXT NTU-FR-EN-AUTO-NOFB-TXT 0.1228</title>
      </sec>
      <sec id="sec-8-5">
        <title>NTU-ZHT-EN-AUTO-NOFB-TXTIMG-T-IOntology Text Only</title>
      </sec>
      <sec id="sec-8-6">
        <title>NTU-IT-EN-AUTO-NOFB-TXT Text + Annotations</title>
      </sec>
      <sec id="sec-8-7">
        <title>NTU-IT-EN-AUTO-FB-TXTIMG-T-Weprf Text + Ontology</title>
      </sec>
      <sec id="sec-8-8">
        <title>NTU-IT-EN-AUTO-NOFB-TXTIMG-T-IOntology</title>
      <p>Table 3 shows the results of the monolingual runs. The runs using both textual and visual queries are still
better than the runs using the textual query only. The performance of the runs in which the example images were taken out of the
collection beforehand is still better than that of the runs using the textual query only. Comparing with Tables 1 and 2, we
find that some cross-lingual runs using both textual and visual queries are even better than the monolingual runs using the
textual query only. This shows that visual information is very important in cross-language image retrieval.</p>
      <p>Table 4 shows the results of the runs using the visual query only. When the example images were kept in the
image collection, they were always mapped to the right images; therefore, the translation from
visual information into textual information was more accurate. The experiment shows that the performance of the
visual query runs is better than that of the textual query runs when the transformation is correct.</p>
      <sec id="sec-8-9">
        <title>Text + Annotations</title>
      </sec>
      <sec id="sec-8-10">
        <title>NTU-FR-EN-AUTO-FB-TXTIMG-T-WEprf-NoE</title>
      </sec>
      <sec id="sec-8-11">
        <title>NTU-IT-EN-AUTO-NOFB-TXT</title>
      </sec>
      <sec id="sec-8-12">
        <title>NTU-ZHS-EN-AUTO-NOFB-TXT</title>
      </sec>
      <sec id="sec-8-13">
        <title>NTU-JA-EN-AUTO-NOFB-TXT</title>
      </sec>
      <sec id="sec-8-14">
        <title>NTU-ZHT-EN-AUTO-NOFB-TXT</title>
      </sec>
      <sec id="sec-8-15">
        <title>Text + Annotations</title>
      </sec>
      <sec id="sec-8-16">
        <title>NTU-ZHS-EN-AUTO-FB-TXTIMG-T-Weprf-NoE</title>
      </sec>
      <sec id="sec-8-17">
        <title>Text + Annotations</title>
      </sec>
      <sec id="sec-8-18">
        <title>NTU-JA-EN-AUTO-FB-TXTIMG-T-Weprf-NoE</title>
      </sec>
      <sec id="sec-8-19">
        <title>Text + Annotations</title>
      </sec>
      <sec id="sec-8-20">
        <title>NTU-ZHT-EN-AUTO-FB-TXTIMG-T-Weprf-NoE</title>
      </sec>
      <sec id="sec-8-21">
        <title>Text + Annotations</title>
      </sec>
      <sec id="sec-8-22">
        <title>NTU-IT-EN-AUTO-FB-TXTIMG-T-Weprf-NoE</title>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Conclusion</title>
      <p>The experiments show that the visual query and the intermedia approaches are useful. Comparing the runs using the textual
query only with the runs merging the textual and visual queries, the latter improved the performance
of the former by 71%~119%. Even when the example images were removed from the image collection, the
performance still improved by about 21%~43%. We find that the visual query is important in image retrieval. The
performance of the runs using the visual query only can even be better than that of the runs using the textual query only
if the visual information is translated into textual information correctly. The word-image ontology built automatically
this year still contains much noise. We will investigate how to filter out the noise and explore different methods.</p>
    </sec>
    <sec id="sec-10">
      <title>Acknowledgments</title>
      <p>The research in this paper was partially supported by the National Science Council, Taiwan, under the contracts
NSC94-2213-E-002-076 and NSC95-2752-E-001-001-PAE.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Besançon</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hède</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moellic</surname>
            ,
            <given-names>P.A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Fluhr</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Cross-media feedback strategies: Merging text and image information to improve image retrieval</article-title>
          . In Peters, C.;
          <string-name>
            <surname>Clough</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Jones,
          <string-name>
            <given-names>G.J.F.</given-names>
            ;
            <surname>Kluck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ;
            <surname>Magnini</surname>
          </string-name>
          ,
          <string-name>
            <surname>B</surname>
          </string-name>
          . (Eds.),
          <source>Proceedings of 5th Workshop of the Cross-Language Evaluation Forum, LNCS</source>
          <volume>3491</volume>
          , (pp.
          <fpage>709</fpage>
          -
          <lpage>717</lpage>
          ). Berlin: Springer.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Clough</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sanderson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>The CLEF 2004 cross language image retrieval track</article-title>
          . In Peters, C.;
          <string-name>
            <surname>Clough</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Jones,
          <string-name>
            <given-names>G.J.F.</given-names>
            ;
            <surname>Kluck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ;
            <surname>Magnini</surname>
          </string-name>
          ,
          <string-name>
            <surname>B</surname>
          </string-name>
          . (Eds.),
          <source>Proceedings of 5th Workshop of the Cross-Language Evaluation Forum, LNCS</source>
          <volume>3491</volume>
          , (pp.
          <fpage>597</fpage>
          -
          <lpage>613</lpage>
          ). Berlin: Springer.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Clough</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deselaers</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grubinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>T.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hersh</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>The CLEF 2005 cross-language image retrieval track</article-title>
          ,
          <source>Proceedings of 6th Workshop of the Cross Language Evaluation Forum, Lecture Notes in Computer Science</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>G.J.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Groves</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khasin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lam-Adesina</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mellebeek</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Way</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2005</year>
          ). Dublin City University at CLEF 2004:
          <article-title>Experiments with the ImageCLEF St Andrew's Collection</article-title>
          . In Peters, C.;
          <string-name>
            <surname>Clough</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Jones,
          <string-name>
            <given-names>G.J.F.</given-names>
            ;
            <surname>Kluck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ;
            <surname>Magnini</surname>
          </string-name>
          ,
          <string-name>
            <surname>B</surname>
          </string-name>
          . (Eds.),
          <source>Proceedings of 5th Workshop of the Cross-Language Evaluation Forum, LNCS</source>
          <volume>3491</volume>
          , (pp.
          <fpage>653</fpage>
          -
          <lpage>663</lpage>
          ). Berlin: Springer.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>W.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>Y.C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.H.</given-names>
          </string-name>
          (forthcoming).
          <article-title>Integrating Textual and Visual Information for Cross-Language Image Retrieval: A Trans-Media Dictionary Approach</article-title>
          .
          <source>Information Processing and Management, Special Issue on Asia Information Retrieval Research</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Zinger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Extracting an Ontology of Portable Objects from WordNet</article-title>
          .
          <source>Proceedings of the MUSCLE/ImageCLEF Workshop on Image and Video Retrieval Evaluation</source>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>