<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>United Institute Of Informatics Problems at CLEF2009: Medical Image Retrieval Task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vassili Kovalev</string-name>
          <email>vassili.kovalev@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Igor Safonov</string-name>
          <email>igorsafonov@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexander Dmitruk</string-name>
          <email>dmitruk@newman.bas-net.by</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexander Prus</string-name>
          <email>prus@tut.by</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>United Institute of Informatics Problems</institution>
          ,
          <addr-line>Surganova st. 6, Minsk, 220012</addr-line>
          ,
          <country country="BY">Belarus</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes methods and results archived by our research group at the Cross Language Evaluation Forum (CLEF'2009). In our work we concentrated on the medical image retrieval task. All the attention was given to the retrieval based on nine visual topics only. No textual information was considered. Our goal was to develop and comparatively assess image descriptors for content based retrieval on the large medical image database. In addition to results of official relevance judgment, time- and computational efficiency of the algorithms has also been of our interest. An approach based on generalized co-occurrence matrices was used. The following elementary image characteristics have been utilized: signal intensity, absolute value of the intensity gradient and the angle between the gradient vectors in the pixel pairs.</p>
      </abstract>
      <kwd-group>
        <kwd>H</kwd>
        <kwd>3 [Information Storage and Retrieval]</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>1 Content Analysis and Indexing</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>3 Information Search and Retrieval</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>4 Systems and Software</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>7 Digital Libraries</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>2.1</p>
      <sec id="sec-1-1">
        <title>Image descriptors</title>
        <p>
          Generalized co-occurrence matrices [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] extend classical Haralick matrices [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] first introduced in
1973. Co-occurrence matrix for gray level image can be a matrix of relative frequencies P (i, k, d)
with which the pixels with gray levels i, k occur in the image by distance d apart (Fig. 1).
        </p>
        <p>As both variables take continuous values, the construction of such a histogram will require
their quantization. Depending on the range of relative distances considered and the number of
bins used, this step could be quite computationally intensive. However, it is not necessary to
consider a large range of distances. Depending on the task we want to perform, we may restrict
ourselves to only a small range of. If this distance value is small, the histogram will contain local
information concerning the described shape or texture. If the value is large, global information is
conveyed.</p>
        <p>Taking into account the extensive experience on the medical image analysis, classification
and recognition using these descriptors, the following elementary image characteristics have been
utilized:
• brightness (signal intensity);
• absolute value of the intensity gradient;
• the angle between the gradient vectors in the pixel pairs.</p>
        <p>
          As it was shown experimentally [
          <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
          ], together with the information on their variation in space
these elementary image characteristics may describe the image structure properly.
        </p>
        <p>A multidimensional co-occurrence matrix of a general form that combines all of above
characteristics and relationships looks as follows:</p>
        <p>W = kw (I(i), I(k), G(i), G(k), A(i, k), d(i, k))k ,
where
i = (xi, yi), k = (xk, yk) is an arbitrarily pair of pixels,
d(i, k) is the distance between them,
I(i), I(k) is the intensity of the pixels,
G(i), G(k) is the absolute values of brightness gradients,</p>
        <p>A(i, k) is the angle between the directions of the gradients.</p>
        <p>The color images were converted into gray images using:</p>
        <p>I = 0.3 × R + 0.59 × G + 0.11 × B
2.2</p>
      </sec>
      <sec id="sec-1-2">
        <title>Similarity measures</title>
        <p>The histograms/co-occurrence matrices that represent different images have to be compared and
their difference be expressed by a single number if possible. This can be achieved with the help of
a metric defined to measure the ”distance” between two histograms.</p>
        <p>As a similarity measure of the classes (topics), the following distances were tested:
• canberra metrics
• manhattan metrics
• and absolute value of correlation coefficient
n
dcanberra(X, Y ) = X |xi − yi| ,
i=1 xi + yi</p>
        <p>n
dmanhattan(X, Y ) = X |xi − yi|</p>
        <p>i=1
dcorrelation(X, Y ) =</p>
        <p>E(XY ) − E(X)E(Y )
pE(X2) − E2(X) pE(Y 2) − E2(Y )
where E is the expected value operator, X = (x1, x2, . . . , xn) and Y = (y1, y2, . . . , yn) are feature
vectors.</p>
        <p>Some visual topics could contain several images as a query, so we tried two types of distance
measures to the group of images:
dcentroid(X, Y ) = mean(xi)</p>
        <p>dmin(X, Y ) = min(xi).
and minimum distances</p>
        <p>Additionally, color control was used during the retrieval: if the query contained color/gray
images, corresponding images were moved at the top of the results.
3</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Experiments</title>
      <p>Initially, due to descriptors’ normalization, original images were scaled down:
• proportional rescaling to 320 pixels width;
• rescaling to 150 × 150 pixels;
• rescaling to 300 × 300 pixels.</p>
      <p>After that different types of descriptors were calculated and stored into binary files (Table 1).
In our experiments we have chosen 5–7 bins for every dimension of the co-occurrence matrix.
Average time for generation one image descriptor was about half of a second.</p>
      <sec id="sec-2-1">
        <title>Descriptor GGAD GGGD IIGGAD</title>
      </sec>
      <sec id="sec-2-2">
        <title>Number of elements 1944 2907 54432</title>
      </sec>
      <sec id="sec-2-3">
        <title>Database of descriptors 570Kb 852Kb 16Gb</title>
        <p>Various types of descriptors and metrics in combination with image rescaling, produced 42 runs.</p>
        <p>In our opinion, the best retrieval results on such a common database were achieved with
IIGGAD descriptors and minimum distance on manhattan metrics (Fig. 2). Thus only two runs
were submitted (Table 2)</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results and discussion</title>
      <p>As it was our first experience at this competition, we erroneously did not run mixed and semantic
topics. Thus, only 9 out of 25 topics were submitted. According to a general number of relevant
results our two runs can by ranked 3rd and 6th respectively out of 13 visual runs.</p>
      <p>Generalized co-occurrence matrices are intended to find visually similar images. As some topics
can by semantically the same but visually absolutely different, employment of textual information
has more potential. Despite all this, generalized co-occurrence matrices showed very promising
results as similarity measures of medical images.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>This work was partially supported by the ISTC grant B-1682.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Henning</given-names>
            <surname>Mu</surname>
          </string-name>
          <article-title>¨ller, Jayashree Kalpathy-Cramer, Ivan Eggel</article-title>
          , Steven Bedrick,
          <string-name>
            <given-names>Sa</given-names>
            ¨ıd Radhouani, Brian Bakke, Charles Kahn Jr.,
            <surname>William</surname>
          </string-name>
          <string-name>
            <surname>Hersh</surname>
          </string-name>
          ,
          <article-title>Overview of the CLEF 2009 medical image retrieval track</article-title>
          ,
          <source>CLEF working notes 2009</source>
          , Corfu, Greece,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Kovalev</surname>
            <given-names>V.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Petrou</surname>
            <given-names>M. Multidimensional</given-names>
          </string-name>
          <article-title>Co-occurrence Matrices for Object Recognition and Matching</article-title>
          ,
          <source>Graphical Models and Image Processing</source>
          , Vol.
          <volume>58</volume>
          , No. 3, May, pp.
          <fpage>187</fpage>
          -
          <lpage>197</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Haralick</surname>
            <given-names>R.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shanmugan</surname>
            <given-names>K.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dinstein</surname>
            <given-names>I.</given-names>
          </string-name>
          <article-title>Textural features for image classification</article-title>
          .
          <source>IEEE Transactions on Systems. Man and Cybernetics</source>
          . SMC-
          <volume>3</volume>
          (
          <issue>6</issue>
          ) (
          <year>1973</year>
          ). pp.
          <fpage>610</fpage>
          -
          <lpage>621</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Kovalev</surname>
            <given-names>V.A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kruggel F</surname>
          </string-name>
          .
          <article-title>Retrieving 3D MRI Brain Images</article-title>
          , In: Information Retrieval and
          <article-title>Exploration in Large Medical Image Collections (VISIM'01</article-title>
          /MICCAI'01),
          <year>Oct 2001</year>
          ,
          <article-title>Utrecht, The Netherlands, Smeulders</article-title>
          <string-name>
            <surname>A.W.M.</surname>
          </string-name>
          (Ed.), pp.
          <fpage>53</fpage>
          -
          <lpage>56</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Kovalev</surname>
            <given-names>V.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Volmer</surname>
            <given-names>S. Color</given-names>
          </string-name>
          <article-title>Co-occurrence Descriptors for Querying-by-</article-title>
          <string-name>
            <surname>Example</surname>
          </string-name>
          ,
          <source>Int. Conf. on Multimedia Modelling</source>
          , Oct.
          <volume>12</volume>
          -
          <fpage>15</fpage>
          , Lausanne, Switzerland, IEEE Comp. Society Press, pp.
          <fpage>32</fpage>
          -
          <lpage>38</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>