<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Baseline Results for the CLEF 2008 Medical Automatic Annotation Task</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Mark O. Gu ̈ld, Thomas M. Deserno Department of Medical Informatics, RWTH Aachen University</institution>
          ,
          <addr-line>Aachen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This work reports baseline results for the CLEF 2008 Medical Automatic Annotation Task (MAAT) by applying a classifier with a fixed parameter set to all tasks 2005 2008. The classifier performs a weighted combination of three distance and similarity measures operating on global image features: Scaled-down representations of the images are compared via metrics that model the typical variability in the image data, mainly translation, local deformation, and radiation dose. In addition, a distance measure based on texture features is used. For classification, a k nearest neighbor classifier is used. In 2008, the baseline classifier yields error scores of 170.34 and 182.77 for k=1 and k=5 when the full code is reported, which corresponds to error rates of 51.3% and 52.8% for 1-NN and 5-NN, respectively. Judging the relative increases of the number of classes and the error rates over the years, MAAT 2008 is estimated to be the most difficult in the four years.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Methods</title>
      <p>
        The content of one radiograph is represented by Tamura’s texture measures (TTM) proposed in
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and down-scaled representations of the original images, 32 × 32 and X × 32 pixels disregarding
and according to the original aspect ratio, respectively. Since these image icons maintain the
spatial intensity information, variabilities which are commonly found in a medical imagery are
modelled by the distance measure. These include radiation dose, global translation, and local
deformation. In particular, the cross-correlation function (CCF) that is based on Shannon, and
the image distortion model (IDM) from [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] are used.
      </p>
      <p>The single classifiers are combined within a parallel scheme, which performs a weighting of
the normalized distances obtained from the single classifiers Ci, and applies the
nearest-neighbordecision function C to the resulting distances:
dcombined(q, r) = X λi · d′i(q, r),</p>
      <p>i
d′i(q, r) =</p>
      <p>di(q, r)
Pr′∈R di(q, r′)
(1)
(2)
where 0 ≤ λi ≤ 1, Pi λi = 1 denotes the weight for the normalized distance di(q, r) obtained
from classifier Ci for a sample q and a reference r from the set of reference images, R. Values
0 ≤ si(q, r) ≤ 1 obtained from similarity measures are transformed via di(q, r) = 1 − si(q, r).</p>
      <p>The three content descriptors and their distance measures use the following parameters:
• TTM: texture histograms from down-scaled image (256 × 256), 384 bins, Jensen-Shannon
divergence as a distance measure,
• CCF: 32 × 32 icon, 9 × 9 translation window
• IDM: X × 32 icon, gradients, 5 × 5 window, 3 × 3 context
The weighting coefficients were set empirically during CLEF MAAT 2005: λIDM = 0.42, λCCF =
0.18, and λTTM = 0.4.
3</p>
    </sec>
    <sec id="sec-2">
      <title>Results</title>
      <p>A rough estimation of the task difficulty can be derived from the baseline error rates: Comparing
2005 and 2006, the number of classes increased by 103%, while the error rate only increased by
63% and 48% for 1-NN and 5-NN, respectively. This suggests that the task in 2006 was easier
than in 2005. Since the challenges in 2006 and 2007 use the same class definitions, the obtained
error rates are directly comparable and show a slightly reduced task difficulty in 2007. In 2008,
the number of classes increased by 70% compared to 2007, while the error rate increased by 157%
and 193%, respectively. The 2008 task can therefore be considered to be more difficult than the
2007 task. Applying the same estimation, the 2008 task is also found to be more difficult than the
2005 task, as the number of classes increased by 246% and the error rate increased by 286% and
257%, respectively.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Tamura</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mori</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamawaki</surname>
            <given-names>T.</given-names>
          </string-name>
          <article-title>Textural features corresponding to visual perception</article-title>
          .
          <source>IEEE Trans Syst Man Cybern B Cybern</source>
          <year>1978</year>
          ;
          <volume>8</volume>
          (
          <issue>6</issue>
          ):
          <fpage>460</fpage>
          -
          <lpage>73</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Keysers</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dahmen</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ney</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wein</surname>
            <given-names>BB</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            <given-names>TM</given-names>
          </string-name>
          .
          <article-title>A statistical framework for modelbased image retrieval in medical applications</article-title>
          .
          <source>J Elec Imaging</source>
          <year>2003</year>
          ;
          <volume>12</volume>
          (
          <issue>1</issue>
          ):
          <fpage>59</fpage>
          -
          <lpage>68</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Gu</given-names>
            <surname>¨ld</surname>
          </string-name>
          <string-name>
            <given-names>MO</given-names>
            ,
            <surname>Thies</surname>
          </string-name>
          <string-name>
            <given-names>C</given-names>
            ,
            <surname>Fischer</surname>
          </string-name>
          <string-name>
            <given-names>B</given-names>
            ,
            <surname>Lehmann</surname>
          </string-name>
          <string-name>
            <surname>TM</surname>
          </string-name>
          .
          <article-title>Content-based retrieval of medical images by combining global features</article-title>
          .
          <source>Lect Notes Comput Sci</source>
          <year>2006</year>
          ;
          <volume>4022</volume>
          :
          <fpage>702</fpage>
          -
          <lpage>11</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Gu</given-names>
            <surname>¨ld</surname>
          </string-name>
          <string-name>
            <given-names>MO</given-names>
            ,
            <surname>Thies</surname>
          </string-name>
          <string-name>
            <given-names>C</given-names>
            ,
            <surname>Fischer</surname>
          </string-name>
          <string-name>
            <given-names>B</given-names>
            ,
            <surname>Deserno</surname>
          </string-name>
          <string-name>
            <surname>TM</surname>
          </string-name>
          .
          <article-title>Baseline results for the ImageCLEF 2006 medical automatic annotation task</article-title>
          <source>Lect Notes Comput Sci</source>
          <year>2007</year>
          ;
          <volume>4730</volume>
          :
          <fpage>686</fpage>
          -
          <lpage>9</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Gu</given-names>
            <surname>¨ld</surname>
          </string-name>
          <string-name>
            <given-names>MO</given-names>
            ,
            <surname>Deserno</surname>
          </string-name>
          <string-name>
            <surname>TM</surname>
          </string-name>
          .
          <article-title>Baseline results for the CLEF 2007 medical automatic annotation task using global image features</article-title>
          .
          <source>Lect Notes Comput Sci</source>
          <year>2008</year>
          , in press
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>