<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Landmark-based Feature Tracking for Endoscopic Motion Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>S. Friedl</string-name>
          <email>sven.friedl@iis.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>B. Morgus</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. Kage</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>C. Münzenmayer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>T. Wittenberg</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>T. Bergen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Fraunhofer Institute for Integrated Circuits IIS</institution>
          ,
          <addr-line>Erlangen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University Hospital Erlangen</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2011</year>
      </pub-date>
      <fpage>57</fpage>
      <lpage>60</lpage>
      <abstract>
        <p>Automated image analysis and interpretation within computer assisted minimally invasive surgery (MIS) most often depend and rely on manually defined landmarks, visible in endoscopic views. More specific, within many types of applications, such landmarks must be tracked automatically during the intervention. Typical feature tracking approaches are able to track slightly changing landmarks over time, as they occur in endoscopic image sequences, but are originally most often designed to track automatically detected salient points. In this contribution an approach is presented, where the advantages of feature descriptors and corresponding matchers can be used to track manually defined landmarks. Based on such initiated landmark points, local feature detection and tracking utilizing SURF or KLT features as descriptors, is executed. Within the region of interest as a constraint, movements of the detected features can be used to approximate the original landmark movements.</p>
      </abstract>
      <kwd-group>
        <kwd>Feature Tracking</kwd>
        <kwd>Anatomical Landmarks</kwd>
        <kwd>SURF</kwd>
        <kwd>KLT</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Problem</title>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>
        Robust feature descriptors and the corresponding matching algorithms are well known and established methods to
identify and track prominent identical points in sequences of consecutive image frames. These methods have been optimized
to track feature points with specific characteristics in monocular image sequences. In contrast, manually initiated
landmarks for tracking are unlikely to coincide with optimal (in the sense of feature tracking) features. Thus, depending on
the visibility of interesting anatomical structures or landmarks and their movement and velocity within an endoscopic
image sequence, a region of interest (ROI) can be defined around a manually initiated landmark point. As one side
condition the ROIs must be defined sufficiently small and in such a way that clearly visible landmarks and the interrelated
anatomy lies within the region and independent movements of adjacent ROIs do not interfere with each other.
Additionally, the ROIs have to be large enough to cover the possible displacements of consecutive image frames based on the
occurring movements. Now, within each manually initiated region, local feature detection and tracking is applied using
well-established feature tracking approaches, which are promising for clinical (real-time) applications: the
KanadeLucas-Tomasi (KLT) feature tracker [
        <xref ref-type="bibr" rid="ref9">9,10</xref>
        ] and speeded up robust features (SURF) [
        <xref ref-type="bibr" rid="ref10">11</xref>
        ]. SURF tracking is realized by
descriptor comparison within the ROI.
      </p>
      <p>Within our evaluation framework, these matching algorithms are applied to the manually selected and initiated ROIs to
recognize corresponding feature points in consecutive frames. Assuming that detected feature points vary from the given
landmarks, but still describe the surrounding anatomic region of interest, the resulting movements can be used to
estimate the movement of the original landmark.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>To evaluate the proposed approach, it has been applied to five monocular endoscopic video image sequences of different
organs. These image sequences cover the surface of a human liver, the view into the human bladder, tissue of the colon,
a native and as well as an artificial heart valve, cf. Figure 1. The movements within those sequences vary from almost
pure translation of the endoscope (e.g. liver) up to complex deformations of the organ (e.g. heart valves). To prove the
principal concept and the correctness of the implementation, a well-behaving test sequence with only one moving item (a
ball-pen) was recorded. Specifically, this rigid object being different from the organic structures and being displaced
over time with a constant and stable background has been chosen to be independent from application related restrictions
in the tracking task, where several movements of landmarks, organs and tissue background are overlaying each other. As
ground truth data for comparison and evaluation, the selected landmarks have been labeled manually in all frames of the
sequences and the corresponding coordinates have been stored. Due to the intention to estimate the movement of a
certain anatomical structure and not to track the exact landmark, the spatial distance between the ground truth points and
the tracked features has not been considered as a meaningful error measure. Instead, the movement m of the tracked
Points Tr in frame i, relative to the initial key frame, is determined as
For each of the endoscopic image sequences, the movement of the ground truth (GT) was compared to the detected
motion using both the SURF as well as the KLT descriptors. The resulting trajectories of all six sequences are shown in
Figure 2. Ideal tracking with respect to the manually labeled ground truth data would result in identical trajectories. The
more parallel the trajectories of the tracking approaches are, compared to the trajectory of the ground truth, the better
the result can be regarded.
(d) (e) (f)
Figure 2: Trajectories of the movements for the six evaluated recordings for Speeded Up Robust Features (SURF, +),
Kanade-Lucas-Tomasi (KLT, ×), and the annotated ground truth (GT, *): ball-pen test sequence (a), view inside a
bladder (b), surface of a liver (c), native heart valve (d), artificial heart valve (e), and colon tissue (f).</p>
      <p>Ball pen
Bladder
Liver
Native heart valve
Artificial heart valve
Colon</p>
      <p>KLT
100 %
32 %
75 %</p>
      <p>4 %
25 %
13 %</p>
      <p>SURF
100 %
14 %
5 %
9 %
0 %
29 %
In addition, both tracking approaches were evaluated by manually rating the consistency of feature tracking. For each
sequence, we counted the number of frames, in which the organic structure (i.e. initially selected feature) was tracked
correctly, independently of the pixel based ground-truth distance. Table 1 depicts the results obtained.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>As can bee seen in Figure 2(a), as well as in Table 1, upper row, the movements within the reference ball-pen test
sequence could be tracked with only small deviations. Thus, the principle approach of exploiting robust feature descriptors
for manually defined and initiated landmarks seems to be promising. However, applying the proposed tracking method
to real medical endoscopic image data, the results differ. For the liver (Fig. 2(c)) and both heart valve sequences (Fig.
2(d+e)), the movements of the manually selected landmarks can only be roughly estimated. In the case of the bladder
(Fig. 2(b)), the difference between the ground truth and the tracked movement is increasing, while for the colon (Fig.
2(f)), the original movement is hardly identified in the tracking trajectories. Interestingly, in some sequences (bladder,
native heart valve) the KLT tracker yields closer results to the ground truth while in other sequences (colon, artificial
heart valve) the SURF features seem better. The rating depicted in Table 1 confirms these results. Although none of the
approaches yielded better results throughout all image sequences, the KLT approach showed better stability than the
SURF approach in most cases. Dealing with medical image data leads to various possible sources of error, e.g. low
contrast images with significant noise, substantial organ deformations, as well as structured surfaces and specular highlights.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Friedl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wittenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kondruweit</surname>
          </string-name>
          .
          <article-title>Interactive registration and visualization of cardiac video and angiography</article-title>
          .
          <source>In IFMBE Proc</source>
          . Vol.
          <volume>25</volume>
          /IV, World Congress on Med. Physics &amp; Biomedical Engineering, pp.
          <fpage>468</fpage>
          -
          <lpage>471</lpage>
          ,
          <year>2009</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ortmaier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gröger</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Hirzinger. Multisensorielle</surname>
          </string-name>
          <article-title>Schätzung der Herzbewegung in der minimal invasiven Chirurgie CURAC-Jahrestagung, October 4-5</article-title>
          ,
          <year>2002</year>
          , Leipzig - Germany
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ortmaier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Groeger</surname>
          </string-name>
          , and
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Hirzinger: Robust Motion Estimation in Robotic Surgery on the Beating Heart</article-title>
          .
          <source>Proc's Computer Assisted Radiology &amp; Surgery (CARS)</source>
          ,
          <source>June 26-29</source>
          ,
          <year>2002</year>
          , Paris - France, pp.
          <fpage>206</fpage>
          -
          <lpage>211</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wittenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Drechsler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kaltenbacher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Friedl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Reis</surname>
          </string-name>
          , G. Sakas,
          <string-name>
            <given-names>J.</given-names>
            <surname>Stallkamp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rotinat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perrot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kondruweit</surname>
          </string-name>
          . '
          <article-title>MISS heart': Assisting systems for minimal invasive smart suturing in cardiac surgery? A conceptually closed loop approach</article-title>
          .
          <source>In IFMBE Proc</source>
          . Vol.
          <volume>25</volume>
          /IV, World Congress Med. Physics &amp; Biomed. Eng., pp.
          <fpage>445</fpage>
          -
          <lpage>448</lpage>
          ,
          <year>2009</year>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.P.</given-names>
            <surname>Condurache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hahn</surname>
          </string-name>
          , U.G. Hofmann,
          <string-name>
            <given-names>M.</given-names>
            <surname>Scharfschwerdt</surname>
          </string-name>
          , Martin Misfeld,
          <string-name>
            <given-names>Til</given-names>
            <surname>Aach</surname>
          </string-name>
          .
          <article-title>Automatic measuring of quality criteria for heart valves</article-title>
          .
          <source>Med</source>
          . Imaging 2007:
          <article-title>Image Processing</article-title>
          ,
          <string-name>
            <surname>SPIE</surname>
          </string-name>
          , San Diego, CA,,
          <volume>17</volume>
          -
          <fpage>22</fpage>
          .2.2007
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wittenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cesnjevar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rupp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weyand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kondruweit</surname>
          </string-name>
          .
          <article-title>High-Speed-Camera Recordings and Image Sequence Analysis of Moving Heart-Valves: Experiments and First Results</article-title>
          . In T. Buzug,
          <string-name>
            <given-names>D.</given-names>
            <surname>Holz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Weber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bongartz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kohl-Bareis</surname>
          </string-name>
          , U. Hartmann (Eds),
          <source>Advances in Med</source>
          . Engineering,
          <source>Springer Proc's in Physics 114</source>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>174</lpage>
          . Workshop, 7.-
          <fpage>9</fpage>
          .3.2007 in Remagen, Springer, Heidelberg,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Bergen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruthotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Münzenmayer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rupp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Paulus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Winter</surname>
          </string-name>
          .
          <article-title>Feature-based real-time endoscopic mosaicking</article-title>
          .
          <source>In Proc. 6th International Symposium on Image and Signal Processing and Analysis</source>
          , pp.
          <fpage>695</fpage>
          -
          <lpage>700</lpage>
          ,
          <year>2009</year>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Bergen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Münzenmayer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Knödgen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Feussner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wittenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Winter</surname>
          </string-name>
          .
          <article-title>Echtzeit-Stitching endoskopischer Bilder für eine erweiterte Sicht in chirurgischen Eingriffen</article-title>
          .
          <source>Endoskopie Heute</source>
          ,
          <volume>24</volume>
          (
          <issue>1</issue>
          ):
          <fpage>60</fpage>
          -
          <lpage>61</lpage>
          ,
          <year>2011</year>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>B.D.</given-names>
            <surname>Lucas</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Kanade</surname>
          </string-name>
          .
          <article-title>An iterative image registration technique with an application to stereo vision</article-title>
          .
          <source>In Proc. 7th International Joint Conference on Artificial Intelligence</source>
          , pp.
          <fpage>674</fpage>
          -
          <lpage>679</lpage>
          ,
          <year>1981</year>
          [10
          <string-name>
            <given-names>J.</given-names>
            <surname>Shi</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Tomasi</surname>
          </string-name>
          .
          <article-title>Good features to track</article-title>
          .
          <source>In Proc IEEE Conference on Computer Vision and Pattern Recognition</source>
          , pp.
          <fpage>593</fpage>
          -
          <lpage>600</lpage>
          ,
          <year>1994</year>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Bay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ess</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tuytelaars</surname>
          </string-name>
          ,
          <string-name>
            <surname>L. Van Gool. SURF</surname>
          </string-name>
          :
          <article-title>Speeded Up Robust Features</article-title>
          .
          <source>Computer Vision and Image Understanding</source>
          , Vol.
          <volume>110</volume>
          , No.
          <issue>3</issue>
          , pp.
          <fpage>346</fpage>
          -
          <lpage>359</lpage>
          ,
          <year>2008</year>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>W.</given-names>
            <surname>Lau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ramey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Corso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Thakor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Hager</surname>
          </string-name>
          .
          <article-title>Stereo-Based Endoscopic Tracking of Cardiac Surface Deformation</article-title>
          . In C. Barillot,
          <string-name>
            <given-names>D.R.</given-names>
            <surname>Haynor</surname>
          </string-name>
          , and P. Hellier (Eds.):
          <source>MICCAI</source>
          <year>2004</year>
          , LNCS 3217, pp.
          <fpage>494</fpage>
          -
          <lpage>501</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>