<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Mean</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Automatic Multi-modal ToF/CT Organ Surface Registration</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kerstin Mu¨ller</string-name>
          <email>kerstin.mueller@informatik.uni-erlangen.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sebastian Bauer</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jakob Wasza</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joachim Hornegger</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Erlangen Graduate School in Advanced Optical Technologies</institution>
          ,
          <addr-line>SAOT</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Pattern Recognition Lab, Friedrich-Alexander University Erlangen-Nuremberg</institution>
        </aff>
      </contrib-group>
      <volume>4</volume>
      <issue>82</issue>
      <fpage>154</fpage>
      <lpage>158</lpage>
      <abstract>
        <p>In the eld of image-guided liver surgery (IGLS), the initial registration of the intra-operative organ surface with preoperative tomographic image data is performed on manually selected anatomical landmarks. In this paper, we introduce a fully automatic scheme that is able to estimate the transformation for initial organ registration in a multi-modal setup aligning intra-operative time-of- ight (ToF) with preoperative computed tomography (CT) data, without manual interaction. The method consists of three stages: First, we extract geometric features that encode the local surface topology in a discriminative manner based on a novel gradient operator. Second, based on these features, point correspondences are established and deployed for estimating a coarse initial transformation. Third, we apply a conventional iterative closest point (ICP) algorithm to re ne the alignment. The proposed method was evaluated for an open abdominal hepatic surgery scenario with invitro experiments on four porcine livers. The method achieved a mean distance of 4.82 0.79 mm and 1.70 0.36 mm for the coarse and ne registration, respectively.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Image-guided surgery techniques augment the anatomical expertise of the
surgeon with a patient-specific source of information by correlating the operative
field with preoperative tomographic image data. This allows the physician to
see the surgical probe position in relation to anatomical structures during the
procedure. Benefits of the integration of computer navigated tool guidance for
hepatic surgery are (i) the enhancement of the resection of subsurface targets and
avoidance of critical structures, (ii) improved outcomes due to reduced resection
margins, and (iii) an expansion of the spectrum of resectability [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The
essential step in image-guided surgery is the determination of a mapping between the
intra-operative presentation of the exposed organ (physical space) and the
patient anatomy available from pre-operatively acquired tomographic data (image
space). The current registration protocol for image guidance in hepatic surgery
is based on a landmark-based initial alignment. As a prerequisite, anatomical
fiducials are manually selected in the preoperative image sets prior to surgery [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
Then, during the procedure, the homologous physical-space locations are
digitized with a pen probe system. Last, the initial landmark-based registration is
refined by conventional rigid surface registration techniques [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In clinical
practice, manual fiducial selection by radiology experts is difficult, subjective and
time-consuming. In this paper, we introduce a fully automatic scheme that is
able to estimate the transformation for initial organ registration and ICP
initialization without any manual interaction. The evaluation considers a multi-modal
ToF/CT setup for IGLS, where the intra-operative surface data is acquired with
a ToF camera. The applicability of ToF technology for intra-operative surface
acquisition was investigated by Seitel et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], achieving promising results on a
variety of porcine organs.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Materials and Methods</title>
      <p>The proposed registration framework is composed of three stages (Fig. 1). First,
ToF and CT data are preprocessed and transformed into a surface mesh
representation. Second, we extract local surface feature descriptors from both data
sets. Third, in order to register the surfaces, point correspondences are
established by feature matching and a rigid body transformation is estimated. This
coarse registration can then be refined with conventional ICP variants.
2.1</p>
      <sec id="sec-2-1">
        <title>ToF/CT Surface Mesh Generation</title>
        <p>ToF imaging directly acquires metric 3D surface information in real-time with
a single sensor based on the phase shift between an actively emitted and the
reflected optical signal. The measurements of the ToF camera can be represented
as a set of points or vertices V = {vi}, i ∈ {1, . . . , w · h}, where vi ∈ R3 denotes
the vertex coordinates, w ×h the sensor resolution. In consideration of the
inherent noise in ToF range measurements and w.r.t. the trade-off between data
denoising and preservation of topological structure, we perform data preprocessing
in a way that gives priority to the topological reliability of the surface. In
particular, we combine temporal averaging with edge-preserving median and bilateral
filtering. In terms of segmentation, for the in-vitro experiments (Sect. 2.5), we
used a semi-automatic scheme. The organ is extracted by thresholding and
ToF Data</p>
        <p>CT Data
Registration
Refinement
(ICP)</p>
        <p>Denoising
Segmentation</p>
        <p>Initial
Transformation</p>
        <p>Estimation</p>
        <p>Segmentation</p>
        <p>Mesh</p>
        <p>Generation
Correspondence</p>
        <p>Search</p>
        <p>ToF Mesh</p>
        <p>CT Mesh
CT Descriptors
ToF Descriptors</p>
        <p>Feature
Extraction
the remaining point cloud triangulated, providing the intra-operative ToF
surface mesh. The anatomical reference surface is segmented from preoperative
CT data using a region growing framework with manual seed point placement.
Subsequently, we apply the marching cubes algorithm on the extracted binary
volumetric segmentation mask and eventually decimate the dense CT mesh.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Local Surface Descriptors</title>
        <p>
          In this section, we introduce a gradient operator that computes a numerical
differentiation of a geometric scalar field defined on a 2D manifold. Then, we
present descriptors that encode the spatial distribution and orientation of the
resulting gradient vector field within the local neighborhood of a mesh vertex.
Geometric CUSS Gradient Conventionally, in a 2D image, gradients are
computed by differentiating scalar data in two orthogonal directions. For 2D
manifolds, we propose a novel gradient operator ∇f (vi) that is based on a
circular uniform surface sampling (CUSS) technique being invariant to mesh
representation and density [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Given is a scalar field f (vi) that holds a 1D geometric
feature for every mesh vertex vi ∈ V, e.g. the signed distance of a point to
the best fitting plane of its neighborhood. In a first step, the tangent plane Ti
being defined by the corresponding normal ni is determined for the vertex vi.
Next, a circular uniform sampling of Ti is performed via rotating a reference
vector ai ∈ Ti, ||ai||2 = 1 around ni by the angles ϕs = s · N2πs , s ∈ {1, . . . , Ns},
yielding Rϕs ai. The circular sampling density is given by Ns, Rϕs denotes the
3 × 3 rotation matrix for ϕs. Scaling the vectors Rϕs ai with a sampling radius
rs provides a set P of points ps ∈ Ti, |P| = Ns
        </p>
        <p>P = {ps | ps = vi + rs · Rϕs ai}
Finally, the surface sampling is performed by intersecting the mesh with rays
that emerge from the points ps and are directed parallel to ni (Fig. 2(a)). The
intersection points are denoted ms, the scalar field value f (ms) is interpolated
w.r.t. the adjacent vertices. The CUSS gradient ∇f (vi) at the vertex vi can then
be expressed as
∇f (vi) =
1</p>
        <p>∑Ns f (ms) − f (vi)
Ns s=1 ||ms − vi||2
· Rϕs ai
(1)
(2)
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Descriptor Encoding</title>
        <p>
          For each vertex vi, the descriptor extracts the spatial distribution and
orientation of the CUSS gradient vector field (Fig. 2(b)) for the local set of vertices
Vˆ ⊂ V that reside within a spherical volume of interest. First, the gradient
vectors ∇f (vi) are projected onto the three planes of a local coordinate
system. It is spanned by the vertex normal ni and a second axis mi ∈ Ti pointing
into the dominant gradient direction. Second, for each plane, the projected
vectors are separated into circular segments and binned in polar histograms [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
The concatenation of these histograms yields the descriptor that is invariant to
translation and rotation but not invariant to scale, as we incorporate the metric
scale of the surface topology as an important characteristic.
2.4
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Correspondence-based Registration</title>
        <p>
          For the registration of ToF and CT data (V1,V2), the corresponding sets of local
descriptors D1,D2 are computed first. Based on these descriptor sets, point
correspondences are established between V1 and V2 by searching the mutual
best match with an Euclidean similarity metric. In order to eliminate false
correspondences, the set of point pairs is decimated by comparing all internal
pairwise distances of the two correspondence point sets [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The remaining set
of correspondences (Fig. 2(c)) is used to estimate a rigid body transformation.
Last, based on this initial transformation, the registration is refined with an ICP
variant [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
2.5
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>Experiments</title>
        <p>The proposed method was evaluated with in-vitro experiments on four porcine
livers. ToF data were acquired with a CamCube 2.0 (PMDTechnologies GmbH)
at a distance of 60 cm. CT data were acquired with an Artis zeego C-arm system
(Siemens AG, Healthcare Sector). The livers were scanned with a resolution of
512 × 512 × 348 voxels and a spacing of 0.70 × 0.70 × 0.70 mm.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>Quantitative ToF/CT registration results of the four porcine livers are given in
Fig. 3. In order to evaluate the registration accuracy, we measured the residual
mean distance d of all ToF surface points to the closest CT surface points.
(a)
(b)
(c)
dinit [mm]
The proposed method achieved a mean distance between the ToF/CT mesh of
dinit = 4.82 ± 0.79 mm for the initial coarse registration and d ne = 1.70 ±
0.36 mm for the subsequent ICP refinement. The descriptor parameters were
set heuristically. Qualitative registration results for liver L2 are shown in Fig. 3.
The runtime of the intra-operative pipeline (preprocessing, feature extraction,
correspondence search, registration) is less than 30 s on a 2.66 GHz CPU, with
the correspondence search running on a NVIDIA Geforce 8400 GPU. CT data
preprocessing can be performed pre-operatively.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>We have presented a multi-modal rigid registration framework for estimating
the initial alignment of an exposed intra-operative organ surface with
preoperative data based on local surface descriptors. Preliminary experimental results
on porcine liver data are encouraging and suggest that a feature-based
correspondence search can replace the manual selection of anatomical landmarks for
ICP initialization in organ registration applications (e.g. IGLS). Ongoing work
analyzes the influence of the ToF preprocessing pipeline and the robustness of
the descriptor w.r.t. minor deformations.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cash</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miga</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glasgow</surname>
            <given-names>S</given-names>
          </string-name>
          , et al.
          <article-title>Concepts and preliminary data toward the realization of image-guided liver surgery</article-title>
          .
          <source>J Gastrointest Surg</source>
          .
          <year>2007</year>
          ;
          <volume>11</volume>
          (
          <issue>7</issue>
          ):
          <volume>844</volume>
          {
          <fpage>59</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Besl</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neil</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>A method for registration of 3D shapes</article-title>
          .
          <source>IEEE Trans Pattern Anal Mach Intell</source>
          .
          <year>1992</year>
          ;
          <volume>14</volume>
          (
          <issue>2</issue>
          ):
          <volume>239</volume>
          {
          <fpage>56</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Seitel</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>dos Santos</surname>
            <given-names>TR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mersmann</surname>
            <given-names>S</given-names>
          </string-name>
          , et al.
          <article-title>Time-of-Flight Kameras fur die intraoperative Ober achenerfassung</article-title>
          . In:
          <string-name>
            <surname>Proc</surname>
            <given-names>BVM</given-names>
          </string-name>
          ;
          <year>2010</year>
          . p.
          <volume>11</volume>
          {
          <fpage>5</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bauer</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wasza</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muller K</surname>
          </string-name>
          , et al.
          <article-title>4D photogeometric face recognition with timeof- ight sensors</article-title>
          .
          <source>In: Proc WACV;</source>
          <year>2011</year>
          ,
          <article-title>accepted for publication.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Zaharescu</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boyer</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Viranasi</surname>
            <given-names>K</given-names>
          </string-name>
          , et al.
          <article-title>Surface feature detection and description with applications to mesh matching</article-title>
          .
          <source>In: Proc CVPR; 2009</source>
          . p.
          <volume>373</volume>
          {
          <fpage>80</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gelfand</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guibas</surname>
            <given-names>L</given-names>
          </string-name>
          , et al.
          <article-title>Robust global registration</article-title>
          .
          <source>In: Proc SGP; 2005</source>
          . p.
          <volume>197</volume>
          {
          <fpage>206</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>