<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Simultaneous Measurement Imputation and Outcome Prediction for Achilles Tendon Rupture Rehabilitation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Charles Hamesse</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paul Ackermann</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hedvig Kjellstrom</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cheng Zhang</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>KTH Royal Institute of Technology</institution>
          ,
          <addr-line>Stockholm</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Karolinska University Hospital</institution>
          ,
          <addr-line>Stockholm</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Microsoft Research</institution>
          ,
          <addr-line>Cambridge</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Achilles Tendon Rupture (ATR) is one of the typical soft tissue injuries. Accurately predicting the rehabilitation outcome of ATR using noisy measurements with missing entries is crucial for treatment decision support. In this work, we design a probabilistic model that simultaneously predicts the missing measurements and the rehabilitation outcome in an end-to-end manner. We evaluate our model and compare it with multiple baselines including multi-stage methods using an ATR clinical cohort. Experimental results demonstrate the superiority of our model for ATR rehabilitation outcome prediction.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Soft tissue injuries, such as Achilles Tendon Rupture (ATR), are
increasing in recent decades [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. These injuries require lengthy healing processes with
abundant complications which can cause severe incapacity to individuals.
Numerous measurements are not carried out for all patients since they can be costly
and/or painful. Thus, accurately predicting the rehabilitation outcome at
different stages using the existing measurements is a crucial problem. Leveraging
the predictive power of data-driven approaches, it is of great interest to nd out
whether we can predict potential outcomes for new patients with sparse and
noisy data, and thus provide decision support for practitioners. In this work, we
focus on predicting the rehabilitation outcome of ATR, but our framework can
be further applied to a wider domain of conditions. In particular, we develop a
generic, end-to-end model to tackle two problems at once: imputing the missing
measurements and test values for patients during their stay at the hospital, and
predicting the outcome of their rehabilitation after 3, 6 and 12 months.
      </p>
      <p>
        We use a real-life dataset from the Orthopedic Research Group, aggregated
from multiple previous studies [
        <xref ref-type="bibr" rid="ref2 ref8">8, 2</xref>
        ]. It consists of a longitudinal cohort with 442
patients described by 360 variables. A snapshot of the dataset is shown in Fig.
1. We split all variables into two categories based on the patient's journey. The
rst category contains patients' demographics and measurements realized during
their stay at hospital; variables in this category are referred to as predictors P in
vdm
udn
      </p>
      <p>D
Anm
Inm
Pnm Xnm
wmp</p>
      <p>M</p>
      <p>Bnp
Snp
In0p</p>
      <p>N</p>
      <p>P
(a) Using the measurement P^
bp
vdm
udn
wdp</p>
      <p>D
bp
Anm
Inm
Pnm Xnm Snp</p>
      <p>Bnp</p>
      <p>In0p
M</p>
      <p>P</p>
      <p>N
(b) Using patient traits U^</p>
      <p>
        Age Length Weight : : : DVT 2 : : : ATRS 12 sti ATRS 12 pain
1 27 190 79.8 : : : 1 : : : 8 10
2 36 : : : : : : 8
3 41 172 : : : 0 : : : 10 10
the following. The second category is the scores S, and includes all rehabilitation
outcome tests, such as ATRS [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] or FAOS taken at 3, 6 and 12 months. ATRS and
FAOS assess the function and symptom of the tendon by a number of
patientreported criteria. At the time when the patient is discharged from hospital, the
number of measurements is M = 297, and the number of the scores to predict in
the next three visits is P = 63, for N = 442 patients. 69.5% entries are missing
for the predictors, and 64.2% for the scores.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>We design an end-to-end probabilistic model to simultaneously impute the
missing entries and predict the rehabilitation outcome as shown in Fig. 2.</p>
      <p>
        We formulate the missing data imputation problem into a collaborative
ltering or matrix factorization problem [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The model assumes that the patient
measurement a nity matrix A is generated from the patient traits U, which
re ect the health status of the patient, and predictor traits V, which map
different health status to measurements from various medical instruments. We use
2 ) =
Gaussian distributions to model these entries in the same way as [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]: p(Uj U
QnN=1 N (unj U; U21); p(Vj V2) = QmM=1 N (vmj V; V21). The measurement
imputation model is:
      </p>
      <p>N M "
p(PjU; V; P2) = Y Y
#Inm
;
where Inm is an indicator set to 1 if Pnm is observed and 0 otherwise.</p>
      <p>We then predict the score matrix S using the patient information, which can
be either the imputed measurement matrix as shown in Fig. 2(a) or the patient
trait vector which summaries the patient state as shown in Fig. 2(b).
Bayesian linear regression We rst consider a Bayesian linear regression model.
The score is modeled as:</p>
      <p>N P "
p(S j W; b; X) = Y Y</p>
      <p>n=1 p=1
p(W) = N (W j 0; w21);</p>
      <p>N (Snp j xnwp + bp; S2)
#In0p</p>
      <p>;
p(b) = N (b j 0; b2);
where the input X is either the predictors or the patient traits, W and b are
the weights and bias parameters for Bayesian linear regression. S indicates the
observed rehabilitation scores, that is the rehabilitation outcome B, masked by
the boolean observation indicator I0. In the case of the predictors (Fig. 2(a)),
we use the observed values so that X = P^ = I P + (1 I) A, where I is
the N M measurement observation indicator. In the case of the patient traits
(Fig. 2(b)), we have X = U^ .</p>
      <p>
        Bayesian neural network We also consider a Bayesian neural network (BNN)
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In this case, we have the following conditional distribution of the scores:
(2)
(3)
(4)
      </p>
      <p>
        N P "
p(S j ; X) = Y Y
where NN is a BNN parameterized by , the collection of all weights and biases
of the network. We consider fully connected layers with hyperbolic tangent
activations. The graphical model resembles the one in Fig. 2, with the exception
that instead of the weights W and biases b, we have BNN parameters .
Inference We run inference on the entire model in an end-to-end manner. We use
variational inference with the KL divergence [
        <xref ref-type="bibr" rid="ref1 ref9">1, 9</xref>
        ]. We implement all our models
using Edward [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], a Python library for probabilistic programming that o ers
various inference choices including variational inference with KL divergence.
4
      </p>
    </sec>
    <sec id="sec-3">
      <title>Experiments</title>
      <p>We present our experimental results, comparing our proposed model with
multiple baselines using the ATR dataset. We convert the whole dataset to
numerical values and normalize all variables to t in the range of [0; 1].
Baselines We compare four variations of our proposed model against two
baselines: mean data imputation, and two-stage model where inference is run in a
sequential manner. In the case of mean imputation, the per-patient mean of
observations that belong to the training set is imputed to all of their missing
measurements. We then use the completed measurement data in the second
component of the model. The second baseline is a two-stage version of our model.
We rst use the measurement imputation part of our method to impute missing
data. Then, we use this completed predictor matrix for the outcome prediction
in the second stage. Inference is run separately on each component.
Component 2 Input</p>
      <p>BLR P BLR S BNN P BNN S</p>
      <p>Results We split the training and testing set to re ect the treatment journey.
In all of our experiments. For P, we randomly pick 80% of the available data
for training and leave the rest for testing. For S, we randomly pick 80% of the
patients, take all of their available scores for training and leave all scores of the
remaining 20% of patients for testing, since the goal of our work is to predict
the outcome using patients' incomplete measurements. We use grid search for
hyper-parameter tuning and report the best result for each method in Table 1,
using the Mean Absolute Error (MAE).</p>
      <p>We can see that our proposed end-to-end model with Bayesian neural network
applied on P achieves the best performance for the prediction of rehabilitation
outcomes. Our proposed method shows clear improvement over the baselines.
We can see that using latent variable models for data imputation gives better
performance than the traditional mean imputation method. Additionally, it was
established that a di erence of 0.1 is signi cant in the case of ATR in clinical
practice. Thus, our result is close to the ideal MAE target 0:1.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>We developed a probabilistic framework to simultaneously predict the
rehabilitation outcome and impute the missing entries in a clinical cohort in the
context of Achilles Tendon Rupture rehabilitation. We demonstrated a clear
improvement in the accuracy of the predicted outcomes in comparison with
traditional data imputation methods. Using the proposed model, we obtained a
mean absolute error of 0:156. This result is close to the target 0:1 which is the
clinical di erence step. We will thus continue to explore modeling choices to
improve the outcome prediction accuracy. Additionally, the proposed method is a
general framework that can be used in numerous health-care applications
involving a long-term healing process after the treatment. In the future, we would also
collaborate with more health-care departments to test and improve our method
in these applications.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kucukelbir</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McAuli</surname>
            <given-names>e</given-names>
          </string-name>
          , J.D.:
          <article-title>Variational inference: A review for statisticians</article-title>
          .
          <source>Journal of the American Statistical Association</source>
          <volume>112</volume>
          (
          <issue>518</issue>
          ),
          <volume>859</volume>
          {
          <fpage>877</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Domeij-Arverud</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anundsson</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hardell</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barreng</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Edman</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lati</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labruto</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ackermann</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Ageing, deep vein thrombosis and male gender predict poor outcome after acute achilles tendon rupture</article-title>
          .
          <source>Bone Joint J</source>
          <volume>98</volume>
          (
          <issue>12</issue>
          ),
          <volume>1635</volume>
          {
          <fpage>1641</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Huttunen</surname>
          </string-name>
          , T.T.,
          <string-name>
            <surname>Kannus</surname>
            ,
            <given-names>P.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rolf</surname>
            ,
            <given-names>C.G.</given-names>
          </string-name>
          ,
          <article-title>Fellander-</article-title>
          <string-name>
            <surname>Tsai</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mattila</surname>
            ,
            <given-names>V.M.:</given-names>
          </string-name>
          <article-title>Acute achilles tendon ruptures: incidence of injury and surgery in sweden between 2001 and 2012</article-title>
          .
          <source>The American journal of sports medicine 42</source>
          <volume>10</volume>
          ,
          <issue>2419</issue>
          {
          <fpage>23</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Neal</surname>
            ,
            <given-names>R.M.</given-names>
          </string-name>
          :
          <article-title>Bayesian learning for neural networks</article-title>
          , vol.
          <volume>118</volume>
          . Springer Science &amp; Business
          <string-name>
            <surname>Media</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>S</given-names>
            <surname>Kearney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Achten</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E</given-names>
            <surname>Lamb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Parsons</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L</given-names>
            <surname>Costa</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.:</surname>
          </string-name>
          <article-title>The achilles tendon total rupture score: A study of responsiveness, internal consistency and convergent validity on patients with acute achilles tendon ruptures</article-title>
          10,
          <volume>24</volume>
          (02
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Stern</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herbrich</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Graepel</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Matchbox: Large scale bayesian recommendations</article-title>
          .
          <source>In: Proceedings of the 18th International World Wide Web Conference (January</source>
          <year>2009</year>
          ), https://www.microsoft.com/en-us/research/publication/matchboxlarge
          <article-title>-scale-bayesian-recommendations/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kucukelbir</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dieng</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rudolph</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D.M.:</given-names>
          </string-name>
          <article-title>Edward: A library for probabilistic modeling, inference, and criticism (</article-title>
          <year>2016</year>
          ), http://arxiv.org/abs/1610.09787, cite arxiv:
          <volume>1610</volume>
          .
          <fpage>09787</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Valkering</surname>
            ,
            <given-names>K.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aufwerber</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ranuccio</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lunini</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Edman</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ackermann</surname>
            ,
            <given-names>P.W.:</given-names>
          </string-name>
          <article-title>Functional weight-bearing mobilization after achilles tendon rupture enhances early healing response: a single-blinded randomized controlled trial</article-title>
          .
          <source>Knee Surgery, Sports Traumatology, Arthroscopy</source>
          <volume>25</volume>
          (
          <issue>6</issue>
          ),
          <year>1807</year>
          {
          <year>1816</year>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butepage</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kjellstrom</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mandt</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Advances in variational inference</article-title>
          .
          <source>arXiv preprint arXiv:1711.05597</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>