<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Phuoc Nguyen, Truyen Tran, Svetha Venkatesh Centre for Pattern Recognition and Data Analytics Deakin University</institution>
          ,
          <addr-line>Geelong, Australia phuoc.nguyen, truyen.tran, svetha.venkatesh @deakin.edu.au</addr-line>
        </aff>
      </contrib-group>
      <fpage>25</fpage>
      <lpage>29</lpage>
      <abstract>
        <p>Modeling physiological time-series in ICU is of high clinical importance. However, data collected within ICU are irregular in time and often contain missing measurements. Since absence of a measure would signify its lack of importance, the missingness is indeed informative and might reflect the decision making by the clinician. Here we propose a deep learning architecture that can effectively handle these challenges for predicting ICU mortality outcomes. The model is based on Long Short-Term Memory, and has layered attention mechanisms. At the sensing layer, the model decides whether to observe and incorporate parts of the current measurements. At the reasoning layer, evidences across time steps are weighted and combined. The model is evaluated on the PhysioNet 2012 dataset showing competitive and interpretable results.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Multivariate physiological time-series are a critical component
in monitoring the critical state of the patient admitted to ICU
[Ghassemi et al., 2015a]. Characterizing this data type must
take into account the fact that data are irregularly sampled.
That is, biomarkers (Cholesterol, Glucose, Heart rate, etc.)
are measured and recorded only then the attending doctors
decide to do so, e.g., a particular measurement is made to
find out a critical condition about the patient at a given time.
Other measurements may be omitted if they offer no new
knowledge, are expensive and invasive, or if the patient is
stable with respect to these physiological parameters. Other
reasons for missing data could be due to technical errors, that
is, the measured signals are unreliable or interrupted in the
urgent and intensive conditions in ICU.</p>
      <p>Irregular timing causes great difficulties in statistical
analysis as the time gaps cannot easily characterized or predicted.
In this paper we consider predicting ICU mortality from
physiological time-series. Unfortunately, most existing time-series
methodologies assume equally spaced data [Erdogan et al.,
2004]. To this end, we propose a method that partly deals with
the difficulties due to irregular sampling. The main idea is
to “attend” to important signals/biomarkers and ignore
others. There are two level of attentions. At the signal level,
t=1
o
f
i</p>
      <p>a
h
c
x
t=2
z
classifier</p>
      <p>y
t=T
only informative signals at given time are kept. At the illness
state level, temporal state progression is considered, and only
time at which states are most critical for prediction will be
kept. The mechanism for weighting the temporal importance
is called “attention”. The attention mechanism might capture
the implicit human decision making.</p>
      <p>To realize the attention ideas, we derive a deep learning
architecture based on a recurrent neural network known as
Long Short-Term Memory (LSTM) [Hochreiter and
Schmidhuber, 1997]. The LSTM offers an attention mechanism at the
signal level through input gating. On top of the LSTM, we
impose several attention “heads” that read the illness states
over time, and decide on the importance of each time interval.
The resulting model is flexible and interpretable. Fig. 1
illustrates the model. Evaluation of the PhysioNet 2012 dataset
demonstrates the desirable characteristics.</p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>We present our deep neural net for reading ICU time-series and
predicting mortality. The model consists of four components:
data preprocessing, LSTM, reading heads and classifier. The
last three components are graphically depicted in Fig. 1.
2.1</p>
      <sec id="sec-2-1">
        <title>Data preprocessing</title>
        <p>We assume that each patient has multiple variable-length
timeseries sampled at arbitrary times. As the data is highly
irregular, it might be more useful to partly transform the data so
that some regularities can emerge. First for robustness,
outliers are handled by truncating all measures into the range of
percentiles. Then we divide the entire time-series
into intervals of equal length (e.g., 3 hours). If a measure is
missing in an interval, it is imputed by its mean value across
time for the patient. If the patient does not have this biomarker
measured, the mean value is taken from the entire dataset. For
each interval we collect simple statistics on each physiological
measure, including min, max, mean, median and standard
deviation . Finally, for each patient, we have a sequence of
vectors, where each vector is the set of statistics for its interval.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Long Short-Term Memory</title>
        <p>A Long Short-Term Memory (LSTM) [Hochreiter and
Schmidhuber, 1997] is a recurrent neural network. Let denotes the
input vector at time . LSTM maintains a memory cell and
a state vector over time. In our model, this state vector is
considered representing a patient’s illness state from begin to
the current time . Let be a candidate new memory update,
which is a function of the previous state and the current
input gate . The memory is updated as follows:
where is point-wise multiplication, is the input
gate to control what to read from raw data, and is
the forget gate to control the refreshing rate of the memory.
The two gates are function of and .</p>
        <p>This equation is crucial for handling irregular information.
When the new input is uninformative, the input gate can
learn to turn off (i.e., ) and the forget gate can learn to
turn on (i.e., ), and thus . In other words,
the memory is maintained. On the other hand, when the new
input is highly informative, the old memory can be safely
forgotten (i.e., , and ).</p>
        <p>Finally, the state vector is computed as
tanh
(1)
(2)
where
, which is a function of
and
.</p>
      </sec>
      <sec id="sec-2-3">
        <title>Bidirectional LSTM</title>
        <p>The LSTM is directional from the past to the future. Thus
when a state is estimated, it cannot be re-estimated on the
face of new evidences. A solution is to use bidirectional
LSTM, that is, we maintain two LSTMs, one from the past to
the future, the other in the reverse direction. The joint state
is likely to be more informative than each of
the component.
For each patient, the LSTMs produce a sequence of state
vectors . It is like a memory bank of slots from which
a read head can operate to generate sequence-level outputs.
Since is usually variable, we need to aggregate all the states
into a fixed-size vector. A number of soft reading heads are
therefore employed:
where is the index of the reading head,
and . Here is known as the attention
mechanism, and is parameterized as a neural network:
(3)
(4)
nnet
nnet</p>
        <p>Finally, readings are max-pooled as:
.
2.4</p>
      </sec>
      <sec id="sec-2-4">
        <title>Classifier</title>
        <p>Given the fixed-size vector , any differentiable classifier
can be placed on top to predict the future. For example, to
predict ICU mortality, a simple logistic regression can be
used: sigmoid for
regression parameters , and positive outcome .
Finally, the entire system is learnt by minimizing the log-loss:
. Since the system is end-to-end
differentiable, automatic differentiation and gradient descent
methods can be employed.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiments</title>
      <sec id="sec-3-1">
        <title>Dataset and Setting</title>
        <p>We use data from the PhysioNet Challenge 2012 [Silva et
al., 2012]. There are 4,000 patients of age 16 or over (mean:
64.5, std: 17.1), 56.1% are males. Of 41 measure types, five
are static (age, gender, height, ICU type, and initial weight),
and the other 36 are time-series. The recording time is 48
hours max. There are 4 ICU types: medical (35.8%),
surgical (28.4%), cardiac surgery recovery (21.1%), and coronary
(21.1%). The overall mortality rate is 18%.</p>
        <p>The models are implemented using Knet.jl. For the
experiments reported below, time intervals are 3 hours long, resulting
in 16 intervals per patient at most. At each interval, 185
statistics are extracted as input features. The memory cell (hence
the state and output vector) size is 32. The state vector of
the memory cell can be thought of as representing a patient’s
illness state. At time , this state is set to zero vector. At each
following time step, it is changed according to the response of
the input signals to the trained model. The progression of this
state vector is different between a positive and a negative case.
Two read heads are used to generate the output features, which
is then fed to a simple logistic regression to estimate the
probability of death. Dropout is utilized at both the input features
(due to high level of redundancy in the extracted statistics) and
the output features. Prediction performance is evaluated using
5-fold cross-validation.
(a)
(b)</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Related Work</title>
      <p>Irregular physiological time-series have attracted a fair amount
of attention in recent years, probably due to public
availability of large datasets such as MIMIC II/III [Caballero Barajas
and Akella, 2015; Che et al., 2016; D u¨richen et al., 2015;
Ghassemi et al., 2015b; Lasko et al., 2013; Li and
Marlin, 2015; Li-wei et al., 2015; Lipton et al., 2016; Liu and
Hauskrecht, 2016; Liu et al., 2013; Schulam and Saria, 2016;
Razavian et al., 2016]. The most popular strategy to deal with
missing data is imputation based on interpolation [Eckner,
2012]. An alternative method has also been suggested, in that
the time gaps are part of the models [Nguyen et al., 2017;
Pham et al., 2017].</p>
      <p>
        Neural nets in general and recurrent neural nets in
particular have long been applied for time-series data
        <xref ref-type="bibr" rid="ref22">(e.g.,
[Tresp and Briegel, 1998])</xref>
        . The modern surge in deep
learning has resulted in a new wave of more powerful nets
such as deep denoising autoencoder [Lasko et al., 2013]
and LSTM/GRU [Che et al., 2016; Esteban et al., 2016;
Lipton et al., 2016]. Attention has been recently suggested as
a mechanism to boost interpretability of RNNs [Choi et al.,
2016]. The main difference is that the attention in [Choi et
al., 2016] is used to select the original data for classification,
where our attention is to select the illness state.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>We have proposed to use attention as a mechanism to mitigate
the effect of missing data resulted from irregular sampling in
time-series. There are two attention levels, one at the sensing
layer to select informative measurements, and the other at the
reasoning layer to select the informative period. The idea is
realized using Long Short-Term Memory (LSTM) equipped with
multiple reading heads, which generate features for the
classifier. Experiments on ICU mortality prediction demonstrate
that the models are accurate and interpretable. It suggests
that alert can be generated in real-time if the new
measurements are informative (based on the attention probability) or
the mortality risk is sufficiently high.</p>
      <p>Future work includes more sophisticated imputation
methods, such as those in [Che et al., 2016], handling
multiresolutions, and explicitly incorporating data quality and
uncertainty into reasoning.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>This work is partially supported by the Telstra-Deakin Centre
of Excellence in Big Data and Machine Learning.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[Caballero Barajas and Akella</source>
          , 2015]
          <article-title>Karla L Caballero Barajas</article-title>
          and
          <string-name>
            <given-names>Ram</given-names>
            <surname>Akella</surname>
          </string-name>
          .
          <article-title>Dynamically modeling patient's health state from electronic medical records: A time series approach</article-title>
          .
          <source>In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          , pages
          <fpage>69</fpage>
          -
          <lpage>78</lpage>
          . ACM,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Che et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Zhengping</given-names>
            <surname>Che</surname>
          </string-name>
          , Sanjay Purushotham, Kyunghyun Cho, David Sontag, and Yan Liu.
          <article-title>Recurrent neural networks for multivariate time series with missing values</article-title>
          .
          <source>arXiv preprint arXiv:1606</source>
          .
          <year>01865</year>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [Choi et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Edward</given-names>
            <surname>Choi</surname>
          </string-name>
          , Mohammad Taha Bahadori, Jimeng Sun, Joshua Kulas, Andy Schuetz, and
          <string-name>
            <given-names>Walter</given-names>
            <surname>Stewart</surname>
          </string-name>
          .
          <article-title>RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>3504</fpage>
          -
          <lpage>3512</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Du¨richen et al.,
          <year>2015</year>
          ]
          <article-title>Robert Du¨richen, Marco AF Pimentel, Lei Clifton, Achim Schweikard, and David A Clifton. Multitask Gaussian processes for multivariate physiological time-series analysis</article-title>
          .
          <source>IEEE Transactions on Biomedical Engineering</source>
          ,
          <volume>62</volume>
          (
          <issue>1</issue>
          ):
          <fpage>314</fpage>
          -
          <lpage>322</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>[Eckner</source>
          , 2012]
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Eckner</surname>
          </string-name>
          .
          <article-title>A framework for the analysis of unevenly spaced time series data</article-title>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [Erdogan et al.,
          <year>2004</year>
          ]
          <string-name>
            <given-names>Emre</given-names>
            <surname>Erdogan</surname>
          </string-name>
          , Sheng Ma, Alina Beygelzimer, and
          <string-name>
            <given-names>Irina</given-names>
            <surname>Rish</surname>
          </string-name>
          .
          <article-title>Statistical models for unequally spaced time series</article-title>
          .
          <source>In Proceedings of the Fifth SIAM International Conference on Data Mining. SIAM</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [Esteban et al.,
          <year>2016</year>
          ] Cristo´bal Esteban, Oliver Staeck, Stephan Baier,
          <string-name>
            <given-names>Yinchong</given-names>
            <surname>Yang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Volker</given-names>
            <surname>Tresp</surname>
          </string-name>
          .
          <article-title>Predicting clinical events by combining static and dynamic information using recurrent neural networks</article-title>
          .
          <source>In Healthcare Informatics (ICHI)</source>
          ,
          <year>2016</year>
          IEEE International Conference on, pages
          <fpage>93</fpage>
          -
          <lpage>101</lpage>
          . IEEE,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [Ghassemi et al., 2015a]
          <string-name>
            <given-names>Marzyeh</given-names>
            <surname>Ghassemi</surname>
          </string-name>
          , Leo Anthony Celi, and
          <string-name>
            <surname>David</surname>
          </string-name>
          J Stone.
          <article-title>State of the art review: the data revolution in critical care</article-title>
          .
          <source>Critical Care</source>
          ,
          <volume>19</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [Ghassemi et al., 2015b]
          <string-name>
            <given-names>Marzyeh</given-names>
            <surname>Ghassemi</surname>
          </string-name>
          , Marco AF Pimentel, Tristan Naumann, Thomas Brennan, David A Clifton,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Szolovits</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Mengling</given-names>
            <surname>Feng</surname>
          </string-name>
          .
          <article-title>A Multivariate Timeseries Modeling Approach to Severity of Illness Assessment and Forecasting in ICU with Sparse, Heterogeneous Clinical Data</article-title>
          .
          <source>In Twenty-Ninth AAAI Conference on Artificial Intelligence</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>[Hochreiter and Schmidhuber</source>
          , 1997]
          <article-title>Sepp Hochreiter and Ju¨rgen Schmidhuber. Long short-term memory</article-title>
          .
          <source>Neural computation</source>
          ,
          <volume>9</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1735</fpage>
          -
          <lpage>1780</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [Lasko et al.,
          <year>2013</year>
          ]
          <article-title>Thomas A Lasko, Joshua C Denny, and Mia A Levy. Computational phenotype discovery using unsupervised feature learning over noisy, sparse, and irregular clinical data</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>8</volume>
          (
          <issue>6</issue>
          ):e66341,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>[Li and Marlin</source>
          , 2015]
          <string-name>
            <given-names>Steven</given-names>
            <surname>Cheng-Xian Li</surname>
          </string-name>
          and
          <string-name>
            <given-names>Benjamin</given-names>
            <surname>Marlin</surname>
          </string-name>
          .
          <article-title>Classification of sparse and irregularly sampled time series with mixtures of expected gaussian kernels and random features</article-title>
          .
          <source>In 31st Conference on Uncertainty in Artificial Intelligence</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <article-title>[Li-wei et al</article-title>
          .,
          <year>2015</year>
          ]
          <string-name>
            <given-names>H</given-names>
            <surname>Lehman</surname>
          </string-name>
          Li-wei, Ryan P Adams, Louis Mayaud, George B Moody, Atul Malhotra, Roger G Mark,
          <article-title>and</article-title>
          <string-name>
            <given-names>Shamim</given-names>
            <surname>Nemati</surname>
          </string-name>
          .
          <article-title>A physiological time series dynamicsbased approach to patient monitoring and outcome prediction</article-title>
          .
          <source>IEEE journal of biomedical and health informatics</source>
          ,
          <volume>19</volume>
          (
          <issue>3</issue>
          ):
          <fpage>1068</fpage>
          -
          <lpage>1076</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [Lipton et al.,
          <year>2016</year>
          ] Zachary
          <string-name>
            <surname>C Lipton</surname>
          </string-name>
          , David C Kale, and
          <string-name>
            <given-names>Randall</given-names>
            <surname>Wetzel</surname>
          </string-name>
          .
          <article-title>Modeling missing data in clinical time series with rnns</article-title>
          .
          <source>Conference on Machine Learning in Healthcare (MLHC)</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <source>[Liu and Hauskrecht</source>
          , 2016]
          <string-name>
            <given-names>Zitao</given-names>
            <surname>Liu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Milos</given-names>
            <surname>Hauskrecht</surname>
          </string-name>
          .
          <article-title>Learning adaptive forecasting models from irregularly sampled multivariate clinical data</article-title>
          .
          <source>In Thirtieth AAAI Conference on Artificial Intelligence</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Liu et al.,
          <year>2013</year>
          ] Zitao Liu,
          <string-name>
            <surname>Lei Wu</surname>
            , and
            <given-names>Milos</given-names>
          </string-name>
          <string-name>
            <surname>Hauskrecht</surname>
          </string-name>
          .
          <article-title>Modeling clinical time series using gaussian process sequences</article-title>
          .
          <source>In SIAM International Conference on Data Mining (SDM)</source>
          , pages
          <fpage>623</fpage>
          -
          <lpage>631</lpage>
          . SIAM,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [Nguyen et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>Phuoc</given-names>
            <surname>Nguyen</surname>
          </string-name>
          , Truyen Tran, Nilmini Wickramasinghe, and
          <string-name>
            <given-names>Svetha</given-names>
            <surname>Venkatesh</surname>
          </string-name>
          .
          <article-title>Deepr: A Convolutional Net for Medical Records</article-title>
          .
          <source>Journal of Biomedical and Health Informatics</source>
          ,
          <volume>21</volume>
          (
          <issue>1</issue>
          ),
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [Pham et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>Trang</given-names>
            <surname>Pham</surname>
          </string-name>
          , Truyen Tran, Dinh Phung, and
          <string-name>
            <given-names>Svetha</given-names>
            <surname>Venkatesh</surname>
          </string-name>
          .
          <article-title>Predicting healthcare trajectories from medical records: A deep learning approach</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          ,
          <volume>69</volume>
          :
          <fpage>218</fpage>
          -
          <lpage>229</lpage>
          , May
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Razavian et al.,
          <year>2016</year>
          ]
          <string-name>
            <given-names>Narges</given-names>
            <surname>Razavian</surname>
          </string-name>
          ,
          <article-title>Jake Marcus, and David Sontag. Multi-task prediction of disease onsets from longitudinal laboratory tests</article-title>
          .
          <source>In Proceedings of the 1st Machine Learning for Healthcare Conference</source>
          , pages
          <fpage>73</fpage>
          -
          <lpage>100</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <source>[Schulam and Saria</source>
          , 2016]
          <string-name>
            <given-names>Peter</given-names>
            <surname>Schulam</surname>
          </string-name>
          and
          <string-name>
            <given-names>Suchi</given-names>
            <surname>Saria</surname>
          </string-name>
          .
          <article-title>Integrative analysis using coupled latent variable models for individualizing prognoses</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>17</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>35</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [Silva et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>Ikaro</given-names>
            <surname>Silva</surname>
          </string-name>
          , George Moody, Daniel J Scott,
          <article-title>Leo A Celi,</article-title>
          and Roger G Mark.
          <article-title>Predicting inhospital mortality of icu patients: The physionet/computing in cardiology challenge 2012</article-title>
          . In Computing in Cardiology (CinC),
          <year>2012</year>
          , pages
          <fpage>245</fpage>
          -
          <lpage>248</lpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <source>[Tresp and Briegel</source>
          , 1998]
          <string-name>
            <given-names>Volker</given-names>
            <surname>Tresp</surname>
          </string-name>
          and
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Briegel</surname>
          </string-name>
          .
          <article-title>A solution for missing data in recurrent neural networks with an application to blood glucose prediction</article-title>
          .
          <source>Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>971</fpage>
          -
          <lpage>977</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>