<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Predicting Glycemia in Type 1 Diabetes Patients: Experiments with XGBoost</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Cooper Midroni</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter J. Leimbigler</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gaurav Baruah</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maheedhar Kolla</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alfred J. Whitehead</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yan Fossat Klick Inc.</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bloor Street East</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Toronto</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ontario</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Canada</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>cmidroni</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>pleimbigler</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>gbaruah</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>mkolla</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>awhitehead</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>yfossatg@klick.com</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>Type 1 diabetes patients must self-administer insulin through injections or insulin-pump therapy, requiring careful lifestyle management around meals and physical activity. Accurate blood glucose prediction could increase patient quality of life, and foreknowledge of hypoglycemia or hyperglycemia could mitigate risks and save lives. For the 2018 BGLP Challenge, we experiment primarily with XGBoost to predict blood glucose levels at a 30-minute horizon in the OhioT1DM dataset. Our experiments show that XGBoost can be a competitive predictor of blood glucose levels, as compared to prior research, and that feature signals from different sources contribute in varying capacity for improved predictive ability of XGBoost.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Diabetes affects over 400 million people worldwide [World
Health Organization, 2016], with near 5% of diabetics
suffering from type 1 diabetes (T1D) [American Diabetes
Association, 2018]. Patients with T1D are incapable of producing
insulin, a hormone generated by the pancreas, which acts as
the primary regulator of blood glucose metabolism. This
dysfunction can lead to both hypoglycemia (low blood sugar) and
hyperglycemia (high blood sugar), resulting in a significant
patient burden to regulate carbohydrate consumption and
supplemental insulin delivery. Hyperglycemia can lead to
medical complications such as vision loss and kidney failure, and
increases risk of heart disease and stroke. Hypoglycemia can
lead to loss of consciousness and even death.</p>
      <p>An increasing number of T1D patients are adopting insulin
pump therapy, wherein a wearable device releases insulin
subcutaneously to mimic pancreatic response. Current
insulin pumps require patient input on carbohydrate intake and
approval of each recommended insulin dose. Driven by the
outstanding need for closed-loop insulin therapy, the notion
of an artificial pancreas has gained traction in diabetes-related
research [Graf et al., 2017; Juvenile Diabetes Research
Foundation, 2018].</p>
      <p>The dysregulation of blood glucose in T1D patients is
further complicated by daily variations in the magnitude
and timing of meals, physical activity, and insulin
selfadministration. This, along with the altered pharmacokinetics
of subcutaneous insulin, add further layers of complexity to
the task of predicting blood glucose. As such, the Blood
Glucose Level Prediction Challenge represents an important step
toward the realization of an artificial pancreas. Herein lies the
objective of restoring homeostasis through accurately dosed
insulin, and the creation of a model which captures the
complexities of the disease. This challenge was particularly
motivating for us, not simply from the perspective of predictive
modeling, but also for the potential applications in providing
tangible benefits to T1D patients.</p>
      <p>To predict glucose at a 30-minute time horizon, we
processed the raw features of the OhioT1DM dataset [Marling
and Bunescu, 2018] to create 3 different feature sets, and
experimented with gradient-boosted trees [Chen and Guestrin,
2016a] (XGBoost), random forests [Breiman, 2001], and
recurrent neural network variants. We find that:</p>
      <p>XGBoost performs on par with prior models [Bunescu
et al., 2013; Mirshekarian et al., 2017] for this task.
Many of the provided features do not contribute to
improved predictive performance. In essence, when using
XGBoost, past glucose is the most important predictor
of future glucose.</p>
      <p>Using -insensitive loss for training LSTMs improves
predictive performance compared to mean squared error.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Data</title>
      <p>The OhioT1DM dataset comprises 19 features collected from
6 patients with T1D [Marling and Bunescu, 2018].
Patients wore Medtronic 530G insulin pumps, Medtronic Enlite
continuous glucose monitors (CGM), and Basis Peak fitness
wristbands. 8 weeks of data were provided per patient, of
which the final 10 days were provided separately as a test set.
2.1</p>
      <sec id="sec-2-1">
        <title>Data Analysis</title>
        <p>Due to the heterogeneity of the data, we grouped features
according to their frequency:
one-off data: intermittent measurements with no fixed
sampling frequency or duration. (e.g., finger stick
glucose, insulin bolus time and dose, sleep times and
quality, work intensity, exercise intensity and duration, meal
)
/Ld400
g
m
(
l
vee200
l
e
s
o
lcug 0Mo
)
/Ld400
g
m
(
l
vee200
l
e
s
o
lcug 0Mo</p>
        <p>Patient 563
Th Fr</p>
        <p>Patient 591
Tu</p>
        <p>We</p>
        <p>Sa</p>
        <p>Su</p>
        <p>Mo</p>
        <p>Tu</p>
        <p>We</p>
        <p>Sa</p>
        <p>Su
)180
pbm150
(te120
tra 90
rae 60
h</p>
        <p>Patient 563
Th Fr
Patient 591</p>
        <p>type and carbohydrate content, hypoglycemic events,
illnesses, stressors)
quasi-continuous data: signals in continuous effect, and
signals aggregated at 5-minute intervals. (e.g., CGM
glucose level, basal and temporary basal rates of insulin
infusion, heart rate, steps taken, galvanic skin response
(GSR), skin and air temperature)</p>
        <p>Analysis of quasi-continuous data showed unique
temporal patterns in each patient (Figure 1). These bio-signals
displayed properties characteristic to each individual, such as
repeated waveforms and unique signal means. These qualities
reflect the hour-of-day periodicities and homeostatic norms
that vary across patients. Accordingly, we were motivated to
build a distinct model for each patient, as opposed to a general
model trained on the data of several patients.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Feature Engineering</title>
      </sec>
      <sec id="sec-2-3">
        <title>Expanded Feature Set</title>
        <p>We found that due to the variable sampling frequency of
oneoff features and missing values in the data, the feature vector
at any given timestamp was not guaranteed to contain values
for all fields. Converting the data into a feature matrix
resulted in rows with missing values, hindering analysis. We
thus chose to resample our data to 5-minute intervals,
reflecting the 5-minute aggregation frequency of quasi-continuous
variables.</p>
        <p>Within each 5-minute resampling window, we aggregated
each feature with either its maximum, mean, or last valid
value, depending on its nature. For instance, we took the
sum of steps, whereas we computed the mean of heart rate.
Some features are only in effect for specified durations. For
instance, basal insulin infusion rates were overridden with
respective temporary basal overrides, if any, and each
squarewave insulin bolus dose was spread evenly across its
specified time interval. Missing values for glucose were imputed
via linear interpolation.</p>
        <p>Based on our observations (Sec. 2.1) of time-dependent
patterns in the data, e.g., the dawn phenomenon, we included
one-hot encoded features for hour-of-day and day-of-week.
For each one-off feature, a binary indicator feature [Che et
al., 2018] was used to denote missing values.</p>
      </sec>
      <sec id="sec-2-4">
        <title>Condensed Feature Set</title>
        <p>We developed a condensed feature set based on pairwise
correlations between features. This feature set included Basis
Peak band and Medtronic CGM sensor data, based on the
strength of their correlation with glucose. Several derivations
of the glucose signal were added to the feature set,
including five- and ten-minute time lags of glucose to provide the
model with information of glucose history. A binary indicator
feature marked when the present glucose level was in the
upper or lower 20% of the patient’s glucose distribution. Other
features present in this set include the last bolus dose, mean
basal rate over the past 5 minutes, and an indicator of whether
the patient was asleep.</p>
      </sec>
      <sec id="sec-2-5">
        <title>Dimensionality-Reduced Features (PCA-reduced)</title>
        <p>We used Principal Component Analysis (PCA) to transform
our expanded feature set to remove features with minimal
variance. Such features are less likely to be predictive in
a model. On average, the first 55 principal components
accounted for 99% of the variance in each patient’s dataset.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Machine Learning Models</title>
      <p>For all models, we used the final week of each patient’s
training data for validation. In training personalized models,
hyperparameters and model structure (e.g. learning rates,
number of LSTM nodes) were kept consistent across patients. All
models in this study were implemented on all three of the
engineered feature sets.</p>
      <p>We investigated the following models:</p>
      <p>Tree ensembles using the Random Forest Regressor
implementation of Scikit-Learn [Pedregosa et al., 2011].
Regression-based gradient-boosted decision trees using
XGBoost [Chen and Guestrin, 2016b].</p>
      <p>Recurrent neural network variants using Keras [Chollet,
2015].</p>
      <p>Keras was used to create several RNN models:
multilayer LSTMs, GRUs, a Bidirectional GRU, and LSTMs with
dropout. These were implemented to evaluate their
performance on the data, and were tested on varying durations of
look-back windows (5 minutes–1 day).
We found that no one feature set (either expanded, condensed
or PCA-reduced) produced consistently better glucose
predictions, and different models performed better on different
feature sets. Table 1 lists our submitted systems and feature
sets.</p>
      <p>XGBoost was the best-performing model on both the
expanded and PCA-reduced feature sets, achieving a mean
RMSE across all patients of 20.377. These results are at par
with previously published models based on Support Vector
Regression [Bunescu et al., 2013].</p>
      <sec id="sec-3-1">
        <title>Experiments with LSTM Loss Functions</title>
        <p>Our LSTM models were simple and did not perform as well
as recent LSTM models for blood-glucose prediction
[Mirshekarian et al., 2017]. We submitted results for an LSTM
model that was composed of: layers LSTM(64 nodes),
LSTM(64 nodes), Fully-connected(32 nodes);
a dropout rate of 0.2; an Adadelta optimizer; and a
lookback of 5 minutes.</p>
        <p>We observed that Mean Absolute Error (MAE) improved
the performance of trained LSTMs over using Mean Squared
Error (MSE) as the loss function. The models trained with
MSE showed a degradation which was particularly severe for
glucose values near hypoglycemic and hyperglycemic
levels [Medtronic, 2010]. MAE, in contrast to MSE, does not
penalize large errors as heavily as MSE, which likely helped
improve performance over outlying cases.</p>
        <p>The generally accepted error rate for finger-stick blood
glucose measurements is 15 mg/dL [Food and Drug
Administration, 2016]. Thus, for predictions within a 15 mg/dL window
of the ground truth, the the loss for such values can be
considered less impactful. We therefore investigated the use of
an -insensitive loss function for training our LSTM, with
set as 5 mg/dL, for a more stringent boundary than 15 mg/dL.</p>
        <p>We compared the three loss functions: and found that
training LSTMs with MAE loss improved results (RMSE 24.586)
over MSE loss (RMSE 30.097) with -insensitive loss
performing the best with an RMSE of 23.483.
4
4.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Follow-up Experiments and Results</title>
      <sec id="sec-4-1">
        <title>Post Challenge Submission</title>
        <p>We tuned the hyperparameters of our best model, XGBoost,
and added the following features to the expanded featureset:
first difference of CGM glucose; time since last bolus, meal,
hypo event, and hypo correction; size of last bolus; carbs in
last meal; and lagged features, up to 2 hours. 8-fold
crossvalidation without shuffling was used on the full training set
to optimize the number of boosting rounds.
Given the diversity of features sourced from biological
measurements, self-reported events, and non-invasive
physiological signals, we conducted an ablation study to determine
the relative performance of models on subsets of feature
groups. This experiment was performed with our best
XGBoost model. From our expanded feature set, we created the
following feature subsets:</p>
        <p>Self-reported features (S): meals, finger-stick glucose,
illness, stress, exercise, and work, together with
missingvalue indicator columns, and one-hot encodings for meal
type (41 features)
Basis Peak band features (B): heart rate, GSR, skin and
air temperature, steps, and sleep (6 features)
Pump features (P): basal and temporary basal infusion
rates, bolus doses, together with missing-value indicator
and one-hot encoding columns (10 features)
CGM glucose feature (G): blood glucose level recorded
via CGM sensor (1 feature)
Time features: one-hot encodings for hour-of-day and
day-of-week (31 features)
The XGBoost model was trained on a feature vector
containing the current feature value as well as the previous 12 values,
lagged at 5 minutes apart. In total, we investigated 15
combinations of the S, P, B, and G features (Table 2). Time features
were included in all combinations.</p>
        <p>Table 2 shows that:</p>
        <p>Prediction suffers greatly when glucose (G) is ablated.
The best model does not include insulin (P) or band (B)
features.</p>
        <p>We used XGBoost’s feature importance rank to observe
which features contributed to the most decision splits within
the trees of the model. More splits within the trees infer a
higher importance in decision making. Table 3 shows the
mean rank of a feature as determined by XGBoost’s
importance score. We observe that, on average, XGBoost’s
decisions are most influenced by: (i) current glucose, (ii) glucose
one hour ago, and (iii–xii) other glucose values within the
past hour. This remains true irrespective of inclusion of Basis
Peak band features.
4.3</p>
      </sec>
      <sec id="sec-4-2">
        <title>Data Imputation Revisited</title>
        <p>In each of the patient’s data, missing values were observed in
CGM measurements. Initially, these data gaps were filled via
linear interpolation for both the training and test sets.
However, such an imputation across gaps is only valid in a
training and batch prediction setting. In online prediction, new
feature vectors stream into the model in real time. Thus, to
reflect realistic online prediction, missing values should only
be imputed using past data.</p>
        <p>To this point, any time intervals with missing glucose
values should be excluded from the test-RMSE metric. As
such, we corrected our implementation of test-RMSE to
include only data periods with available glucose. In Figure
2 (left), unfiltered RMSE is computed with the interpolated
points (black over gray) included in the test set, whereas
corrected RMSE is computed with the interpolated points
excluded from the test set (no predictions when glucose values
are missing). Under the linear interpolation scheme, mean
RMSE improves from an unfiltered value of 18.540 to 16.214
mg/dL (Table 4). Note that other features may have values
even when glucose is absent (Sec. 2.2).</p>
        <p>We then ran our best model on three test-set imputation
schemes, the latter two of which are compatible with online
prediction: linear interpolation (linear), persisting the last
valid value (ffill), and leaving gaps unchanged (none).</p>
        <p>Figure 2 and Table 4 compare the RMSEs of these
imputation schemes. The unfiltered RMSE column in Table 4
gives model performance on interpolated test glucose
values, without using the corrected RMSE function. The
corrected RMSE columns list performance for three imputation
schemes, using the corrected RMSE function. Interpolation
with corrected RMSE performs best, but only forward-filled
and non-imputed schemes can be implemented in an online
context. Figure 2 (left) illustrates incorrect interpolation of
glucose values. Figure 2 (center) and (right) show the ffill
and none schemes, both of which are compatible with online
prediction.</p>
        <p>As an interesting exercise, missing test-set glucose values
were left unchanged, and XGBoost was allowed to make
predictions on the remaining features in the absence of glucose
(Figure 2, right panel, orange over gray). Predictions were
more variable in the absence of glucose signal, but seem to
recover within two hours of the end of the data gap.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Context, Related Work, and Discussion</title>
      <p>In this work, our aim was to deepen our understanding of the
predictive modeling of bio-signals, for which the Blood
Glucose Prediction Challenge was ideally suited. We
acknowledge that a rich body of research exists, which explores the
prediction of glycemia in depth.</p>
      <p>Researchers have previously implemented Support Vector
Regression [Plis et al., 2014; Bunescu et al., 2013],
neural networks [Pappada et al., 2008], recurrent neural
networks [Allam et al., 2011; Mirshekarian et al., 2017], as
well as genetic algorithms [Hidalgo et al., 2017]. Feature
engineering approaches include using expectation
maximization for missing data imputation [Tresp and Briegel, 1998],
as well as physiologically modeling glucose response
signals as features [Bunescu et al., 2013; Zecchin et al., 2012;
Contreras et al., 2017].</p>
      <p>For this work, we applied conventional feature-engineering
methods. In the future, we would like to explore the inclusion
of features based on physiological models of bio-signals for
prediction. As an extension to our ablation and feature
importance study, we would also like to explore non-glucose
signals in-depth for glucose level prediction.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Our main finding is that XGBoost remains competitive with
previously reported ARIMA models [Bunescu et al., 2013],
which supports glucose-derived features as the strongest
predictors of future glucose levels.</p>
      <p>We observed that in LSTMs, -insensitive loss proved a
more effective loss function than MAE, as inspired by the
notion of incorporating an error tolerance corresponding to
finger-stick measurement error. Interestingly, XGBoost
models outperformed LSTMs in our study.</p>
      <p>The collaboration of life sciences with the practice of data
science offers the possibility of developing truly
individualized proactive medicine. By personalizing such predictive
models, we endeavour to further explore key signals—digital
biomarkers, digital surrogate measurements—which reflect
the strength of this interdisciplinary collaboration, and our
ability to transform the future of healthcare.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>We would like to thank Michael Li and the rest of the
KlickLabs team at Klick Inc. for their feedback, support, and
encouragements.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Allam et al.,
          <year>2011</year>
          ]
          <string-name>
            <given-names>Fayrouz</given-names>
            <surname>Allam</surname>
          </string-name>
          , Zaki Nossai, Hesham Gomma, Ibrahim Ibrahim, and
          <string-name>
            <given-names>Mona</given-names>
            <surname>Abdelsalam</surname>
          </string-name>
          .
          <article-title>A recurrent neural network approach for predicting glucose concentration in type-1 diabetic patients</article-title>
          .
          <source>In Engineering Applications of Neural Networks</source>
          , pages
          <fpage>254</fpage>
          -
          <lpage>259</lpage>
          . Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [American Diabetes Association,
          <year>2018</year>
          ]
          <article-title>American Diabetes Association. Type 1 diabetes</article-title>
          . http://www.diabetes.org/ diabetes-basics/type-1/,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>[Breiman</source>
          , 2001]
          <string-name>
            <given-names>Leo</given-names>
            <surname>Breiman</surname>
          </string-name>
          .
          <article-title>Random forests</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>45</volume>
          (
          <issue>1</issue>
          ):
          <fpage>5</fpage>
          -
          <lpage>32</lpage>
          ,
          <year>Oct 2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Bunescu et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Razvan</given-names>
            <surname>Bunescu</surname>
          </string-name>
          , Nigel Struble, Cindy Marling, Jay Shubrook, and
          <string-name>
            <given-names>Frank</given-names>
            <surname>Schwartz</surname>
          </string-name>
          .
          <article-title>Blood glucose level prediction using physiological models and support vector regression</article-title>
          .
          <source>In ICMLA</source>
          ,
          <year>2013</year>
          , volume
          <volume>1</volume>
          , pages
          <fpage>135</fpage>
          -
          <lpage>140</lpage>
          . IEEE,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [Che et al.,
          <year>2018</year>
          ]
          <string-name>
            <given-names>Zhengping</given-names>
            <surname>Che</surname>
          </string-name>
          , Sanjay Purushotham, Kyunghyun Cho, David Sontag, and Yan Liu.
          <article-title>Recurrent neural networks for multivariate time series with missing values</article-title>
          .
          <source>Scientific reports</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>6085</fpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>[Chen and Guestrin</source>
          , 2016a]
          <string-name>
            <given-names>Tianqi</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Carlos</given-names>
            <surname>Guestrin</surname>
          </string-name>
          .
          <article-title>Xgboost: A scalable tree boosting system</article-title>
          .
          <source>CoRR, abs/1603.02754</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[Chen and Guestrin</source>
          , 2016b]
          <string-name>
            <given-names>Tianqi</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Carlos</given-names>
            <surname>Guestrin</surname>
          </string-name>
          .
          <article-title>Xgboost: A scalable tree boosting system</article-title>
          .
          <source>In KDD, KDD '16</source>
          , pages
          <fpage>785</fpage>
          -
          <lpage>794</lpage>
          , New York, NY, USA,
          <year>2016</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>[Chollet</source>
          , 2015] Franc¸ois et al Chollet. Keras. https://keras. io,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [Contreras et al.,
          <year>2017</year>
          ]
          <article-title>Iva´n Contreras, Silvia Oviedo, Martina Vettoretti, Roberto Visentin, and Josep Veh´ı. Personalized blood glucose prediction: A hybrid approach using grammatical evolution and physiological models</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>12</volume>
          (
          <issue>11</issue>
          ):
          <fpage>e0187754</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>[Food and Drug Administration</source>
          ,
          <year>2016</year>
          ]
          <string-name>
            <given-names>U.S.</given-names>
            <surname>Food</surname>
          </string-name>
          and
          <string-name>
            <given-names>Drug</given-names>
            <surname>Administration</surname>
          </string-name>
          .
          <article-title>Self-monitoring blood glucose test systems for over-the-counter use</article-title>
          . https://www.fda.gov/ downloads/ucm380327.pdf,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [Graf et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>Anneke</given-names>
            <surname>Graf</surname>
          </string-name>
          ,
          <article-title>Sybil A</article-title>
          .
          <string-name>
            <surname>McAuley</surname>
          </string-name>
          ,
          <string-name>
            <surname>Catriona Sims</surname>
            , Johanna Ulloa,
            <given-names>Alicia J.</given-names>
          </string-name>
          <string-name>
            <surname>Jenkins</surname>
            , Gayane Voskanyan,
            <given-names>and David N. O</given-names>
          </string-name>
          <string-name>
            <surname>'Neal.</surname>
          </string-name>
          <article-title>Moving toward a unified platform for insulin delivery and sensing of inputs relevant to an artificial pancreas</article-title>
          .
          <source>Journal of Diabetes Science and Technology</source>
          ,
          <volume>11</volume>
          (
          <issue>2</issue>
          ):
          <fpage>308</fpage>
          -
          <lpage>314</lpage>
          ,
          <year>2017</year>
          . PMID:
          <volume>28264192</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [Hidalgo et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>J</given-names>
            <surname>Ignacio Hidalgo</surname>
          </string-name>
          ,
          <string-name>
            <surname>J Manuel Colmenar</surname>
          </string-name>
          , Gabriel Kronberger, Stephan M Winkler,
          <string-name>
            <given-names>Oscar</given-names>
            <surname>Garnica</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Juan</given-names>
            <surname>Lanchares</surname>
          </string-name>
          .
          <article-title>Data based prediction of blood glucose concentrations using evolutionary methods</article-title>
          .
          <source>Journal of medical systems</source>
          ,
          <volume>41</volume>
          (
          <issue>9</issue>
          ):
          <fpage>142</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>[Juvenile Diabetes Research Foundation</source>
          ,
          <year>2018</year>
          ] Juvenile Diabetes Research Foundation. http://www.jdrf.org/research/artificialpancreas/. http://www.jdrf.org/research/ artificial-pancreas/,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[Marling and Bunescu</source>
          , 2018]
          <string-name>
            <given-names>C.</given-names>
            <surname>Marling</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Bunescu</surname>
          </string-name>
          .
          <article-title>The OhioT1DM dataset for blood glucose level prediction</article-title>
          .
          <source>In The 3rd International Workshop on Knowledge Discovery in Healthcare Data</source>
          , Stockholm, Sweden,
          <year>July 2018</year>
          . CEUR proceedings in press, available at http://smarthealth.cs.ohio.edu/bglp/OhioT1DM-datasetpaper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <source>[Medtronic</source>
          , 2010] Medtronic.
          <article-title>The basics of insulin pump therapy - medtronic diabetes</article-title>
          . https://www. medtronicdiabetes.com/sites/default/ files/library/download-library/workbooks/ BasicsofInsulinPumpTherapy.pdf,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Mirshekarian et al.,
          <year>2017</year>
          ]
          <string-name>
            <given-names>Sadegh</given-names>
            <surname>Mirshekarian</surname>
          </string-name>
          , Razvan Bunescu, Cindy Marling, and
          <string-name>
            <given-names>Frank</given-names>
            <surname>Schwartz</surname>
          </string-name>
          .
          <article-title>Using lstms to learn physiological models of blood glucose behavior</article-title>
          .
          <source>In EMBC</source>
          ,
          <year>2017</year>
          , pages
          <fpage>2887</fpage>
          -
          <lpage>2891</lpage>
          . IEEE,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [Pappada et al.,
          <year>2008</year>
          ]
          <string-name>
            <surname>Scott M Pappada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Brent D</given-names>
            <surname>Cameron</surname>
          </string-name>
          , and Paul M Rosman.
          <article-title>Development of a neural network for prediction of glucose concentration in type 1 diabetes patients</article-title>
          .
          <source>Journal of diabetes science and technology</source>
          ,
          <volume>2</volume>
          (
          <issue>5</issue>
          ):
          <fpage>792</fpage>
          -
          <lpage>801</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [Pedregosa et al.,
          <year>2011</year>
          ]
          <string-name>
            <given-names>F.</given-names>
            <surname>Pedregosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Varoquaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gramfort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thirion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Grisel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Blondel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Prettenhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Weiss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dubourg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Vanderplas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cournapeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Perrot</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Duchesnay</surname>
          </string-name>
          .
          <article-title>Scikit-learn: Machine learning in Python</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>12</volume>
          :
          <fpage>2825</fpage>
          -
          <lpage>2830</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Plis et al.,
          <year>2014</year>
          ]
          <string-name>
            <given-names>Kevin</given-names>
            <surname>Plis</surname>
          </string-name>
          , Razvan C Bunescu, Cindy Marling, Jay Shubrook, and
          <string-name>
            <given-names>Frank</given-names>
            <surname>Schwartz</surname>
          </string-name>
          .
          <article-title>A machine learning approach to predicting blood glucose levels for diabetes management</article-title>
          .
          <source>In AAAI Workshop: Modern Artificial Intelligence for Health Analytics, number 31</source>
          , pages
          <fpage>35</fpage>
          -
          <lpage>39</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <source>[Tresp and Briegel</source>
          , 1998]
          <string-name>
            <given-names>Volker</given-names>
            <surname>Tresp</surname>
          </string-name>
          and
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Briegel</surname>
          </string-name>
          .
          <article-title>A solution for missing data in recurrent neural networks with an application to blood glucose prediction</article-title>
          .
          <source>In NIPS</source>
          , pages
          <fpage>971</fpage>
          -
          <lpage>977</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [World Health Organization,
          <year>2016</year>
          ] World Health Organization.
          <source>Global report on Diabetes. World Health Organization</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [Zecchin et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zecchin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Facchinetti</surname>
          </string-name>
          , G. Sparacino, G. De Nicolao, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Cobelli</surname>
          </string-name>
          .
          <article-title>Neural network incorporating meal information improves accuracy of short-time prediction of glucose concentration</article-title>
          .
          <source>IEEE Transactions on Biomedical Engineering</source>
          ,
          <volume>59</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1550</fpage>
          -
          <lpage>1560</lpage>
          ,
          <year>June 2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>