Machine Learning Supported Next-Maintenance Prediction
                     for Industrial Vehicles
                 Sachit Mishra                                          Luca Vassio                               Luca Cagliero
            Politecnico di Torino                                  Politecnico di Torino                       Politecnico di Torino
                 Turin, Italy                                           Turin, Italy                                Turin, Italy
      sachit.mishra@studenti.polito.it                             luca.vassio@polito.it                      luca.cagliero@polito.it

                 Marco Mellia                                         Elena Baralis                               Riccardo Loti
             Politecnico di Torino                                 Politecnico di Torino                            Tierra Spa
                  Turin, Italy                                          Turin, Italy                                Turin, Italy
            marco.mellia@polito.it                                elena.baralis@polito.it                   rloti@tierratelematics.com

                                                                     Lucia Salvatori
                                                                        Tierra Spa
                                                                       Turin, Italy
                                                                 lsalvatori@topcon.com

ABSTRACT                                                                         time, oil pressure, temperature, engine speed). Domain experts
Industrial and construction vehicles require tight periodic main-                can thus monitor vehicle state in order to understand which
tenance operations. Their schedule depends on vehicle character-                 maintenance actions need to be performed. In recent years, the
istics and usage. The latter can be accurately monitored through                 significance advances of Internet of Things (IoT) and Big Data
various on-board devices, enabling the application of Machine                    analytics technologies have fostered the development of smart
Learning techniques to analyze vehicle usage patterns and design                 predictive maintenance solutions [15]. Specifically, the analysis
predictive analytics. This paper presents a data-driven application              of CAN bus data by means of data mining and machine learning
to automatically schedule the periodic maintenance operations                    techniques allow us to (i) predict the future vehicle usage by
of industrial vehicles. It aims to predict, for each vehicle and date,           means of classification and regression techniques (e.g., [7, 10]),
the actual remaining days until the next maintenance is due. Our                 (ii) aggregate vehicles with similar characteristics using cluster-
Machine Learning solution is designed to address the following                   ing techniques, (e.g., [1, 4]), and (iii) identify malfunctioning of
challenges: (i) the non-stationarity of the per-vehicle utilization              specific vehicle components (e.g., [6, 15]).
time series, which limits the effectiveness of classic scheduling                    Optimizing maintenance activities of fleet vehicles is a priority
policies, and (ii) the potential lack of historical data for those               in several industrial processes [2]. In fact, technological applica-
vehicles that have recently been added to the fleet, which hinders               tions related to activity planning and resource management are
the learning of accurate predictors from past data. Preliminary                  crucial for efficiently handling logistics [14]. This paper proposes
results collected in a real industrial scenario demonstrate the                  a machine learning approach to support the smart planning of
effectiveness of the proposed solution on heterogeneous vehi-                    the fleet maintenance operations in a industrial context.
cles. The system we propose here is currently under deployment,                      Data-driven vehicle maintenance planning has already been
enabling further tests and tunings.                                              addressed using various optimization methods. For example,
                                                                                 in [11] the authors have applied Genetic Algorithms to plan
                                                                                 the maintenance of geographically distributed assets by consid-
1    INTRODUCTION                                                                ering routing constraints and travel time to reach the assets. The
Fleets of industrial and construction vehicles are subject to pe-                authors in [12] have presented a dynamic optimization method to
riodic maintenance. These vehicles are of different models and                   plan maintenance of heavy vehicles by jointly scheduling main-
types. Thus, they require to plan actions of different type and                  tenance operations and production activities, whereas in [8] the
frequency. According to the current vehicles’ workload, mainte-                  authors propose a data-driven simulation framework for planning
nance schedule often changes. For example, some vehicles could                   snow removal projects considering weather and truck-related
remain unused for a relatively long period of time, then be moved                data acquired by real-time sensors. All the aforesaid strategies
to a construction site, and keep working at full capacity for many               are possible if accurate predictions of next maintenance events
days or weeks. The heterogeneity of industrial vehicle usage                     are available.
has indeed prompted the need for tracking their utilization and                      Contribution. This work presents an application of regres-
automating the schedule of maintenance operations [3].                           sion techniques to predict for a given vehicle when the next
   In the context of industrial vehicle management, the advent                   maintenance will be due. Specifically, it predicts the number of
of CAN bus technology has enabled the design of data-driven                      days left to the next maintenance operation, based on the series
decision-making processes [5]. The CAN bus provides access to                    of past daily utilization levels and on the current time of usage
various signals describing the vehicle usage state (e.g., working                left to the next maintenance. The application faces three of the
                                                                                 main issues related to fleet maintenance planning: (i) Vehicle
Copyright © 2020 for this paper by its author(s).
Published in the Workshop Proceedings of the EDBT/ICDT 2020 Joint Confer-        heterogeneity: The industrial vehicles in a fleet are commonly
ence (March 30-April 2, 2020, Copenhagen, Denmark) on CEUR-WS.org.               rather heterogeneous, in terms of number, type, and frequency of
Use permitted under Creative Commons License Attribution 4.0 International (CC   the necessary maintenance operations. This makes the planning
BY 4.0)
                                                                                 of these activities particularly challenging and time-consuming
                                                                                                                           v
for fleet managers. (ii) Non-stationarity of the utilization series:     least half of the usage in one cycle ( T2 ) is already available. (iii)
                                                                                                                             v
Vehicle usage pattern levels are rather irregular, whereas main-         New: If the vehicle has been used for less than T2 seconds since
tenance actions rather frequent. Seasonal trends may depend              the beginning of the data acquisition phase.
on vehicle type, model, and context of use. (iii) Lack of data for
new vehicles: When a vehicle is added to the fleet, usage data           2.1      Error computation
is typically not available. This hinders the learning of accurate        To effectively support fleet managers in planning periodic vehicle
machine learning models, which requires historical data to train         maintenance, our prediction system is tailored to a specific goal
reliable predictors.                                                     that we encode by considering specific error function definition.
    To handle heterogeneous fleets, we train a separate regression       We define three errors for each vehicle v: the daily error Ev (t),
model per vehicle. Each regressor analyzes the vehicle usage                               v
                                                                         the global error EGlobal  , and the Mean Residual Error EvM RE (D).
                                                                                                                                         e
patterns and the current time to maintenance of a specific vehi-
                                                                            The daily error counts on each day t the gap between the
cle. We conduct the analysis on 24 industrial and construction
                                                                         predicted and actual values of the next day of maintenance:
vehicles of different models. To handle the non-stationarity of
the analyzed series, we incorporate the historical usage levels
                                                                                          Ev (t) = Dv (t) − DvP r edict (t)            (2)
in the predictive models and train both linear and non-linear
models. Finally, to overcome the lack of data related to new ve-           The global error is a mean of the daily errors over all the N v
hicles, we combine the regressor outcomes achieved on similar            samples related to the vehicle under analysis, i.e.,
vehicles. The presented application is complementary to existing                                            ÍN v v
optimization-based planning strategies, e.g., [8, 11], providing                                      v       t =1 E (t)
                                                                                                     EGlobal =                            (3)
the fleet management system with specific hints on future vehi-                                                  Nv
cle usage states. The results, achieved in real industrial scenario,         The global error combines the daily errors together, but it
show substantial improvements achieved by applying non-linear            does not consider nor weight the time that is left for the next
regression models compared to classical statistics-based or linear       maintenance. In other words, an error of 1 day when we are close
models. In light of the achieved results, the data owner (collecting     to the maintenance (e.g., Dv (t) = 1 ) is considered as equal as an
telematics data from real industrial vechicles) has decided to put       error when we are far from the maintenance (e.g., Dv (t) = 100).
the present application under deployment, thus enabling further          In order to solve this issue, the mean residual error is considered.
tests, tunings, and extensions.                                          It is the mean of the daily errors over specific days. In particular
    This paper is organized as follows. Sections 2 formalizes the        we want to compute the average only for specific values of Dv ,
problem, while Section 3 describes the dataset and its prepara-          contained in a set D.e D e consists of a selection of days that are
tion phase. Section 4 presents the data-driven methodologies.            closer to the maintenance operation, for each maintenance cycle.1
Section 5 summarizes the main experimental results. Finally, Sec-        EvM RE is computed as follows:
tion 6 draws conclusions and summarizes the future research                                              Í              v
                                                                                                               v    e E (i)
agenda.                                                                                            e = i:D (i)∈D
                                                                                           EvM RE (D)                                     (4)
                                                                                                           i : Dv (i) ∈ De
2    PROBLEM STATEMENT
                                                                            The idea behind EvM RE is that fleet managers are mainly in-
Given an arbitrary industrial or construction vehicle v, our goal        terested in getting accurate predictions when the vehicles are
is to predict when the next maintenance operation for v will be          towards the end of their maintenance cycle, i.e., when mainte-
due. Let N v be the number of days for which historical data about       nance operations need to be scheduled soon. Therefore, our main
v usage is available and let T v be the allowed usage times (in          objective is to minimize EvM RE (D).
                                                                                                          e
seconds) for v between two consecutive maintenance operations.
The period from one maintenance operation to the next one will           3     DATA PREPARATION
be hereafter denoted as a cycle. The count of the number of days
                                                                         The application presented in this paper has been developed and
left to the next maintenance of vehicle v varies day by day. Let
                                                                         tested on real vehicle data provided by Tierra S.p.A.2 , a company
Dv (t) be the series of the aforesaid daily counts. Our aim is to
                                                                         that provides IoT solutions for monitoring vehicles of multiple
predict Dv (t), where t denotes the current day. The series used
                                                                         vendors. The dataset consists of historical usage of 24 heteroge-
to drive the prediction are enumerated below. For each vehicle v:
                                                                         neous vehicles acquired over a 4 year period (from January 2015
    • U v (t): series of the daily utilization of vehicle v.             to September 2019). For each vehicle, we consider the informa-
    • C v (t): series of the counts of the number of days already        tion coming from the CAN bus. Onboard sensors and Machine
      passed from the last maintenance operation.                        Control Systems generate messages for CAN at a frequency of
    • Lv (t): series of the utilization times left to the next main-     approximately 100 Hz. Each message is collected by a controller
      tenance operation. On an arbitrary day t, it is computed           which processes it, periodically generates a summary report, and
      as follows:                                                        sends it to a cloud server [7].
                                                                            To prepare vehicle data for the present study, the input CAN
                                       t −1
                                       Õ                                 bus data goes through a series of steps: (i) Data Cleaning, (ii) Nor-
                  Lv (t) = T v −                   U v (i)        (1)    malization, (iii) Aggregation, (iv) Enrichment and (v) Transforma-
                                   i=t −C v (t )                         tion. A more detailed description of each of the above-mentioned
    In the following, we define three categories of vehicles accord-     steps is given below. Data cleaning entails properly handling
ing to the amount of historical data that is currently available:        missing values and inconsistent values. Data normalization al-
(i) Old: If at least one maintenance cycle has already been com-         lows us to scale the values of the utilization times to a uniform
pleted since data acquisition has started. (ii) Semi-new: If the first                                                          e = {1, . . . , 29}.
                                                                         1 We have considered the last 29 days per cycle, i.e., D
maintenance cycle has not been completed yet, but data about at          2 https://www.tierratelematics.com/
                 50000                                                                  400
                             v1                                                                                                   v1
                             v2
                 40000                                                                                                            v2
                                                                                        300

                 30000


                                                                               Dv (t)
       U v (t)


                                                                                        200
                 20000

                                                                                        100
                 10000

                     0                                                                    0
                         0        20   40       60         80                                 0        200      400       600          800
                                        t                                                                        t


Figure 1: Daily utilization in seconds (U v ) in function of            Figure 2: Target variable number of days left to the next
the day in the series (t) for two sample vehicles.                      maintenance (Dv ) with respect to day in the series (t).
                                                                        Many cycles are shown for two sample vehicles.

value range (e.g., from 0 to 1) thus avoiding to introduce bias in
regression model learning [13]. Data aggregation, enrichment
                                                                                        150
and transformation aim at providing predictive algorithms with                                    v1
an appropriate set of features describing the usage patterns by                         125       v2
aggregating data at the desired time granularity. For each vehicle
                                                                                        100
we transformed the raw CAN bus data to produce the input fea-                  Dv (t)
tures described in Section 2. Specifically, in our case of study, we                     75
primarily focus on daily-usage time series U (t), i.e., the amount
                                                                                         50
of time each vehicle worked on each day.
                                                                                         25

3.1     Preliminary data exploration                                                      0
                                                                                              0   500000     1000000   1500000   2000000
Figure 1 plots part of the series of daily utilization seconds, i.e.,                                         Lv (t)
U v (t), for two sample vehicles. Curves show that vehicle utiliza-
tion patterns are rather heterogeneous. Vehicle v 1 has a daily
utilization of about 20 000-30 000 seconds, with few days without       Figure 3: Utilization seconds left to maintenance (Lv ) vs.
usage every 10-15 working days. On the other hand, vehicle v 2          the number of days to maintenance (Dv ) for a single cycle
remains almost unused for several weeks (from t=0 to t=40) and          of two vehicles.
then suddenly changes its usage pattern.
   After a fixed time amount of usage (we have considered T v =
2 000 000 seconds), every vehicle needs to go under maintenance.
Notice that we do not know whether maintenance operations               4    METHODOLOGY
have actually been performed or how long they take (from T v            In this section we present the data mining process designed
on). Figure 2 shows two examples of target series Dv (t), with          to address the problem under analysis. We propose different
many shown cycles. When Dv (t) reduces to zero, the vehicle             methodologies for vehicles when they are recognized as new,
goes to maintenance. Then a new maintenance cycle starts, the           semi-new, or old (see Section 2). For each old vehicle we have
number of days left to maintenance is maximal, and it mono-             enough data in order to train a prediction model on its own past.
tonically decreases (one day for each day passed) until the next        Semi-new and new vehicles will be separately handled.
maintenance operation is carried out. Notice how v 1 has a first            For each vehicle v we generate a relational dataset containing
longer cycle (221 days), while the others are more constant and         the historical utilization series U v (x) [13]. More specifically, each
homogeneous, with length between 65 and 105 days.                       record corresponds to a different day t and consists of a set
   In Figure 3 we show the number of days left to maintenance           of attributes denoting the past utilization levels (in seconds).
Dv (t) with respect to the number of utilization seconds left for       Given a window size W , the attributes include the values U v (x)
the next maintenance Lv (t). The functions seems to have a con-         [t − W ≤ x ≤ t − 1]. Along with the utilization level series, the
stant rate when Lv (t) is closer to zero, reflecting that for most of   attributes include the current time left until the next maintenance,
the time the utilization rate is relatively constant and different      i.e., Lv (t), and the target variable, i.e., the number of days left to
from zero. However, there are some vertical steps, corresponding        maintenance, i.e., Dv (t).
to consecutive days when the utilization was null. This confirms            Many machine learning techniques work better with more
that the presence of low- or zero-utilization periods has a serious     data, hence we desire to increase the number of records per
impact on the target variable. Thus, predicting the correct target      vehicle. Since we do not know when vehicle actually had the
value could be challenging. Hopefully, it is unlikely to see long       maintenance done, we can shift the time reference, i.e., chang-
periods of zero-utilization in the days approaching the deadline.       ing the first starting day t = 0, without introducing errors. We
This reinforces the motivations behind using EvM RE (D)     e as ref-   randomly re-sampled multiple times the time reference starting
erence error metric, considering values relatively close to the         from different time points within the training data and build the
maintenance.                                                            utilization series.
4.1    Approaches                                                            record of the training dataset.
We apply three different methods: (i) a baseline model, relying on           Support Vector Regressor (SVR): It is among the most effec-
simple estimate, (ii) a univariate model, whose predictions rely on          tive solutions to address regression and classification problems.
a single variable, and (iii) a multivariate model, where the model           Given a multidimensional training data representation, it finds an
considers multiple series values. These are standard methods for             hyper-plane separating points belonging to different target value
time series forecasting. Novelty relies in the categorization of the         ranges. According to the kernel function used to derive the hy-
vehicles and data engineering.                                               perplane, the predictive model can be either linear or non-linear
                                                                             (e.g., polynomial, sigmoid, rbf). Due to the high computational
   4.1.1 Baseline algorithm: With this baseline approach we sim-             complexity of non-linear kernels, in the remaining of the paper
ply predict when the next maintenance will be due assuming that              we focus on linear SVR (LSVR).
the utilization is constant and equal to the average utilization in          Random Forest regressor (RF): It is an established ensemble
the past. Hence, we compute the average utilization of vehicle v             method combining the predictions of multiple decision trees. De-
in the past (training set of size Tt r ain ). Then we use it for predict-    cision trees are the most popular non-linear mapping functions
ing the number of remaining days until the next maintenance is               between non-predictive and predictive variables. They rely on
due. The average utilization is defined as follows:                          tree-based structures. The Random Forest Regression averages
                                   ÍTt r ain                                 the predictions made by various decision tree models, which are
                                     i=1       U v (t)                       trained on different bootstraps (i.e., samples of the training data
                       AV G v =                             (5)
                                 Tt r ain                                    with replacement).
   Let DvBL (t) be the number of days left to next maintenance               Histogram-based gradient boosting (XGB): it is a popular en-
predicted by the Baseline algorithm at time t as:                            semble method relying on a boosting strategy. It minimizes the
                                                                             prediction loss by combining many decision tree regressors.
                               Lv (t)
                           DvBL (t) =                   (6)
                               AV G v                                           Additional models can be straightforwardly added and tested
  We will denote the baseline method as BL thoroughout the                   in the deployed version of the system. Notice that some models
paper.                                                                       (e.g., Neural Networks) have not been included in this first release
   4.1.2 Univariate regression model. We apply a univariate re-              due to the lack of a sufficiently large amount of training data.
gression model FU R to predict the number of days left to main-
tenance for a given vehicle v based on the last value of the daily
utilization seconds series Lv :                                              4.3    Methodology for old vehicles
                         v                v                                  Old vehicles are assumed to have a sufficiently large amount of
                        DU R (t) = FU R (L (t))                        (7)
                                                                             historical data to train reliable Machine Learning models (see
   4.1.3 Multivariate regression model. We extend the univariate             Section 2). Thus, separately for each vehicle we train the multiple
regression model to a multivariate context in order to consider              regression models described in the previous section. Among the
the temporal correlation between the target variable and the                 trained models, we select those that minimizes the mean residual
previous series values.                                                      error over the last 29 days predicting the maintenance (EvM RE (D)  e
                                                                             with D = {1, . . . , 29}). For each vehicle, we consider the first 70%
                                                                                   e
        DvM R (t) = FM R (Lv (t), U v (t − 1), . . . , U v (t − W ))   (8)   of their samples (N v ) as training set, and the remaining part as
                                                                             test set.
   Unlike the univariate model, the model formalized in Equa-
tion 8 does not consider only the last value of the daily utilization
time but also the most recent values of the historical utilization
series within a size-W window time interval, i.e., from t − W to             4.4    Methodology for new and semi-new
t − 1 (where W is a user-specified parameter).                                      vehicles
                                                                             To handle new and semi-new vehicles, we need to face the follow-
4.2    Regression algorithms                                                 ing issues: (i) The lack of historical usage data, which hinders the
Univariate and multivariate regression models can be solved                  training of per-vehicle regressors. (ii) The first maintenance cycle
with linear or non-linear models. Linear models are deemed                   of most vehicles appears to have peculiar characteristics, with
as appropriate whenever usage patterns are quite constant for                less usage. Indeed, the mean daily utilization time spent by the
most of the time. Conversely, non-linear models are potentially              vehicles within the first cycle (10 676 seconds) is approximately
able to capture more complex, non-stationary usage trends. As                30% lower than in the subsequent cycles (13 792 seconds).
a drawback, the complexity of non-linear models is typically                    We design ad-hoc strategies to predict Dv for semi-new and
higher than those of linear ones.                                            new vehicles. We consider as training data the utilization series
    The deployed system allows fleet managers to select an ac-               in the first cycle of many old vehicles. Collecting in the training
ceptable trade-off between the accuracy and complexity based                 set only usage data related to the first maintenance cycle allows
on empirical evidences. The results of a preliminary performance             Machine Learning models to focus on the usage patterns peculiar
evaluation on real vehicle data are reported in Section 5. The               to that usage period (which could be significantly different all
models that have already been integrated and tested are briefly              from the subsequent ones). We take 70% of the 24 vehicles (i.e.,
described below. A more detailed description is given in [13].               17 vehicles), and consider their complete first cycle as training
Linear Regression (LR): It is the simplest linear model. It learns           set. The first cycle of the remaining 30% of the vehicles (i.e., 7) is
a linear function minimizing the residual sum of squares between             considered as test set.
the predicted target value and the expected target value in the
Table 1: E M RE ({1, . . . , 29}) with models trained on all data
                                                                                                        60
and models trained in the last 29 days before maintenance,                                                         BL
i.e., at times i : D(i) ∈ D  e = {1, . . . , 29}                                                                   LR
                                                                                                        40         LSVR
                   Trained on all data        Trained on D = {1, . . . , 29}                                       RF


                                                                                      Improvement(%)
      Algorithm
                  E M R E ({1, . . . , 29})       E M R E ({1, . . . , 29})                                        XGB
                                                                                                        20
         BL                 20.2                            20.2
         LR                 26.1                            10.8
        LSVR                13.3                            6.1                                          0
         RF                  6.9                             2.4
        XGB                 10.9                             5.6
                                                                                                       −20

    4.4.1 Semi-new vehicles. When more than half of the first cy-
cle has been completed, we learn a regression model by combin-                                         −40
                                                                                                             0     3         6         9         12          15        18
ing data from the other training vehicles. We apply the following                                                                      W
strategies:
Baseline: we follow the same approach as for old vehicles. In
practice, we compute AV G v as the average utilization in the                  Figure 4: Improvement (%) for each algorithm by increas-
first half of the first cycle, i.e, t :Íx ≤t U v (x )≤T v /2 (U v (t)/|t :
                                   Í                                           ing the number of features. W is the window of past usage
Í         v         v                                                          in the time series U v (t).
   x ≤t U (x) ≤ T /2)|).
Unified ML model: We create a single model, hereafter denoted
as ModelU ni , on the first cycle data of the training vehicles. In            Table 2: Best setting for features and the corresponding
this case, we train a single regression model for all the semi-new             mean relative error of the different algorithms.
vehicles by merging data acquired from all the training vehicles
together. The same model is applied to all the test vehicles.                                          Algorithm       Best window W       E M R E ({1, . . . , 29})
Similarity-based ML model: We pick usage data only for the                                                BL                0                         20.2
most correlated vehicle (rather than for all the old vehicles)                                            LR                 0                        10.8
                                                                                                         LSVR                6                         5.2
and train vehicle-specific Machine Learning models, denoted                                               RF                18                         1.3
as Model Sim , on it. The key idea is to first decide whether each                                       XGB                12                         4.2
vehicle is similar to the target semi-new vehicles or not by es-
timating the pairwise correlation between the utilization series
acquired in the first half of the first cycle. Then, we train the              (central column) and just in the the last 29 days of the cycles in
regression model only the first cycle data of the selected vehicle.            the training data (right hand-side column). E M RE ({1, . . . , 29})
In the current implementation, we estimate the pairwise similar-               is the average of the mean residual errors EvM RE ({1, . . . , 29})
ity in terms of point-wise average distance AV G v between the                 computed over all the test vehicles.
utilization series. However, more advanced similarity measures                     We found out that by forcing the algorithm to train only on
(e.g., [9]) can be integrated as well.                                         the last 29 days, the error was reduced by 59% in LR, 54% in LSVR,
                                                                               65% in RF and 48% in XGB. Since BL is not trained, its results
   4.4.2 New vehicles. For these vehicles we have very few or                  do not change. In general, RF presents the best results, with an
none data at all, hence we cannot even compute AV G v . There-                 average relative error of only 6.9 and 2.4 days, respectively for
fore, the baseline and similarity-based ML models cannot be                    the two training sets. Second best results are obtained by XGB,
applied. Hence, we apply the Unified ML model (ModelU ni ). No-                closely followed by LSVR.
tice that, when dealing with new vehicles, it does not make sense                  Now we delve into the study of the usage of different features.
to compute EvM RE ({1, . . . , 29}) since when we are approaching              Figure 4 shows, for each algorithm, the performance variation
the deadline the vehicle will already be semi-new. Hence, we                   (percentage) by increasing the window size W. Positive variation
focus on comparing the algorithm performance in terms of the                   means decrease of the error shown in Table 1 (i.e., performance
              v
global error EGlobal .                                                         improvement), and vice-versa. W is the window of past usage in
                                                                               the time series U v (x).W equal to 0 means we are in the univariate
5     EXPERIMENTAL RESULTS                                                     case, while W > 0 means we are in the multivariate case. For
We have tested the proposed methodologies on real vehicles of dif-             example, W equal to 3 means usage of U (t −1), U (t −2) and U (t −3)
ferent categories (new, semi-new, and old). The experiments were               in the regression (other than L(t)). BL is obviously constant, since
performed on a machine with Intel(R) Core(TM) i7-8750H CPU                     it is not using past values of U . In RF and XGB, adding features
with 16 GB of RAM. To tune the algorithm parameter settings we                 greatly improve the results. RF and XGB seem to reach a plateau
have performed, separately for each vehicle, a grid search using               in performance with more than 15 previous value of utilization
a 5-fold cross validation. Specifically, for RF and XGB we have                U . Hence, having more than a couple of weeks of previous data
tuned the maximum tree depth from 3 to 50, and the number of                   is enough to reach good performance. Respectively, the results
estimators from 10 to 1000. For SVR, we tested the linear kernel               improved by 44% and 25% for RF and XGB. In LR the best results
and varied the values of the parameters epsilon (from 0.5 to 2.5)              are instead obtained without adding features. Finally, for LSVR
and C (from 0.01 to 100).                                                      the results improve by adding up to 6 features, but then decrease
                                                                               again. We report in Table 2 for each algorithm the final best
5.1    Results for old vehicles                                                results for E M RE ({1, . . . , 29}) and the optimal size of the window
In Table 1 we show the values of E M RE ({1, . . . , 29}) achieved on          W . The non-linear algorithms (XGB and RF) reach again the best
the test set by training each algorithm on the whole training data             results.
                                                                               Table 3: Results for semi-new and new vehicles
           40
                    BL
                    LR                                                                                Semi-new vehicles          New vehicles
                                                                                      Algorithm
           30       LSVR                                                                             E M R E ({1, . . . , 29})    EGl obal
                                                                                         BL                    34.9                   -
       EM RE (D)


                    RF
              e


                    XGB                                                                LR S im                  4.9                   -
           20                                                                         LSV R S im                6.2                   -
                                                                                       RF S im                  2.9                   -
                                                                                      X G B S im                5.3                   -
           10                                                                          LRU ni                   5.1                 27.2
                                                                                      LSV RU ni                 8.8                 27.8
                                                                                       RFU ni                   3.2                 30.1
            0                                                                         X G BU ni                 4.2                 17.9
            {0}    {5}     {10}   {15}   {20}    {25}    {30}
                                   De

                                                                        to train reliable predictors, (ii) based on data acquired from sim-
Figure 5: Mean relative error analyzed over all the test ve-            ilar vehicles, when the vehicle is semi-new (i.e., the first cycle
hicles computed for De ranging from 1 to 29 days.                       maintenance is partly completed), or (iii) vehicle-independent,
                                                                        when the vehicle under analysis is new. The achieved results
                                                                        show that, when a minimal amount of vehicle-related data is
   Finally we show in Figure 5 the error, i.e., E M RE (D)
                                                        e obtained      available, Machine Learning approaches relying on non-linear
with the best configuration, for each algorithm, in each of the last    models outperform both naive approaches and linear ML models.
29 days before maintenance. Here D  e is a set composed of a single        The deployed version of the current system will be further
value, corresponding to the number of days left for the the next        extended. Specifically, we plan to enrich regression models using
maintenance. Clearly, the closer to the deadline, the smaller the       contextual information (e.g., meteorological data, fleet move-
error. All the algorithms improve the poor performance of the           ments) and to design ML supported scheduling strategies.
baseline. Interestingly, RF gets very good results even when we
are 29 days from the deadline, with an average error of only 2.4.       REFERENCES
   We also analyze the time complexity of the proposed method-           [1] Carlos Alonso de Armiño, Miguel Ángel Manzanedo, and Álvaro Herrero.
                                                                             2019. Studying Road Transportation Demand in the Spanish Industrial Sec-
ologies, in terms of execution time on our machine. The whole                tor Through k-Means Clustering. In International Joint Conference SOCO’18-
methodology pipeline includes (i) Data preparation, (ii) Model               CISIS’18-ICEUTE’18. Springer International Publishing, Cham, 387–396.
                                                                         [2] Luis Barreto, Antonio Amaral, and Teresa Pereira. 2017. Industry 4.0 implica-
training, and (iii) Model Testing. Step (ii) was the most time-              tions in logistics: an overview. Procedia Manufacturing 13 (2017), 1245–1252.
consuming task for all the algorithms. The average training time         [3] T.G. Crainic and G. Laporte. 1998. Fleet Management and Logistics. Springer
on a single vehicle is 30.4 s for XGB and 8.1 s for RF, while BL,            US.
                                                                         [4] Zahid Halim, Rizwana Kalsoom, and Abdul Rauf Baig. 2016. Profiling drivers
LR, and LSVR are faster taking respectively 2.5 s, 3.8 s, and 2.8 s.         based on driver dependent vehicle driving features. Applied Intelligence 44, 3
Moreover, the model complexity increases more than linearly                  (01 Apr 2016), 645–664.
with the number of considered features, i.e., window size W.             [5] Karl Henrik Johansson, Martin Törngren, and Lars Nielsen. 2005. Vehicle
                                                                             Applications of Controller Area Network. Birkhäuser, Boston, MA, 741–765.
                                                                         [6] Q. Li, S. Meng, S. Wang, J. Zhang, and J. Hou. 2019. CAD: Command-Level
5.2    Results for new and semi-new vehicles                                 Anomaly Detection for Vehicle-Road Collaborative Charging Network. IEEE
                                                                             Access 7 (2019), 34910–34924.
Table 3 shows the results of the above discussed methods for             [7] Dena Markudova, Elena Baralis, Luca Cagliero, Marco Mellia, Luca Vassio,
                                                                             Elvio Amparore, Riccardo Loti, and Lucia Salvatori. 2019. Heterogeneous
semi-new vehicles. RF Sim has the best results among all mod-                Industrial Vehicle Usage Predictions: A Real Case. In Workshop Proceedings of
els, meaning that comparing the similarity of average usage can              the EDBT/ICDT 2019 Joint Conference (March 26, 2019, Lisbon, Portugal).
slightly improve the results (from 3.2 to 2.9 days in E M RE ). More-    [8] Emad Mohamed, Parinaz Jafari, Ming-Fung Francis Siu, and Simaan AbouRizk.
                                                                             2017. Data-Driven Simulation-Based Model for Planning Roadway Opera-
over, notice how the baseline approach performs badly, with a                tion and Maintenance Projects. In Proceedings of the 2017 Winter Simulation
mean relative error of 34.9, a value much higher than all the                Conference (WSC ’17). IEEE Press, Article Article 272, 12 pages.
other metrics. This is because the limited amount of past data in        [9] Rodica Neamtu, Ramoza Ahsan, Elke A Rundensteiner, Gabor Sarkozy, Ea-
                                                                             monn Keogh, Hoang Anh Dau, Cuong Nguyen, and Charles Lovering. 2018.
the semi-new test vehicle cannot be trusted to create a simple               Generalized dynamic time warping: Unleashing the warping power hidden
predictor based on average usage.                                            in point-wise distances. In 2018 IEEE 34th International Conference on Data
                                                                             Engineering (ICDE). IEEE, 521–532.
   In the last column of Table 3 we also report the results of          [10] Federico Perrotta, Tony Parry, and Luis C. Neves. 2017. Application of machine
the above discussed methods for new vehicles. Baseline and                   learning for fuel consumption modelling of trucks. In 2017 IEEE International
similarity-based models cannot be applied to new vehicle, since              Conference on Big Data, Boston. 3810–3815.
                                                                        [11] Mehdi Rashidnejad, Sadoullah Ebrahimnejad, and Jalal Safari. 2018. A bi-
there is no past data. XGBU ni has the best result among all mod-            objective model of preventive maintenance planning in distributed systems
els. Even if the results in terms of error appears poor, we recall           considering vehicle routing problem. Computers & Industrial Engineering 120
that the global error accounts also for dates that are far from the          (2018), 360 – 381.
                                                                        [12] Elodie Robert, Christophe Berenguer, Keomany Bouvard, Hoceane Tedie, and
deadline. Even more promising, the global error of XGBU ni is                Romain Lesobre. 2018. Joint dynamic scheduling of missions and mainte-
comparable with the baseline mean relative error in the case of              nance for a commercial heavy vehicle: value of on-line information. IFAC-
                                                                             PapersOnLine 51, 24 (2018), 837 – 842.
old vehicles (Table 2).                                                 [13] Pang-Ning Tan, Michael Steinbach, Anuj Karpatne, and Vipin Kumar. 2018.
                                                                             Introduction to Data Mining (2nd ed.). Pearson.
                                                                        [14] Dieter Uckelmann. 2008. A Definition Approach to Smart Logistics. In Next
6     CONCLUSIONS AND FUTURE WORK                                            Generation Teletraffic and Wired/Wireless Advanced Networking, Sergey Ba-
The paper has presented a Machine Learning application to sup-               landin, Dmitri Moltchanov, and Yevgeni Koucheryavy (Eds.). Springer Berlin
                                                                             Heidelberg, 273–284.
port maintenance planning for fleets of industrial and construc-        [15] W. Zhang, D. Yang, and H. Wang. 2019. Data-Driven Methods for Predictive
tion vehicles. It proposes to use regression techniques to predict           Maintenance of Industrial Equipment: A Survey. IEEE Systems Journal 13, 3
the remaining days until the next maintenance is due. The trained            (Sep. 2019), 2213–2227.

models are (i) vehicle-specific, when sufficient data is available