<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Invariant Feature Selection for Battery State of Health Estimation in Heterogeneous Hybrid Electric Bus Fleets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yuantao Fan</string-name>
          <email>yuantao.fan@hh.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohammed Ghaith Altarabichi</string-name>
          <email>mohammed_ghaith.altarabichi@hh.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sepideh Pashami</string-name>
          <email>sepideh.pashami@hh.se</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peyman Sheikholharam Mashhadi</string-name>
          <email>peyman.mashhadi@hh.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sławomir Nowaczyk</string-name>
          <email>slawomir.nowaczyk@hh.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Isafjordsgatan 28 A, 164 40 Kista, Research Institutes of Sweden (RISE)</institution>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Kristian IV:</institution>
          <addr-line>s väg 3, 301 18 Halmstad</addr-line>
          ,
          <institution>Center for Applied Intelligent Systems Research (CAISR), Halmstad University</institution>
          ,
          <addr-line>Halmstad</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Batteries are a safety-critical and the most expensive component for electric buses (EBs). Monitoring their condition, or the state of health (SoH), is crucial for ensuring the reliability of EB operation. However, EBs come in many models and variants, including diferent mechanical configurations, and deploy to operate under various conditions. Developing new degradation models for each combination of settings and faults quickly becomes challenging due to the unavailability of data for novel conditions and the low evidence for less popular vehicle populations. Therefore, building machine learning models that can generalize to new and unseen settings becomes a vital challenge for practical deployment. This study aims to develop and evaluate feature selection methods for robust machine learning models that allow estimating the SoH of batteries across various settings of EB configuration and usage. Building on our previous work, we propose two approaches, a genetic algorithm for domain invariant features (GADIF) and causal discovery for selecting invariant features (CDIF). Both aim to select features that are invariant across multiple domains. While GADIF utilizes a specific fitness function encompassing both task performance and domain shift, the CDIF identifies pairwise causal relations between features and selects the common causes of the target variable across domains. Experimental results confirm that selecting only invariant features leads to a better generalization of machine learning models to unseen domains. The contribution of this work comprises the two novel invariant feature selection methods, their evaluation on real-world EBs data, and a comparison against state-of-the-art invariant feature selection methods. Moreover, we analyze how the selected features vary under diferent settings.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;State of Health Estimation</kwd>
        <kwd>Invariant Feature Selection</kwd>
        <kwd>Transfer Learning</kwd>
        <kwd>Casual Discovery</kwd>
        <kwd>Genetic Algorithm</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The transition from fossil fuel to electromobility in the transportation ecosystem is well
underway and accelerating as an increasing number of hybrid and full-electric vehicles are
manufactured and commissioned. As the market size of electric buses is growing worldwide, they have
become a crucial part of a sustainable and environmentally friendly transportation solution
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The level of electrification in the buses depends mainly on the powertrain configuration,
which includes Battery Electric, Fuel Cell Electric, Hybrid Electric, etc. Hybrid electrification
is a stepping stone towards full electrification; so far, both are of similar market share in EU.
The most common battery type that powers the electric drive-line is the lithium-ion battery
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] since the technology was matured for broad mass more than a decade ago [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. As a result
of major manufacturers, like Volvo, gradually electrifying their products, new hybrid electric
buses are regularly launched for public transportation [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ]. These hybrid electric buses (HEBs)
are mounted with prismatic or cylindrical lithium-ion cells using liquid-cooled packs.
      </p>
      <p>
        However, the introduction of new electrical components means entering new, less well-known
territories. For example, operating conditions, e.g., types of operation or temperature, can have a
considerable impact on accelerated component wear in electric drive-line. This raises a question
about the robustness of the system. Most conventional on-board diagnostic and monitoring
methods [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref6 ref7 ref8 ref9">6, 7, 8, 9, 10, 11, 12</xref>
        ] for batteries mainly utilize the internal parameters of the battery
unit based on experiments in a laboratory environment. Notably, the degradation process of the
battery heavily depends on the usage pattern of the buses it powers. Often, the batteries and their
management unit are deployed to various bus fleets with diferent mechanical configurations
(e.g., single vs. double deck) and operations (e.g., urban vs. inter-city transportation). Therefore,
developing robust battery deterioration models that could generalize well from established to
novel settings provides great value to the industry.
      </p>
      <p>Drive batteries of HEBs are a good example of this challenge. Optimizing the deployment,
usage, and maintenance of these components requires a better understanding of the component’s
health under diverse operational settings. Such batteries are expensive, and prolonging their
efective life is crucial to achieving a sustainable transportation system, low-cost operation, and
market satisfaction. This requires insights into what factors are relevant to battery deterioration;
analyzing the importance of available onboard signals for SoH prediction is of great help. The
reliability of this expensive component becomes more important as the electromobility is being
implemented at scale. Continuous changes and improvements in the batteries’ technologies
challenge the experts and statistical models currently in use. For example, diferent generations
of batteries have diferent definitions for health indicators, and modeling one generation might
not transfer to the next one. Besides, the lack of historical data due to the relatively recent
deployment of the first electric dive lines in the real world leads to high model uncertainty.</p>
      <p>
        The State of Health (SoH) is defined as the ratio between the nominal capacity at time  and
the initial capacity [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. It is a commonly used health indicator for evaluating the aging and
deterioration level of the battery. Accurate estimation of SoH can be considered as a proxy
to estimate the remaining useful life (RUL) and predict the end-of-life (EOL) of batteries for
maintenance planning. Conventionally, the SoH is computed by a model developed by the
manufacturer of the battery, and they are usually based on the internal parameters of the
battery measured online [
        <xref ref-type="bibr" rid="ref13 ref6">6, 13</xref>
        ], e.g., current, voltage, and temperature of the single cells [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
Although the SoH estimated via battery management unit (BMU) units on-board was developed
and specialized for the target energy system; however, in reality, it may not always function
as intended, e.g., due to faults or mismatch of newly replaced sub-component versions. An
alternative is to employ a data-driven approach, using machine learning models, e.g., [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ],
based on sensor data collected onboard.
      </p>
      <p>
        In dynamic environments, the behavior of the systems is expected to change over time. In
such cases, the conditions under which a machine learning model is trained are likely to difer
from the conditions on which they are to be tested. Thus, the performance of machine learning
models deteriorates unless domain transfer is explicitly addressed. Moreover, the change in
conditions between domains is often associated with a change in the joint distribution of input
and output between source and target domains. These problems are commonly known as forms
of dataset shift [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and were addressed by transfer learning or domain adaptation methods [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
An example approach is to select only a subset of features that generalize well to a new domain
[
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. More concretely, in our experiments with modeling Li-Ion drive batteries, the degradation
behavior of hybrid buses varies considerably across diferent bus configurations and operating
conditions. This means that the features that are important for model training might not be
relevant for the diferent configurations on which it will be tested. Even worse, they could
have detrimental efects, leading to negative transfer. Our objective is to select features useful
not only for the task in the source domain(s) but also those that transfer well across diferent
domains, including the unseen ones. Modern electric buses are complex machines equipped
with many sub-systems controlled by electronic computing units and monitored by hundreds of
sensors. However, not all sensor data collected are helpful in estimating the SoH of the batteries.
Selecting a relevant subset of features that generalize to various domains, e.g., diferent EB
models and operating conditions, is crucial for building an efective SoH prediction model.
      </p>
      <p>
        In this paper, we propose two invariant feature selection methods, i.e., GADIF [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and CDIF,
and evaluate their performance for selecting a set of invariant features across diferent settings
to estimate the SoH of batteries. We compare the two proposed approaches with a causal
learning-based domain adaptation method that produces invariant models, presented by Rojas
et al. in [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], and several classical feature selection methods under transfer learning settings,
where the goal is to migrate to unseen domains. The experimental results show that using
invariant features selected by the proposed approaches leads to better generality of resulting
models.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        The vast majority of studies on data-driven methods for estimating the SoH of batteries (review
available in [
        <xref ref-type="bibr" rid="ref6 ref7 ref8 ref9">6, 7, 8, 9</xref>
        ]) assumes the target population is the same as the population available at
the training time. However, the deterioration characteristics of the battery might vary depending
on the usage pattern of the equipment it powers. Vehicles may be deployed to a new area, i.e.,
an unseen domain, where the terrain and trafic situation are very diferent from its peers in the
previous deployment. The same type of battery and the management unit may be installed in a
vehicle with diferent physical configurations. Under such circumstances, diferences between
vehicle populations, i.e., domains, need to be considered for SoH prediction.
      </p>
      <p>
        Recently, more works on SoH prediction were conducted from the perspective of transfer
learning, where the source domain is diferent from the target domain. The most popular
approach is to utilize temporal information and sequence modeling, e.g., the long short-term
memory (LSTM) and the gated recurrent unit (GRU). For the SoH prediction task, this is done
under an inductive transfer learning setting, i.e., a small amount of labeled data from the target
domain is available. In their work [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], Tan et al. presented an approach using model-based
transfer using an LSTM network for SoH Prediction. The base model was trained for the
generalization performance, and the last two fully connected layers were fine-tuned with a
subset. Che et al., in their work [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], have adopted gate recurrent neural networks for SoH
prediction under a transfer learning setting. The model was first trained using a relevant
battery and fine-tuned with an early degradation cycle from the testing battery. To address the
variability in the length of the cycle sequences, Zhou et al. presented an interpretable transfer
learning method [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] that uses cycle synchronization for data alignment and explored a few
variations of GRUs to estimate the SoH of batteries in the target domain.
      </p>
      <p>Genetic algorithms (GA) are widely used for feature selection [24, 25, 26] and shown to be
very promising in many applications [27, 28, 29, 30]. However, few works have considered the
scenarios in which heterogeneity exists between the training and testing samples. One needs to
account for useful features for SoH regression in the source domain being diferent from the
ones in the target domain.</p>
      <p>
        Causal feature learning [31, 32] is another promising approach for selecting informative
features. It exploits the causal relationships learned from the observations since these
relationships imply the underlying mechanism of the variables. In particular, causal relationships
that do not vary over diferent source domains are of great interest and are likely to be robust
and able to generalize to future domains. Rojas et al. have proposed, in their work [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], an
approach based on causal transfer learning to produce invariant models (IMCTL) that deal
with covariate shift in a subset of features. Similarly, Persello et al. presented a kernel-based
approach [33] for selecting features that minimize the dataset shift between the training and
testing data. Magliacane et al. proposed an approach [34] selecting a subset of features that
lead to the best predictions in the source domains while satisfying the invariance condition,
determined by a joint causal inference framework [35] by Mooij et al. Another perspective is to
learn a common set of features for multiple tasks [36, 37], presented by Argyriou et al. Work
[38] presented by Taghiyarrenani et al. focused on constructing a new feature space that aligns
multiple domains with conditional shifts in the joint distribution, using a novel loss function
considering pairwise similarities between samples from diferent domains. With their dedicated
efort to studying capacity degradation in batteries [ 39], Sahoo et al. have proposed a two-layer
artificial neural networks model, with the second layer being fine-tuned using a small portion of
the target data and a novel feature that yields a low generalization error on a battery of diferent
types and working conditions. Few works on causality-based feature selection have considered
the settings where the source domains are diferent from target domains [ 32]. Neither works
proposed by Magliacane et al. [34], and Rojas et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] have considered domains that consist
of time series data collected from multiple systems.
      </p>
      <p>In this study, we emphasize the specific challenges of EBs setting, which motivate the need
for proposed approaches, beyond state-of-the-art. The first one, inspiring GADIF, is employing a
iftness function for GA that takes into account heterogeneity in the available sample population
and extracting features that are invariant w.r.t. samples from diferent domains is of interest.
The second one, inspiring CDIF, is that causal graphs need to be learned from each system
individually, using data with time dependencies, and invariant features shall be selected based
on the presence of causal relationships across multiple diferent domains.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Method</title>
      <p>This study targets an unsupervised domain adaptation problem, to be specific, a multi-source
domain generalization problem, i.e., deploying machine learning models trained using labeled
pairs of sample observations from one or more source domains  = {1 , 2 , ...,  } to
an unseen target domain  , without any labels. Given a source domain  , and its entire
vehicle population  = {1, 2, ...,  }, the observations (labeled pairs) collected from vehicle
 are denoted using  = {(⃗(,1), (,1)), ..., (⃗(,), (,))}. In this case,  is the number of
vehicles, and  has dimensions of ( × ), where  is the number of samples and  is the
number of features. A sample instance ⃗(· ) is a -dimensional real-valued vector ⃗(· ) ∈ IR.
Data set of a source domain  is merely the union of samples from all vehicles  in that
domain, i.e.,  = ⋃︀ =1  . On the contrary, sample observations in the target domain 
are not labeled, i.e.,  = ⋃︀∈ {⃗(,1), ..., ⃗(,)}. Nevertheless, the feature space  of sample
instances in the source and the target domains are the same.</p>
      <p>
        In this study, we propose two approaches, i.e., using a genetic algorithm for domain invariant
features (GADIF) and causal discovery for selecting invariant features (CDIF), to select invariant
features that are expected to generalize based on several source domains {1 , 2 , ...,  },
where  is the number of source domains of which labeled data is available. Similar to [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ],
our work relies on the assumption that if the conditional distribution of the target label given a
feature subset is invariant across several source domains, then the same is likely to hold in the
target domain. In other words, there exists a subset of features that is invariant in predicting
the SoH values across all domains. One way to verify how well this assumption holds is to
evaluate whether the presence of causal relation between any features to the target variable is
consistent across various domains (an illustration is available in Figure 2).
      </p>
      <sec id="sec-3-1">
        <title>3.1. Causal discovery for invariant features selection</title>
        <p>We propose to use a causal discovery algorithm to learn causal relations in each vehicle and
select a set of features that were the common causes for the target variable , among vehicles
across all source domains. More specifically, we consider the probabilistic causation between the
two variables and compute the probability of the causal relations, e.g., that the feature  causes
, presented in each source population as a scoring criterion for selecting invariant features.
An illustration of the CDIF method is presented in Figure 1; invariant features were extracted
from two sets of causal graphs learned from individual vehicles in every source domain. For
each vehicle , a causal graph  = (, ) is generated using a causal discovery method
with additive noise models, proposed by Hoyer et al. in their work [40], where  denotes a
set of nodes,  = {1, 2, ..., }, corresponding to the  number of features in the training
instances of vehicle , and  denotes a set of edges  , indicating causal relations direct from
feature  to feature  if  = 1, for vehicle . Correspondingly,  = 0 indicates no causal
relation directly from feature  to  .</p>
        <p>Given a source domain  and its corresponding vehicle population  , a card( ), i.e.
cardinally of set  , number of causal graphs were learned, and the number of occurrences of
any causal relation presented in this vehicle population was computed as ∑︀ca=r1d( )  . The
probability of any causal relation given a specific source domain  , with a vehicle population
 , is then:
 (  = 1 |  ∈  ) =
1</p>
        <p>∑︁  ,
card( ) ∈
where  indicates the causal relation of feature  to  in vehicle . In this work, we only
consider the target variable  as  . Given a number of source domains {1 , 2 , ...,  } and
its corresponding vehicle population {1 , 2 , ...,  }, we propose two criteria for selecting
invariant features:</p>
        <p>⋂︁
∈{1 ,2 ,..., }</p>
        <p>1
{ | card() ∑︁  ≥ ,  ∈ {1, 2, ..., }}
∈
 =
 =</p>
        <p>⋃︁
1,2∈{1 ,2 ,..., },
1̸=2
1 1
∑︁  − card(2) ∈2</p>
        <p>∑︁  | ≤ ,  ∈ {1, 2, ..., }}
{ | | card(1) ∈1
(1)
(2)
(3)
where  and  are the thresholds for selecting features  in  and  . Invariant features
included in  are the ones of high presence, determined by  , in the causal relation across all
source domains; features included in  are the ones of similar presence, determined by  , in at
least a pair of the source domains.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Genetic algorithm for domain invariant features</title>
        <p>
          The proposed Genetic Algorithm (GA) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] identifies good predictors of the target label
in  , based on their generalization performance across a number of source domains
{1 , 2 , ...,  } in a leave-one-domain-out cross-validation setting. The fitness
function of GADIF was designed to evaluate a feature subset  by testing its performance across a
number of source domains {1 , 2 , ...,  }.
        </p>
        <p>Our algorithm GADIF is initiated with a population of individuals encoding feature subsets
as chromosomes of binary strings, with 1 indicating the inclusion of the feature of the
corresponding index. The algorithm proceeds from one generation to the next by applying crossover
and mutation operators to the selected individuals in order to produce ofspring. The process
reassembles natural selection by choosing individuals with the best fitness as the most likely to
propagate to the following generation. GADIF uses a variant of GA called CHC [41] to carry out
the evolutionary search, which is essentially a wrapper method for feature selection, and can be
computationally expensive. Our feature selection method evaluates feature subsets according
to their performance across all available labeled source domains. The optimization search of
GADIF maximizes a fitness function that extends the equation provided by [42]:
 ( ) = 
+ (1 −  )(1 −


)
(4)
where  is the leave-one-domain-out cross-validation performance (e.g. mean absolute error) of
a machine learning model  trained in a wrapper setting using  , and  is a hyperparameter
for the tradeof between the model performance across several source domains P, and the
reduction in a number of selected features  with respect to the total number of features .</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experimental Results</title>
      <p>The application scenario we focus on in this study is to build robust machine learning models
using domain invariant features. That means features that generalize well from multiple source
domains to an unseen target domain. For example, it can be a new population with novel
operating conditions that were not available at training time. Therefore, we have chosen
a dataset from heterogeneous fleets of commercial buses consisting of approximately 1500
hybrid energy vehicles. They have diferent physical configurations (e.g., Double-Decker vs.
Single-Decker) and were deployed for diferent usage (e.g., long-haul vs. city operations) in
various locations. All of them are equipped with lithium-ion batteries. Although both numerical
and categorical features were present in the dataset, we only focused on a set of numerical
features (10 in total) to model SoH degradation, as these features are of the best data quality
and availability. The SoH values are collected from BMU, which is a component that does
not necessarily last longer than batteries. The age of the energy storage systems (ESS) was
post-processed (using repair records of the battery and BMU) and added to the feature set.</p>
      <sec id="sec-4-1">
        <title>4.1. Experiment settings</title>
        <p>Two sets of three experiments (six scenarios in total) were listed in Table 1 and were proposed
to address the need for domain adaption in the presence of a discrepancy between the source
and the target domain. The first set of experiments (i.e., 1 to 3) focuses on finding domain
invariant features that could generalize over diferent operating conditions. The second set of
experiments (i.e., 4 to 6) focuses on generalizing over diferent physical configurations. They
correspond to real-world scenarios in the applications with heterogeneous groups of vehicles,
where a freshly deployed fleet is of varying configuration, or operates under diferent conditions;
thus, it requires transfer learning.</p>
        <p>The first three scenarios focus on the transfer learning setting of fleets with diferent transport
cycles (operating conditions), with respect to the average speed. There are three diferent vehicle
populations/domains, i.e., the “slow”, the “moderate”, and the “fast”. The first domain of vehicles
belongs to the transport cycle “slow”, which corresponds to buses operating in city centers. Due
to heavier trafic, frequent stops, and speed limits, this group of vehicles obtains an average
speed of under 12.5 kph. The second transport cycle domain “moderate” includes buses traveling
at a higher speed range (12.5 to 17.5 kph) operating within the outer circle of cities. The group
of “fast” vehicles travels at an average speed larger than 17.5 kph, which corresponds to buses
operating between cities traveling long-hauls and highways. The second set of scenarios (i.e., 4 to
hybrid_tidmieesel_time
electrical_cthimarege_time
hybrid_ddisiteasnecl_edistance
electrical_dtiosttaaln_dceistatnocteal_timeess_age
hybrid_tidmieesel_time
electrical_cthimarege_time
hybrid_ddisiteasnecl_edistance
electrical_dtiosttaaln_dceistatnocteal_timeess_age
(a) experiment scenario 1-3
(b) experiment scenario 4-6
6) focuses on the transfer learning setting of vehicles with diferent physical configurations. The
ifrst domain in the second set was the single-decker buses; the second domain was the
doubledecker buses; the third domain was articulated buses, which are longer in length compared to
vehicles in the other two domains. Figure 2 provided an overview of the presence of the causal
relations on each feature  to the target variable  in diferent populations. It is shown in the
ifgure that the presence of features in the “moderate” and the “fast” population are quite similar,
as are the “double-decker” and the “articulated” vehicle population.</p>
        <p>
          For each experiment scenario, several well-known filter- and wrapper-based feature selection
methods were tested to compare with the proposed approaches, including Pearson index; feature
importance measured by random forest (RF), extreme gradient boost (XGB), and coeficient from
linear regression (LR) model; and sequential feature selection (SFS) [43] with linear regression.
In addition, a recently proposed approach based on invariant models for causal transfer learning
(IMCTL) [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] by Rojas et al., along with its variation utilizing greedy search (IMCTL-G) is
tested and compared to the proposed approach based causal discovery, i.e., casual discovery for
invariant features (CDIF  and  ). Moreover, we compared the Genetic Algorithm (GA) using
labeled data from the entire dataset (including all domains) with the proposed GA variation GAIF
using a special fitness function. GA trained with data from all domains served as a reference and
is expected to achieve the best performance. To evaluate the goodness of the features selected
by diferent methods under a fair setting, a linear regression model was chosen for the SoH
regression task, using Python implementation in scikit-learn [44]. Mean absolute error (MAE)
was adopted as the evaluation metric for the SoH prediction task, and 4-fold cross-validation
was performed on the vehicle level to measure the uncertainty.
        </p>
        <p>The result section is organized as follows. We first present a comparison (in Table 2) of the</p>
        <p>SFS</p>
        <p>XGB
electrical_distance
charge_time
hybrid_time
2 diesel_distance
iro diesel_time
caen ess_age
S total_distance
electrical_time
hybrid_distance</p>
        <p>total_time
1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 Fold
1 2 3 4 1 2 3 4 1 2 3 4 2 4
1 2 3 4 1 2 3 4
proposed approach against well-known methods across all six scenarios. Next, the uncertainty
in the feature selection over diferent folds of the experiments is analyzed (Figures 3 and 4). We
also study how the performance of diferent conventional feature selection methods depends on
design parameter , the number of features to be selected. We show the performance of CDIF
across diferent thresholds and numbers of features selected in Figure 7.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Methods performance comparison</title>
        <p>the best 5 features were selected. GA* corresponds to the approach using the genetic algorithm
with the entire population from both the source and the target domain. GA* served as a reference
and is expected to achieve the best performance, compared to other methods that only use the
source population for feature selection. The two variations of CDIF, i.e., CDIF  employed a
threshold  of 0.45, and CDIF  employed a threshold of 0.18. The selection of the thresholds,
i.e.,  and  , is a trade-of between regression performance and number of selected features.
The proposed approach (GADIF), with an  of 0.5, outperforms other well-known methods
under scenarios 2, 3, 5, and 6 (except for GA*, which is expected since it utilizes data from the
target domain while GADIF does not). CDIF  outperformed other methods under scenario 1.
For the rest of the scenarios, CDIF outperformed only IMCTL and IMCTL-G, which were also
based on causal learning; however, it was not competitive against classical methods.</p>
        <p>GADIF performs better under scenarios 5 and 6 compared to scenario 4. The source population
under scenarios 5 and 6 contains vehicles of both simple and a more complex/heavier structure,
i.e., single-decker and with one of the other two types. Intuitively, with the designed fitness
function, GADIF can extract domain invariant features from a population with larger variation
compared to scenario 4, in which only a vehicle with a complex/heavy structure is included
while in the target only resides vehicle with a simple/lighter structure. In addition, the standard
deviation in MAE values over 4-fold cross-validation of GA-based methods is rather small
(around 0.01), which indicates that the diference in the prediction performance of GA-based
methods between diferent scenarios is statistically significant.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Uncertainties in SoH modeling performance</title>
        <p>The uncertainty (e.g., standard deviation computed over the 4 folds) in the MAE performance
measure, presented in Table 2, was from the variations in the sample population. Therefore,
features selected by diferent methods might vary as well. Figures 3 and 4 illustrated features
selected by diferent methods: feature sets (top 5 features) suggested by Pearson index under
scenarios 1-6 remain the same over all four folds; features sets suggested by CDIF  remain the
same across all four folds under most scenarios (except for the 5-th run); feature sets suggested
by RF, LR, XGB, and CDIF  are quite stable, with less than seven mismatches. The deeper and
larger the filled squares are, the more important the corresponding feature is; unfilled squares
correspond to features selected by GA, GADIF, IMCTL, IMCTL-G, and CDIF; these methods are
binary and do not entail a degree of importance. Neither GA nor GADIF requires one to specify
the number of features for the selection, and features selected from both methods vary similarly
over the four folds; on the other hand, features selected by IMCTL, and IMCTL-G were quite
diferent over the four folds for all scenarios.</p>
        <p>The stability of feature selection methods was estimated by computing the distance between
the selected features from experiments of diferent cross-validation folds over the six scenarios.
A stable and consistent feature selection method would select identical feature sets from diferent
scenarios, yielding a zero-sum of all pairwise distances, computed from, e.g., feature sets selected
from diferent cross-validation folds. We computed the sum of all pairwise distances between
feature sets selected from the 4-folds using the Jaccard index and compared diferent methods
using a boxplot of summed distances from all six scenarios. Results are presented in Figure 5.</p>
        <p>The result shows the stability of the proposed approaches GADIF and CDIF  were not
electrical_distance
charge_time
hybrid_time
5 diesel_distance
iro diesel_time
caen ess_age
S total_distance
electrical_time
hybrid_distance</p>
        <p>total_time
1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 Fold
1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4
significantly diferent from the majority of the methods (i.e., RF, LR, SFS, and GA); feature sets
selected by Pearson index, and the proposed approach CDIF  were the most stable across
the six scenarios, while XGB appears to slightly outperform most of the methods; feature sets
selected using IMCTL and IMCTL-G were highly unstable.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Performance based on design parameter</title>
        <p>Figure 6 illustrates the performance of using diferent sets of features selected by diferent
methods over the feature set size , under experiment scenario 2. The gray horizontal line
corresponds to the mean MAE of GADIF, and the margin corresponds to the standard deviation
2.00
1.95
1.90
1.85
EA1.80
M
1.75
1.70
1.65
1.60
Methods</p>
        <p>Pearson
RF
LR
SFS
XGB
pearson RF</p>
        <p>LR</p>
        <p>SFS</p>
        <p>XGB</p>
        <p>GA</p>
        <p>GADIF IMCTLIMCTL-CGDIF UCDIF U
over 4-fold cross-validation. For most methods, it can be clearly seen that the performance
starts to converge from a  of 5. GADIF does not need to specify the number of features to
select and outperform other methods, regardless of the chosen .</p>
        <p>Subfigures in Figure 7(a), 7(c), 7(e), and 7(g) illustrate the performance of CDIF  and CDIF
 for diferent values of the thresholds  and  . Correspondingly, Figure 7(b), 7(d), 7(f), and 7(h)
showcases the number of features selected versus the threshold. For experiment scenario 1 with
the feature set  , the MAE for the “slow” target population was significantly lower compared
to scenarios 2 and 3. This is due to the presence of many features that cause the target variable
. Therefore, more features were selected, which yielded better overall prediction power. In
contrast, for scenarios 2 and 3, the presence of most features that cause the target variable 
is quite low in the “slow” population compared to the other two populations. Thus, only a
few features were selected for the regression task. For all six scenarios with the feature set 
and  , MAE changes negatively correspond to the number of features selected, and hence, in
general, the more features selected, the better the performance, until a saturation point that
there is no significant improvement, as is shown in Figure 6.
0.01160.01650.02390.03270.0423 0.065 0.0924 0.128
threshold
0.13620.15690.1811
0.01160.01650.02390.03270.0423 0.065 0.0924 0.128
threshold
0.13620.15690.1811
Single-decker
Double-decker
Articulated</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>This paper aims to develop and evaluate methods for selecting domain-invariant features. Such
features are expected to lead to robust machine learning models and lead to a better
generalization performance to unseen target populations. Two approaches, one based on causal discovery
(CDIF) and the other based on an evolutionary algorithm (GADIF), were proposed to identify
invariant features across several source domains in a multi-source domain generalization. The
performance of the proposed approaches was compared with well-known feature selection
methods in predicting the SoH of the batteries in real-world data from heterogeneous fleets
of HEBs, running under diferent operating conditions and physical configurations. The main
contribution of the paper is the comparison of well-known methods with the proposed
approaches on their ability to choose invariant features for training SoH regression models that
could generalize well to the unseen target domains. The experiment results show that GADIF
outperforms other methods by selecting features that generalize better to unseen domains in
4 out of 6 testing scenarios, while CDIF outperforms other methods in one scenario. Future
eforts for further development of CDIF include exploring the alternatives in generating the
causal graph for each vehicle; criteria for suggesting threshold over diferent settings; extending
the experiments with settings involving larger numbers of source domains; a more holistic
evaluation metric with a balanced weighting on both performance accuracy and stability in the
selected domain invariant features.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>The work was carried out with support from the Knowledge Foundation and Vinnova (Sweden’s
innovation agency) through the Vehicle Strategic Research and Innovation Programme FFI.
[24] S. Sindhiya, S. Gunasundari, A survey on genetic algorithm based feature selection for
disease diagnosis system, in: Proceedings of IEEE international conference on computer
communication and systems ICCCS14, IEEE, 2014, pp. 164–169.
[25] M. Rostami, K. Berahmand, S. Forouzandeh, A novel community detection based genetic
algorithm for feature selection, Journal of Big Data 8 (2021) 1–27.
[26] P. Dhal, C. Azad, A comprehensive survey on feature selection in the various fields of
machine learning, Applied Intelligence (2021) 1–39.
[27] Z. Ullah, S. R. Naqvi, W. Farooq, H. Yang, S. Wang, D.-V. N. Vo, et al., A comparative
study of machine learning methods for bio-oil yield prediction–a genetic algorithm-based
features selection, Bioresource Technology 335 (2021) 125292.
[28] A. A. Ewees, M. A. Al-qaness, L. Abualigah, D. Oliva, Z. Y. Algamal, A. M. Anter,
R. Ali Ibrahim, R. M. Ghoniem, M. Abd Elaziz, Boosting arithmetic optimization algorithm
with genetic algorithm operators for feature selection: case study on cox proportional
hazards model, Mathematics 9 (2021) 2321.
[29] N. Maleki, Y. Zeinali, S. T. A. Niaki, A k-nn method for lung cancer prognosis with the use
of a genetic algorithm for feature selection, Expert Systems with Applications 164 (2021)
113981.
[30] P. Z. Lappas, A. N. Yannacopoulos, A machine learning approach combining expert
knowledge with genetic algorithms in feature selection for credit risk assessment, Applied
Soft Computing 107 (2021) 107391.
[31] K. Chalupka, F. Eberhardt, P. Perona, Causal feature learning: an overview,
Behaviormetrika 44 (2017) 137–164.
[32] K. Yu, X. Guo, L. Liu, J. Li, H. Wang, Z. Ling, X. Wu, Causality-based feature selection:</p>
      <p>Methods and evaluations, ACM Computing Surveys (CSUR) 53 (2020) 1–36.
[33] C. Persello, L. Bruzzone, Kernel-based domain-invariant feature selection in hyperspectral
images for transfer learning, IEEE transactions on geoscience and remote sensing 54 (2015)
2615–2626.
[34] S. Magliacane, T. Van Ommen, T. Claassen, S. Bongers, P. Versteeg, J. M. Mooij,
Domain adaptation by using causal inference to predict invariant conditional distributions,
Advances in neural information processing systems 31 (2018).
[35] J. M. Mooij, S. Magliacane, T. Claassen, Joint causal inference from multiple contexts
(2020).
[36] A. Argyriou, T. Evgeniou, M. Pontil, Multi-task feature learning, in ‘advances in neural
information processing systems 19’, 2007.
[37] A. Argyriou, M. Pontil, Y. Ying, C. Micchelli, A spectral regularization framework for
multi-task structure learning, Advances in neural information processing systems 20
(2007).
[38] Z. Taghiyarrenani, S. Nowaczyk, S. Pashami, M.-R. Bouguelia, Multi-domain adaptation
for regression under conditional distribution shift, Available at SSRN 4197949 (2022).
[39] S. Sahoo, K. S. Hariharan, S. Agarwal, S. B. Swernath, R. Bharti, S. Han, S. Lee, Transfer
learning based generalized framework for state of health estimation of li-ion cells, Scientific
Reports 12 (2022) 1–12.
[40] P. Hoyer, D. Janzing, J. M. Mooij, J. Peters, B. Schölkopf, Nonlinear causal discovery with
additive noise models, Advances in neural information processing systems 21 (2008).
[41] L. J. Eshelman, The chc adaptive search algorithm: How to have safe search when engaging
in nontraditional genetic recombination, in: Foundations of genetic algorithms, volume 1,
Elsevier, 1991, pp. 265–283.
[42] S. M. Vieira, L. F. Mendonça, G. J. Farinha, J. M. Sousa, Modified binary pso for feature
selection using svm applied to mortality prediction of septic patients, Applied Soft Computing
13 (2013) 3494–3504.
[43] S. Raschka, Mlxtend: Providing machine learning and data science utilities and extensions
to python’s scientific computing stack, The Journal of Open Source Software 3 (2018).</p>
      <p>URL: http://joss.theoj.org/papers/10.21105/joss.00638. doi:10.21105/joss.00638.
[44] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel,
P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher,
M. Perrot, E. Duchesnay, Scikit-learn: Machine learning in Python, Journal of Machine
Learning Research 12 (2011) 2825–2830.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.-Q.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>Battery-electric transit bus developments and operations: A review</article-title>
          ,
          <source>International Journal of Sustainable Transportation</source>
          <volume>10</volume>
          (
          <year>2016</year>
          )
          <fpage>157</fpage>
          -
          <lpage>169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mahmoud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Garnett</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ferguson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kanaroglou</surname>
          </string-name>
          ,
          <article-title>Electric buses: A review of alternative powertrains</article-title>
          ,
          <source>Renewable and Sustainable Energy Reviews</source>
          <volume>62</volume>
          (
          <year>2016</year>
          )
          <fpage>673</fpage>
          -
          <lpage>684</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pichler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kapaun</surname>
          </string-name>
          ,
          <article-title>Lithium ion batteries for hybrid busses and hybrid commercial vehicles-being ready for the broad mass.?; li-ion batterien fuer hybridbusse und hybridnutzfahrzeuge. startklar fuer die breite masse</article-title>
          .
          <source>?</source>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Borén</surname>
          </string-name>
          ,
          <article-title>Electric buses' sustainability efects, noise, energy use, and costs</article-title>
          ,
          <source>International Journal of Sustainable Transportation</source>
          <volume>14</volume>
          (
          <year>2020</year>
          )
          <fpage>956</fpage>
          -
          <lpage>971</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>[5] Volvo group invests in optibus to speed up on bus electrification and digitalization</article-title>
          , https: //www.sustainable-bus.com/its/optibus-volvo-investment/, ???? Accessed:
          <fpage>2021</fpage>
          -11-12.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Lipu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hannan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hussain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hoque</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Ker</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. H. M. Saad</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Ayob</surname>
          </string-name>
          ,
          <article-title>A review of state of health and remaining useful life estimation methods for lithium-ion battery in electric vehicles: Challenges and recommendations</article-title>
          ,
          <source>Journal of Cleaner Production</source>
          <volume>205</volume>
          (
          <year>2018</year>
          )
          <fpage>115</fpage>
          -
          <lpage>133</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Fotouhi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Auger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Propp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Longo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wild</surname>
          </string-name>
          ,
          <article-title>A review on electric vehicle battery modelling: From lithium-ion toward lithium-sulphur</article-title>
          ,
          <source>Renewable and Sustainable Energy Reviews</source>
          <volume>56</volume>
          (
          <year>2016</year>
          )
          <fpage>1008</fpage>
          -
          <lpage>1021</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Barré</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Deguilhem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Grolleau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gérard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Suard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Riu</surname>
          </string-name>
          ,
          <article-title>A review on lithium-ion battery ageing mechanisms and estimations for automotive applications</article-title>
          ,
          <source>Journal of Power Sources</source>
          <volume>241</volume>
          (
          <year>2013</year>
          )
          <fpage>680</fpage>
          -
          <lpage>689</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Berecibar</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gandiaga</surname>
          </string-name>
          , I. Villarreal,
          <string-name>
            <given-names>N.</given-names>
            <surname>Omar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. V.</given-names>
            <surname>Mierlo</surname>
          </string-name>
          , P. V. den Bossche,
          <article-title>Critical review of state of health estimation methods of li-ion batteries for real applications</article-title>
          ,
          <source>Renewable and Sustainable Energy Reviews</source>
          <volume>56</volume>
          (
          <year>2016</year>
          )
          <fpage>572</fpage>
          -
          <lpage>587</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ouyang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <article-title>Battery degradation minimization oriented energy management strategy for plug-in hybrid electric bus with multi-energy storage system</article-title>
          ,
          <source>Energy</source>
          <volume>165</volume>
          (
          <year>2018</year>
          )
          <fpage>153</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>J.-H. Lee</surname>
            ,
            <given-names>I.-S.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Estimation of online state of charge and state of health based on neural network model banks using lithium batteries</article-title>
          ,
          <source>Sensors</source>
          <volume>22</volume>
          (
          <year>2022</year>
          )
          <fpage>5536</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>N.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Yue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>A data-driven approach to state of health estimation and prediction for a lithium-ion battery pack of electric buses based on real-world data</article-title>
          ,
          <source>Sensors</source>
          <volume>22</volume>
          (
          <year>2022</year>
          )
          <fpage>5762</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>B.</given-names>
            <surname>Pattipati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pattipati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Christopherson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Namburu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. V.</given-names>
            <surname>Prokhorov</surname>
          </string-name>
          , L. Qiao,
          <article-title>Automotive battery management systems</article-title>
          , IEEE,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Rothgang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rogge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Becker</surname>
          </string-name>
          , D. U. Sauer,
          <article-title>Battery design for successful electrification in public transport</article-title>
          ,
          <source>Energies</source>
          <volume>8</volume>
          (
          <year>2015</year>
          )
          <fpage>6715</fpage>
          -
          <lpage>6737</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>D.</given-names>
            <surname>Roman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Saxena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Robu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pecht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Flynn</surname>
          </string-name>
          ,
          <article-title>Machine learning pipeline for battery state-of-health estimation</article-title>
          ,
          <source>Nature Machine Intelligence</source>
          <volume>3</volume>
          (
          <year>2021</year>
          )
          <fpage>447</fpage>
          -
          <lpage>456</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <surname>G</surname>
          </string-name>
          . Zhao,
          <article-title>Transfer learning with long short-term memory network for stateof-health prediction of lithium-ion batteries</article-title>
          ,
          <source>IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS 67</source>
          (
          <year>2020</year>
          )
          <fpage>8723</fpage>
          -
          <lpage>8731</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J.</given-names>
            <surname>Quiñonero-Candela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sugiyama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. D.</given-names>
            <surname>Lawrence</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schwaighofer</surname>
          </string-name>
          ,
          <article-title>Dataset shift in machine learning</article-title>
          , Mit Press,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>A survey on transfer learning</article-title>
          ,
          <source>IEEE Transactions on knowledge and data engineering 22</source>
          (
          <year>2009</year>
          )
          <fpage>1345</fpage>
          -
          <lpage>1359</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Uguroglu</surname>
          </string-name>
          , J. Carbonell,
          <article-title>Feature selection for transfer learning</article-title>
          ,
          <source>in: Joint European Conference on Machine Learning and Knowledge Discovery in Databases</source>
          , Springer,
          <year>2011</year>
          , pp.
          <fpage>430</fpage>
          -
          <lpage>442</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Altarabichi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Fan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pashami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Mashhadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Nowaczyk</surname>
          </string-name>
          ,
          <article-title>Extracting invariant features for predicting state of health of batteries in hybrid energy buses</article-title>
          ,
          <source>in: 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)</source>
          , IEEE,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rojas-Carulla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Schölkopf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Turner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Peters</surname>
          </string-name>
          ,
          <article-title>Invariant models for causal transfer learning</article-title>
          ,
          <source>The Journal of Machine Learning Research</source>
          <volume>19</volume>
          (
          <year>2018</year>
          )
          <fpage>1309</fpage>
          -
          <lpage>1342</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Che</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <article-title>Predictive battery health management with transfer learning and online model correction</article-title>
          ,
          <source>IEEE Transactions on Vehicular Technology</source>
          <volume>70</volume>
          (
          <year>2021</year>
          )
          <fpage>1269</fpage>
          -
          <lpage>1277</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>K. Q.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yuen</surname>
          </string-name>
          ,
          <article-title>Transfer-learning-based state-of-health estimation for lithiumion battery with cycle synchronization</article-title>
          , IEEE/ASME Transactions on Mechatronics (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>