<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Generative Adversarial Network Based Autoencoder: Application to fault detection problem for closed-loop dynamical systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Data Length</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Building Technologies Office through the Emerging Technologies, Sensors and Controls Program</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>CVGMI, University of Florida, FL</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>I. Chakraborty</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Optimization and Control Group Pacific Northwest National Laboratory</institution>
          ,
          <addr-line>Richland, WA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The fault detection problem for closed-loop, uncertain dynamical systems is investigated in this paper, using different deep-learning based methods. The traditional classifier-based method does not perform well, because of the inherent difficulty of detecting system-level faults for a closed-loop dynamical system. Specifically, the acting controller in any closed-loop dynamical system works to reduce the effect of systemlevel faults. A novel generative-adversarial-based deep autoencoder is designed to classify data sets under normal and faulty operating conditions. This proposed network performs quite well when compared to any available classifier-based methods, and moreover, does not require labeled fault-incorporated data sets for training purposes. This network's performance is tested on a highcomplexity building energy system data set.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Fault detection and isolation enables safe operation of
critical dynamical systems, along with cost effective system
performance and maximally effective control performance. For
this reason, fault detection and isolation research is of
interest in many engineering areas, such as aerospace systems
(e.g., [1; 2; 3; 4]), automotive systems (e.g., [5; 6; 7; 8; 9;
10; 11]), photovoltaic systems (e.g., [12; 13; 14; 15; 16; 17;
18]), and building heating and cooling systems (e.g., [20;
21]). For feedback-controlled dynamical systems subjected
to exogenous disturbances, fault detection and isolation
becomes challenging because the controller expends effort to
compensate for the undesired effect of the fault.</p>
      <p>In this paper, we will focus on the fault detection
problem. The objective will be to successfully distinguish data
sets collected under faulty operating conditions from data
sets representative of normal operating conditions. We will
only investigate physical faults that affect the system
dynamics. One can classify the approaches to fault detection
problems based on the assumption regarding system
dynamics, namely linear or nonlinear systems, and based on the use
of a system model for fault detection, either model-driven or
data-driven. Model-driven methods use a model for the
dynamical system to detect the fault, whereas the data-driven
methods do not make explicit use of a model of the physical
system. Next we provide a brief overview of the available
literature in all these categories.</p>
      <p>
        The fault detection problem for linear systems was first
formulated in [
        <xref ref-type="bibr" rid="ref21">22</xref>
        ] and [
        <xref ref-type="bibr" rid="ref22">23</xref>
        ]. Both papers developed
Luenberger-observer based approaches, where the observer
gain matrix decouples the effects of different faults. The
observer-based approach was extended in [
        <xref ref-type="bibr" rid="ref23">24</xref>
        ] to include
fault identification by solving the problem of residual
generation by processing the inputs and outputs of the system.
A model- and parameter-estimation based fault detection
method is developed in [
        <xref ref-type="bibr" rid="ref24">25</xref>
        ]. An observer-based fault
detection approach, where eigenstructure assignment provides
robustness to the effects of exogenous disturbances, is
demonstrated in [
        <xref ref-type="bibr" rid="ref25">26</xref>
        ]. Sliding-mode observers are used in [
        <xref ref-type="bibr" rid="ref26">27</xref>
        ] and
[
        <xref ref-type="bibr" rid="ref27">28</xref>
        ], who also provide fault severity estimates. Isermann
and Balle [
        <xref ref-type="bibr" rid="ref28">29</xref>
        ] provide an overview of fault detection
methods developed in the 1990s, including state and output
observers, parity equations, bandpass filters, spectral analysis
(fast Fourier transforms), and maximum-entropy estimation.
      </p>
      <p>
        For nonlinear systems, fault detection methods primarily
use the concept of unknown input observability.
Controllability and observability Gramians for nonlinear systems
are defined in [
        <xref ref-type="bibr" rid="ref29">30</xref>
        ]. De Persis and Isidori [
        <xref ref-type="bibr" rid="ref30">31</xref>
        ] develop a
differential geometric method for fault detection and
isolation. They use the concept of an unobservability subspace,
based on the similar notion for linear systems (see [
        <xref ref-type="bibr" rid="ref31">32</xref>
        ]).
The method guarantees the existence of a quotient
subsystem of a given system space, which is only affected by the
fault of interest. Martinelli [
        <xref ref-type="bibr" rid="ref32">33</xref>
        ] develops a generalized
algorithm to calculate the rank of the observable codistribution
matrix (equivalent to the observability Gramian for linear
systems) for nonlinear systems, and demonstrates its
applicability for several practical examples, such as motion of a
unicycle, a vehicle moving in three-dimensional space, and
visual-inertial sensor fusion dynamics.
      </p>
      <p>
        For a model-based fault detection problem, Maybeck et
al. and Elgersma et al. used an assemble of Kalman filters
to match a particular fault pattern in [
        <xref ref-type="bibr" rid="ref33">34</xref>
        ] and [
        <xref ref-type="bibr" rid="ref34">35</xref>
        ],
respectively. Boskovic et al. [
        <xref ref-type="bibr" rid="ref35">36</xref>
        ] and [
        <xref ref-type="bibr" rid="ref36">37</xref>
        ] develop a multiple
model method to detect and isolate actuator faults, using
multiple hypothesis testing. In [9], a nonlinear
observerbased fault identification method has been developed for a
robot manipulator, which shows an asymptotic convergence
of the fault observer to the actual fault value. Dixon et al.
[5] develop a torque filtering based fault isolation for a class
of robotic manipulator systems. In [
        <xref ref-type="bibr" rid="ref37">38</xref>
        ], a model-based fault
detection and identification approach is developed, by using
a differential algebraic and residual generation method.
      </p>
      <p>
        Data-driven approaches such as [
        <xref ref-type="bibr" rid="ref38">39</xref>
        ] and [
        <xref ref-type="bibr" rid="ref39">40</xref>
        ], use
system data to identify the state-space matrices, without using
any knowledge of system dynamics. In [
        <xref ref-type="bibr" rid="ref40">41</xref>
        ], for a class of
discrete time-varying networked systems with incomplete
measurements, a least-squares filter paired with a residual
matching (RM) approach is developed to isolate and
estimate faults. This approach comprises several Kalman
filters, with each filter designed to estimate the augment
signal, composed of the system state and a specific fault signal,
associated with it. An adaptive fault detection and
diagnosis method is developed in [
        <xref ref-type="bibr" rid="ref41">42</xref>
        ], by implementing a
clustering approach to detect faults. For incipient faults,
Harmouche et al. in [
        <xref ref-type="bibr" rid="ref42">43</xref>
        ] used a principal component analysis
(PCA) framework to transform a data set with faulty
operating conditions into either principal or residual subspaces.
For nonlinear systems, although data-driven approaches are
effective in many fault identification scenarios, the quality
of fault detection greatly depends on the quality of available
training data and the training data span. Zhang et al. [
        <xref ref-type="bibr" rid="ref3">4</xref>
        ]
proposed merging data-driven and model-based methods in a
Bayesian framework. In [
        <xref ref-type="bibr" rid="ref43">44</xref>
        ], sparse global-local preserving
projections are used to extract sparse transformation vectors
from given data set. The extracted sparse transformation is
able to extract meaningful features from the data set, which
results in a fault related feature extraction, as shown in [
        <xref ref-type="bibr" rid="ref43">44</xref>
        ].
      </p>
      <p>
        Generative adversarial networks (GANs) were introduced
in [
        <xref ref-type="bibr" rid="ref44">45</xref>
        ] as data generative models in a zero-sum game
framework. The training objective for a GAN is to increase the
error rate of the discriminative network that was trained
on an existing data set. Since their introduction, GANs
have been used to augment machine learning techniques
to do boosting of classification accuracy, generate
samples, and detect fraud [46; 47; 48; 49; 50; 51; 52; 53;
54; 55]. GAN has been proposed as an alternative to
variational autoencoders [56; 57]. Several research
publications propose algorithms that can distinguish between
“true” samples and samples generated by GANs [58; 59;
60; 61].
      </p>
      <p>The remainder of the paper is organized as follows. We
provide a mathematical description of the fault detection
problem along with the proposed approach in Section 2.
In Section 3 we explain the architecture of an autoencoder
and we propose a GAN to generate and classify data sets
with normal and faulty operating conditions. A novel loss
function, suitable for the proposed GAN based autoencoder
network, is developed in Section 3. In Section 4, we first
train and test a support vector machine (SVM) based
classifier, on labeled data sets; (labeling is done based on both
faulty and normal operating conditions). Subsequently, we
demonstrate a way to improve the performance of the
designed SVM, by training a GAN based autoencoder on a
Gaussian random data set, which represents data sets with
faulty operating conditions, for training the proposed GAN
based network. In Section 4, we show further improved
performance of our proposed GAN based network architecture
using a representative data set with faulty operating
conditions generated by taking linear combinations of vectors that
are orthogonal to the principal components of the normal
data set space. Finally, we summarize our findings in
Section 5.
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>The problem and the proposed method</title>
      <sec id="sec-2-1">
        <title>Problem description</title>
        <p>where x : [0; 1) ! Rn is the n-dimensional vector
containing system states, u : [0; 1) ! Rm is the
mdimensional vector containing control inputs, d : [0; 1] !
Rp is the p-dimensional vector of exogenous disturbances,
f : Rn Rm Rp ! Rn is an unknown nonlinear mapping
that represents the system dynamics, y : [0; 1] ! Rq is a
vector of measurable system outputs, and g : Rn Rm
Rp ! Rq is an unknown nonlinear mapping that represents
the relationship of input to output.</p>
        <p>Now we define a fault detection problem for the
dynamics in (1) as follows. Given any data set S containing sample
measurement pairs of u and y, identify an unwanted change
in the system dynamics. In order to further generalize the
fault detection problem, we will only use the data set
representing normal operating conditions. This restriction uses
the fact that having a data set that incorporates faulty
operating conditions indicates either having the capabilities of
inserting system-level faults in the dynamics or having a
known system dynamics f (as in (1)). Developing either
of these aforementioned capabilities involves manual labor
and associated cost. We will further assume that the
observable part of the system described in (1) can be sufficiently
identified from the available data set with normal operating
conditions.
For the fault detection problem described in Section 2.1, we
develop a GAN based deep autoencoder, which uses data
set with normal operating conditions (let us define this data
space as S0), to successfully identify the presence of faulty
operating conditions in a given data set. In order to do
that, we take the principal components of S0 and use the
orthogonals to those principal components to define a vector
space S1. Now, the training objective of our proposed GAN
is to “refine” S1, to calculate S2 S1, such that S1
becomes a representative of the data set that contains
systemlevel faulty operating conditions. The purpose of our deep
autoencoder is to learn the data structure of S0, by going
through the process of encoding and decoding. Upon
selecting an encoding dimension, we map the GAN generated
space S2 to the selected encoding dimension space. Let us
designate the encoded representation of S0 as S0 , and S2
as S2 . Our final step is to design a classifier, which takes
S0 and S2 for training. Furthermore, this entire training
process, of both GAN and the deep autoencoder, is done
simultaneously by defining a cumulative loss function.</p>
        <p>
          In order to motivate the requirement of doing an
orthogonal transformation on S0 to define S1, we demonstrate a
case of defining S1 using Gaussian random noises, and
follow the aforementioned training process of the proposed
network. Moreover, a single SVM based classifier is trained
on labeled normal and fault-incorporated data sets, to
compare performance with our proposed network for two
different cases (orthogonal transformation and Gaussian-noise
based prior selection).
Autoencoders are multilayer computational graphs used to
learn a representation (encoding) for a set of data, for the
purpose of dimensionality reduction or data compression. In
other words, the training objective of our proposed deep
autoencoder is to learn the underlying representation of a data
set with normal operating conditions while going through
the encoding transformations. A deep enough autoencoder,
in theory, should be able to extract a latent representation
signature from the training data, which can then be used to
better distinguish normal and faulty operation. An
autoencoder comprises two different transformations, namely
encoding and decoding. The architecture of an autoencoder
was first introduced and described by Bengio et al. in [
          <xref ref-type="bibr" rid="ref61">62</xref>
          ].
The encoder takes an input vector x 2 Rd and maps it to a
hidden (encoded) representation xe 2 Rd0 , through a
convolution of deterministic mappings. The decoder maps the
resulting encoded expression into a reconstruction vector x0.
We will use the notation E and G for the encoder and
decoder of the autoencoder respectively.
        </p>
        <p>Let the number of layers in the autoencoder network be
2n + 1, and let yi denote the output for the network’s ith
layer. Then
yi = i(wityi 1 + bi); 8i 2 [1; 2n + 1] :
Let i = fwi; big denote the parameters of the ith layer and
i : R ! R be the activation function selected for each
layer of the autoencoder. Let us also define = f igi2=n+11.</p>
        <p>defined for this autoencoder is optimized to minimize
the average reconstruction error, given by
= arg min
m
1
p
m</p>
        <p>X L(xi; x0i)
d i=1
(2)
where L is square of Euclidean distance, defined as
L(x; x0) , kx x0k2, and m 2 N is the number of available
data points.</p>
        <p>Our proposed autoencoder is trained on the normal data
set, mentioned in Section 4.1. 90% of the normal data (data
span one year, with 5 minute resolution) is used to train the
autoencoder, and the rest 10% is used for testing. Figure 2
shows both training and testing performance of our
autoencoder with encoding dimension 100, with increase in
training epochs. Furthermore, selecting the proper encoding
dimension is crucial for the following classifier to perform
optimally. Figure 2 also demonstrates that the true positive
accuracy rate from the classifier decreases when we decrease
the encoding dimension. This signifies the loss of valuable
information, if we keep decreasing the encoding dimension.
For our application, we selected an encoding dimension of
100.</p>
        <p>
          In the next subsection, we give the formulation of our
proposed generative model. Our generative model is in the
spirit of the well-known GAN [
          <xref ref-type="bibr" rid="ref44">45</xref>
          ]. Our proposed model
will essentially generate samples that are not from the
training data population. So clearly, unlike GAN, here the
objective is not to fool the discriminator but to learn which
samples are different. We will first formulate our proposed
model and then comment on the relationship of our model
with GAN in detail.
3.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Our proposed generative model</title>
        <p>We will use x1 to denote a sample of the data from the
normal class, i.e., the class for which the training data is given.
Let pdata be the distribution of the normal class. We will
denote a sample from the abnormal class by x2. In our setting,
the distribution of the abnormal class is unknown, because
the training data does not have any samples from the
abnormal class. Our generative model will generate sample x2
from the unknown distribution, pnoise. Note, here we will
use the terminology “data” and “noise” to denote the
normal and abnormal samples. Let pz be the prior of the noise
in the encoding space, i.e., x2 pnoise = G(pz).
Furthermore, let D be the discriminator (a multilayer perceptron for
binary classification), such that</p>
        <p>D(x) =
1; if x
0; if x
pdata
pnoise
We will solve for E , G, and D in a maximization problem
with the error function V as follows:</p>
        <p>V (D; E ; G) = Ex1 pdata ((1</p>
        <p>L(x1; G(E (x1))))
+ log(D(x1))) + Ez pz log(1
Note that here, L is as defined in Eq. 2. Furthermore, L
is normalized in [0; 1]. Now, we will state and prove some
theorems about the optimality of the solutions for the error
function V .</p>
        <p>Theorem 1. For fixed G and E , the optimal D is
D (x) =</p>
        <p>pdata(x)
pdata(x) + pnoise(x)
(4)
x
Proof. Given G and E , V (D; E ; G) can be written as:
V (D) = Ex1 pdata (log(D(x1))) + Ez pz log(1</p>
        <p>Z
=
(pdata(x) log(D(x)) + pnoise(x) log(1
D(G(z)))</p>
        <p>D(x)))
The above function achieves the maximum at D (x) =
pdata(x)
pdata(x)+pnoise(x) .</p>
        <p>Theorem 2. With D
when x1
tion.</p>
        <p>pdata and x2
and fixed E , the optimal G is attained</p>
        <p>pnoise has zero mutual
informaProof. Observe, from Eq. 3, the first term is maximized if
and only if the loss, L, is zero. Hence, when D = D and E
are fixed, the objective function, V reduces to</p>
        <sec id="sec-2-2-1">
          <title>Ex1 pdata (1</title>
          <p>L(x1; G(E (x1)))) + H(x1; x2)</p>
          <p>H(x1)</p>
          <p>H(x2)
The first term goes to zero, 1, when the reconstruction is
perfect, then, the remaining term is maximized iff,
H(x1; x2)</p>
          <p>H(x1)</p>
          <p>H(x2) = 0
where H(:) and H(:; :) denote the marginal and joint
entropies, respectively. Note that, the LHS of the above
expression is the mutual information, which is denoted by
I(x1; x2). Hence, the claim holds.</p>
          <p>
            Theorem 2 signifies that x1 pdata and x2 pnoise
have zero mutual information, i.e., the distributions pdata and
pnoise are completely uncorrelated. This is exactly what we
intend to get, i.e., we want to generate abnormal samples
that are completely different from the normal samples
(training data). Now, we will talk about how to choose the prior
pz after commenting on the contrast of our proposed
formulation with [
            <xref ref-type="bibr" rid="ref44">45</xref>
            ]. In GAN [
            <xref ref-type="bibr" rid="ref44">45</xref>
            ], the generator is essentially
mimicking the data distribution to fool the discriminator. On
the contrary, because our problem requires that samples be
generated from outside training data, our proposed
generator generates samples outside the data distribution. Note that
one can choose the Wasserstein loss function in Eq. 3
similar to [
            <xref ref-type="bibr" rid="ref47">48</xref>
            ]. Below we will mention some of the important
characteristics of our proposed model.
          </p>
          <p>Though we have called it a GAN based autoencoder,
clearly the decoder G is generating the samples and
hence acts as a generator in GAN.</p>
          <p>In Equation 3, on samples drawn from pdata,
autoencoder (i.e., both encoder and decoder) acts, i.e.,
G(E (x1)) should be very closed to x1, when x1
pdata. On the contrary, on z pz, only the decoder
(generator G) acts. Thus, the encoder is learned only
from pdata, while the decoder (generator) is learned
from both pdata and pz.</p>
          <p>Unlike GAN, here we do not have a two player min
max game, instead we have a maximization problem
over all the unknown parameters. Intuitively, this can
be justified, because we are not generating counterfeit
samples.</p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>How to choose prior pz</title>
        <p>If we do not know anything about the structure of the data,
i.e., about pdata, an obvious choice of prior for pz is a
uniform prior. In this work, we have used PCA to extract the
inherent lower-dimension subspace containing the data (or
most of the data). This is essential not only for the
selection of pz but for the selection of the encoding dimension
as well. By the construction of our proposed formulation,
the support of pz should be in the encoding dimension, i.e.,
in Rd0 . Given the data, we will choose d0 to be the
number of principal directions along which the data has &gt; 90%
variance. The span of these d0 bases will give a point, S, on
the Grassmannian Gr(d0; d), i.e., the manifold of d0
dimensional subspaces in Rd. The PCA suggests that “most of
the data” lies on S. In order to make sure that the generator
generates pnoise different from pdata, we will use the prior pz
as follows.
d0</p>
        <p>Let N 2 Gr(d0; d) be such that N 6= S.pLz eitf fznii=gi=x1tnbie,
the bases of N . We will say a sample z
for all i, for some x pdata. Without any loss of generality,
assume 2d0 &gt; d; then, we can select the first d d0 nis to
be orthogonal to S (this can be computed by using
GramSchmidt orthogonalization). The remaining fnigs we will
select from the bases of S.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>In this section, we will present experimental validation of
our proposed GAN based model. Recall that in our setting,
we have only the “normal” samples in the training set and
both “normal” and “faulty” samples in the testing set. In
the training phase, we will use our proposed GAN based
framework to generate samples from the population that are
uncorrelated to the normal population. We will teach a
discriminator to do so. Then, in the testing phase, we will show
that our trained discriminator can distinguish “normal” from
“faulty” samples with high prediction accuracy.
Furthermore, we will also show that using the prior, as suggested in
Section 3.1, gives better prediction accuracy than the
Gaussian prior.
4.1</p>
      <sec id="sec-3-1">
        <title>Dataset</title>
        <p>We use simulation data from a high-fidelity building energy
system emulator. This emulator captures the building
thermal dynamics, the performance of the building heating,
ventilation, and air conditioning (HVAC), as well as the
building control system. The control sequences that drive
operation of the building HVAC are representative of typical
existing large commercial office buildings in the U.S. We
selected Chicago for the building location, and we used the
typical meteorological year TMY3 data as simulation input.
The data set comprises normal operation data and data
representative of operation under five different fault types. We
use these labeled data sets for training an SVM based
classifier. The five fault types are the following: constant bias
in outdoor air temperature measurement (Fault 1), constant
bias in supply air temperature measurement (Fault 2),
constant bias in return air temperature measurement (Fault 3),
offset in supply air flow rate (Fault 4), and stuck cooling
coil valve (Fault 5). Table 1 summarizes the characteristics
of the data set including fault location, intensity, type, and
data length.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Faulty</title>
      </sec>
      <sec id="sec-3-3">
        <title>Component</title>
        <sec id="sec-3-3-1">
          <title>Outdoor air</title>
          <p>temperature sensor</p>
          <p>Supply air
temperature sensor</p>
          <p>Return air
temperature sensor
Supply air flow rate
set point
Cooling coil valve
actuator</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>System</title>
        <sec id="sec-3-4-1">
          <title>Building HVAC Mid-floor AHU</title>
          <p>Mid-floor</p>
          <p>AHU
Mid-floor</p>
          <p>AHU
Mid-floor</p>
          <p>AHU
Mid-floor
AHU</p>
        </sec>
      </sec>
      <sec id="sec-3-5">
        <title>Time of</title>
      </sec>
      <sec id="sec-3-6">
        <title>Year</title>
        <p>Jan-Dec
Feb, May,
Aug, Nov</p>
        <p>Aug
May-Jun
May-Jun
Aug</p>
      </sec>
      <sec id="sec-3-7">
        <title>Fault</title>
        <p>Intensity
2; 4 C</p>
        <sec id="sec-3-7-1">
          <title>Yearly</title>
        </sec>
        <sec id="sec-3-7-2">
          <title>Monthly</title>
        </sec>
        <sec id="sec-3-7-3">
          <title>Monthly</title>
        </sec>
        <sec id="sec-3-7-4">
          <title>Monthly</title>
        </sec>
        <sec id="sec-3-7-5">
          <title>Monthly</title>
        </sec>
        <sec id="sec-3-7-6">
          <title>Monthly</title>
        </sec>
      </sec>
      <sec id="sec-3-8">
        <title>4.2 Application of SVM on simulated dataset</title>
        <p>
          Support vector machines are statistical classifiers originally
introduced by [
          <xref ref-type="bibr" rid="ref62">63</xref>
          ] and [
          <xref ref-type="bibr" rid="ref63">64</xref>
          ], later formally introduced by
[
          <xref ref-type="bibr" rid="ref64">65</xref>
          ]. In this subsection, we will briefly demonstrate the
use of SVMs for classifying properly labeled datasets with
normal and various faulty operating conditions. SVM
separates a given set of binary labeled training data with a
hyperplane, which is at maximum distance from each binary
label. Therefore, the objective of this classification method
is to find the maximal margin hyperplane for a given
training data set. For our work, a linear separation is not
possible (i.e., to successfully draw a line to separate faulty and
normal data sets); that motivates the necessity of using a
radial basis function (RBF) kernel ([
          <xref ref-type="bibr" rid="ref65">66</xref>
          ]), along with finding a
non-polynomial hyperplane to separate the labeled datasets.
        </p>
        <p>
          Before describing SVM classification in detail, the RBF
kernel (see [
          <xref ref-type="bibr" rid="ref63">64</xref>
          ]) on two samples xi and xj is defined as
Kij , K(xi; xj ) = exp
kxi
where kxi xj k denotes the square of the Euclidean
distance, and is a user-defined parameter, selected to be unity
for this work.
        </p>
        <p>A Scikit learning module available in Python 3.5+ is used
for implementation of SVM on the building HVAC data set.
Specifically, NuSVC is used with a cubic polynomial kernel
function to train for normal and faulty data classification. As
the nu value represents the upper bound on the fraction of
training error, a range of nu values from 0:5 to 0:9 are tried
during cross validation of the designed classifier. Table 2
shows the confusion matrix for the designed SVM classifier,
for data sets labeled “normal” and “fault type 1,” where the
true positive accuracy rate is less than 50%. This finding,
as mentioned before, justifies the need to develop an
adversarial based classifier, which uses the given normal data to
create representative faulty training dataset.</p>
      </sec>
      <sec id="sec-3-9">
        <title>4.3 Data set using Gaussian noise</title>
        <p>In Table 3, the confusion matrix is shown for the GAN based
autoencoder, where Gaussian noise is used as an input to
GAN, for representing a training class of fault types for
the GAN based autoencoder. From left to right, the values
in Table 4, denote true positive rate (TPR), false positive
rate (FPR), false negative rate (FNR), and true negative rate
(TNR). Although in Table 3, the normal data set gives more
than 90% TPR, the faulty data set gives around 40% TPR.
We can conclude that Gaussian noise as an initial
representative of a faulty data set does not represent a completely
different faulty data set from the normal data set.
We demonstrated three different methods for the fault
detection problem, applied to a high complexity building data set.
The SVM classifier, despite using labeled data sets, gives
poor TPRs for our data set. Our proposed GAN based deep
autoencoder network is trained and tested using two
different training approaches. First, we use a Gaussian-noise
based data set as a representation of space S1 (as in Section
2) to train the designed GAN and simultaneously find a
representative class for a data set with faults, i.e., S2 . Although
the Gaussian-noise based data set gives much better TPR for
the normal data set than the SVM, it performs poorly when
identifying a data set with faulty conditions. Second, we use
orthogonal transformation on the normal data set to generate
S2, and subsequently our proposed GAN based autoencoder
is trained on this new S2 to generate S2 . Although the
orthogonal transformation based training approach gives
similar TPRs for the normal data set, it gives significantly better
performance for the data set with faulty conditions than the
Gaussian-noise based training approach.
4.6</p>
      </sec>
      <sec id="sec-3-10">
        <title>Group testing</title>
        <p>
          In this section, we will do some statistical analysis of the
output produced by our proposed framework. More
specifically, we will do group testing in the encoding space, i.e.,
we will pass the generated noise and the data through the
trained encoder and perform a group test. However, because
we do not know the distribution of the data and noise in the
encoding space, we cannot do a two-sample t-test. We will
develop a group testing scheme for our purpose. Let y1
i
and y2 be two sets of samples in the encoding space
geni
erated using our proposed network. Let nCi1 := yi1 y1 to
i
and nCi2 := yi2 yi2 to be the corresponding covariance
matrices capturing the interactions among dimensions. We
will identify each of the covariance matrices with the
product space of Stiefel and symmetric positive definite (SPD)
matrices, as proposed in [
          <xref ref-type="bibr" rid="ref66">67</xref>
          ].
the distance, d(X; Y ) =
        </p>
        <p>
          Now, we perform the kernel based two-sample test to
find the group difference [
          <xref ref-type="bibr" rid="ref67">68</xref>
          ] between Ci1 and Ci2 .
In order to use their formulation, we first define the
intrinsic metric we will use in this work. We will use the
general linear (GL)-invariant metric for SPD matrices, which
is defined as follows: Given X; Y as two SPD matrices,
r
        </p>
        <p>
          trace (Log (X 1Y ))2 . For
the Stiefel manifold, we will use the canonical metric [
          <xref ref-type="bibr" rid="ref68">69</xref>
          ].
On the product space, we will use the `1 norm as the
product metric. As the kernel, we will use the Gaussian RBF,
which is defined as follows: Given C1 = (A; X) and
C2 = (B; Y ) as two points on the product space, the kernel,
k (C1; C2) := exp d2 (C1; C2) . Here, d is the product
metric. Given C1 N1 and Ci2 iN=21, the maximum mean
i i=1
discrepancy (MMD) is defined as follows:
        </p>
        <p>For a level test, we reject the null hypothesis H0 =
f samples from the two groups are from same distribution g
if M M D &lt; 2p1= max N1; N2 1 + p log . Finally,
we conclude from the experiments that for our proposed
framework, we reject the null hypothesis with 95%
confidence.
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>A novel GAN based autoencoder is introduced in this paper.
This proposed network performs very well when compared
to an SVM based classifier. Although the SVM classifier
uses labeled training data for classification, it still gives less
than 50% TPR for our high complexity simulated data set.
On the other hand, the proposed GAN based deep
autoencoder gives significantly better performance for two
different types of training scenarios. The proposed GAN based
autoencoder is initially trained on a random Gaussian data
set. Next, orthogonal projection is used to generate a data
set that is perpendicular to the given normal data set. This
orthogonally projected data set is used as an initial
faultincorporated data set for our proposed GAN based
autoencoder for training. Confusion matrices for both training
scenarios are presented, and both of them perform very well
compared to the SVM based classification approach.
Finally, a statistical group test demonstrates that our encoded
normal and GAN based fault-incorporated data spaces (i.e.,
data sets in S0 and S2 spaces, respectively) are statistically
different, and subsequently validates the favorable
performance of our proposed network.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>RJ</given-names>
            <surname>Patton</surname>
          </string-name>
          .
          <article-title>Fault detection and diagnosis in aerospace systems using analytical redundancy</article-title>
          .
          <source>Computing &amp; Control Engineering Journal</source>
          ,
          <volume>2</volume>
          (
          <issue>3</issue>
          ):
          <fpage>127</fpage>
          -
          <lpage>136</lpage>
          ,
          <year>1991</year>
          . 1 Ron
          <string-name>
            <given-names>J</given-names>
            <surname>Patton and Jie Chen</surname>
          </string-name>
          .
          <article-title>Robust fault detection of jet engine sensor systems using eigenstructure assignment</article-title>
          .
          <source>Journal of Guidance</source>
          , Control, and Dynamics,
          <volume>15</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1491</fpage>
          -
          <lpage>1497</lpage>
          ,
          <year>1992</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Ron J Patton and Jie Chen</surname>
          </string-name>
          .
          <article-title>Review of parity space approaches to fault diagnosis for aerospace systems</article-title>
          .
          <source>Journal of Guidance</source>
          , Control, and Dynamics,
          <volume>17</volume>
          :
          <fpage>278</fpage>
          -
          <lpage>278</lpage>
          ,
          <year>1994</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Shuo</given-names>
            <surname>Zhang</surname>
          </string-name>
          and
          <article-title>Miroslav Baric´. A Bayesian approach to hybrid fault detection and isolation</article-title>
          .
          <source>In Decision and Control (CDC)</source>
          ,
          <source>2015 IEEE 54th Annual Conference on</source>
          , pages
          <fpage>4468</fpage>
          -
          <lpage>4473</lpage>
          . IEEE,
          <year>2015</year>
          .
          <volume>1</volume>
          ,
          <fpage>2</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Warren E Dixon</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ian D Walker</surname>
          </string-name>
          ,
          <string-name>
            <surname>Darren M Dawson</surname>
            ,
            <given-names>and John P Hartranft.</given-names>
          </string-name>
          <article-title>Fault detection for robot manipulators with parametric uncertainty: A predictionerror-based approach</article-title>
          .
          <source>IEEE Transactions on Robotics and Automation</source>
          ,
          <volume>16</volume>
          (
          <issue>6</issue>
          ):
          <fpage>689</fpage>
          -
          <lpage>699</lpage>
          ,
          <year>2000</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Aengus</given-names>
            <surname>Murray</surname>
          </string-name>
          , Bruce Hare, and
          <string-name>
            <given-names>Akihiro</given-names>
            <surname>Hirao</surname>
          </string-name>
          .
          <article-title>Resolver position sensing system with integrated fault detection for automotive applications</article-title>
          . In Sensors,
          <year>2002</year>
          . Proceedings of IEEE, volume
          <volume>2</volume>
          , pages
          <fpage>864</fpage>
          -
          <lpage>869</lpage>
          . IEEE,
          <year>2002</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Domenico</given-names>
            <surname>Capriglione</surname>
          </string-name>
          , Consolatina Liguori, Cesare Pianese, and Antonio Pietrosanto.
          <article-title>On-line sensor fault detection, isolation, and accommodation in automotive engines</article-title>
          .
          <source>IEEE Transactions on Instrumentation and Measurement</source>
          ,
          <volume>52</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1182</fpage>
          -
          <lpage>1189</lpage>
          ,
          <year>2003</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Rolf</given-names>
            <surname>Isermann</surname>
          </string-name>
          .
          <article-title>Model-based fault detection and diagnosis-status and applications</article-title>
          .
          <source>IFAC Proceedings Volumes</source>
          ,
          <volume>37</volume>
          (
          <issue>6</issue>
          ):
          <fpage>49</fpage>
          -
          <lpage>60</lpage>
          ,
          <year>2004</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Michael L McIntyre</surname>
          </string-name>
          ,
          <string-name>
            <surname>Warren E Dixon</surname>
          </string-name>
          ,
          <string-name>
            <surname>Darren M Dawson</surname>
            , and
            <given-names>Ian D</given-names>
          </string-name>
          <string-name>
            <surname>Walker</surname>
          </string-name>
          .
          <article-title>Fault detection and identification for robot manipulators</article-title>
          .
          <source>In Robotics and Automation</source>
          ,
          <source>2004. Proceedings. ICRA'04</source>
          . 2004 IEEE International Conference on, volume
          <volume>5</volume>
          , pages
          <fpage>4981</fpage>
          -
          <lpage>4986</lpage>
          . IEEE,
          <year>2004</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Inseok</surname>
            <given-names>Hwang</given-names>
          </string-name>
          , Sungwan Kim, Youdan Kim, and
          <article-title>Chze Eng Seah. A survey of fault detection, isolation, and reconfiguration methods</article-title>
          .
          <source>IEEE Transactions on Control Systems Technology</source>
          ,
          <volume>18</volume>
          (
          <issue>3</issue>
          ):
          <fpage>636</fpage>
          -
          <lpage>653</lpage>
          ,
          <year>2010</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Justin</given-names>
            <surname>Flett and Gary M Bone.</surname>
          </string-name>
          <article-title>Fault detection and diagnosis of diesel engine valve trains</article-title>
          .
          <source>Mechanical Systems and Signal Processing</source>
          ,
          <volume>72</volume>
          :
          <fpage>316</fpage>
          -
          <lpage>327</lpage>
          ,
          <year>2016</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Steven</surname>
            <given-names>K Firth</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kevin J Lomas</surname>
          </string-name>
          , and
          <string-name>
            <surname>Simon J Rees</surname>
          </string-name>
          .
          <article-title>A simple model of PV system performance and its use in fault detection</article-title>
          .
          <source>Solar Energy</source>
          ,
          <volume>84</volume>
          (
          <issue>4</issue>
          ):
          <fpage>624</fpage>
          -
          <lpage>635</lpage>
          ,
          <year>2010</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A</given-names>
            <surname>Chouder</surname>
          </string-name>
          and
          <string-name>
            <given-names>S</given-names>
            <surname>Silvestre</surname>
          </string-name>
          .
          <article-title>Automatic supervision and fault detection of PV systems based on power losses analysis</article-title>
          .
          <source>Energy conversion and Management</source>
          ,
          <volume>51</volume>
          (
          <issue>10</issue>
          ):
          <fpage>1929</fpage>
          -
          <lpage>1937</lpage>
          ,
          <year>2010</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Henry</surname>
            <given-names>Braun</given-names>
          </string-name>
          , Santoshi T Buddha, Venkatachalam Krishnan, Andreas Spanias, Cihan Tepedelenlioglu, Ted Yeider, and
          <string-name>
            <given-names>Toru</given-names>
            <surname>Takehara</surname>
          </string-name>
          .
          <article-title>Signal processing for fault detection in photovoltaic arrays</article-title>
          .
          <source>In Acoustics, Speech and Signal Processing (ICASSP)</source>
          ,
          <year>2012</year>
          IEEE International Conference on, pages
          <fpage>1681</fpage>
          -
          <lpage>1684</lpage>
          . IEEE,
          <year>2012</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Ye</surname>
            <given-names>Zhao</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Ling</given-names>
            <surname>Yang</surname>
          </string-name>
          , Brad Lehman, Jean-Franc¸ois de Palma, Jerry Mosesian, and
          <string-name>
            <given-names>Robert</given-names>
            <surname>Lyons</surname>
          </string-name>
          .
          <article-title>Decision tree-based fault detection and classification in solar photovoltaic arrays</article-title>
          .
          <source>In Applied Power Electronics Conference and Exposition (APEC)</source>
          ,
          <source>2012 TwentySeventh Annual IEEE</source>
          , pages
          <fpage>93</fpage>
          -
          <lpage>99</lpage>
          . IEEE,
          <year>2012</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Yongheng</surname>
            <given-names>Yang</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Frede</given-names>
            <surname>Blaabjerg</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Zhixiang</given-names>
            <surname>Zou</surname>
          </string-name>
          .
          <article-title>Benchmarking of grid fault modes in single-phase grid-connected photovoltaic systems</article-title>
          .
          <source>IEEE Transactions on Industry Applications</source>
          ,
          <volume>49</volume>
          (
          <issue>5</issue>
          ):
          <fpage>2167</fpage>
          -
          <lpage>2176</lpage>
          ,
          <year>2013</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Santiago</surname>
            <given-names>Silvestre</given-names>
          </string-name>
          , Aissa Chouder, and
          <string-name>
            <given-names>Engin</given-names>
            <surname>Karatepe</surname>
          </string-name>
          .
          <article-title>Automatic fault detection in grid connected PV systems</article-title>
          .
          <source>Solar Energy</source>
          ,
          <volume>94</volume>
          :
          <fpage>119</fpage>
          -
          <lpage>127</lpage>
          ,
          <year>2013</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Elyes</surname>
            <given-names>Garoudja</given-names>
          </string-name>
          , Fouzi Harrou, Ying Sun, Kamel Kara, Aissa Chouder, and
          <string-name>
            <given-names>Santiago</given-names>
            <surname>Silvestre</surname>
          </string-name>
          .
          <article-title>A statisticalbased approach for fault detection and diagnosis in a photovoltaic system</article-title>
          .
          <source>In Systems and Control (ICSC)</source>
          ,
          <year>2017</year>
          6th International Conference on, pages
          <fpage>75</fpage>
          -
          <lpage>80</lpage>
          . IEEE,
          <year>2017</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>ME</given-names>
            <surname>Ropp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M</given-names>
            <surname>Begovic</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A</given-names>
            <surname>Rohatgi</surname>
          </string-name>
          .
          <article-title>Prevention of islanding in grid-connected photovoltaic systems</article-title>
          .
          <source>Progress in Photovoltaics: Research and Applications</source>
          ,
          <volume>7</volume>
          (
          <issue>1</issue>
          ):
          <fpage>39</fpage>
          -
          <lpage>59</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Zhimin</surname>
            <given-names>Du</given-names>
          </string-name>
          , Bo Fan, Xinqiao Jin, and
          <string-name>
            <given-names>Jinlei</given-names>
            <surname>Chi</surname>
          </string-name>
          .
          <article-title>Fault detection and diagnosis for buildings and HVAC systems using combined neural networks and subtractive clustering analysis</article-title>
          .
          <source>Building and Environment</source>
          ,
          <volume>73</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          ,
          <year>2014</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Dian-ce Gao</surname>
          </string-name>
          , Shengwei Wang,
          <string-name>
            <surname>Kui Shan</surname>
            , and
            <given-names>Chengchu</given-names>
          </string-name>
          <string-name>
            <surname>Yan</surname>
          </string-name>
          .
          <article-title>A system-level fault detection and diagnosis method for low delta-t syndrome in the complex hvac systems</article-title>
          .
          <source>Applied Energy</source>
          ,
          <volume>164</volume>
          :
          <fpage>1028</fpage>
          -
          <lpage>1038</lpage>
          ,
          <year>2016</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [22] Richard Vernon Beard.
          <article-title>Failure accomodation in linear systems through self-reorganization</article-title>
          .
          <source>PhD thesis</source>
          , Massachusetts Institute of Technology,
          <year>1971</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Harold</surname>
            <given-names>Lee</given-names>
          </string-name>
          <string-name>
            <surname>Jones</surname>
          </string-name>
          .
          <article-title>Failure detection in linear systems</article-title>
          .
          <source>PhD thesis</source>
          , Massachusetts Institute of Technology,
          <year>1973</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [24]
          <string-name>
            <surname>M-A Massoumnia</surname>
          </string-name>
          ,
          <article-title>George C Verghese, and Alan S Willsky. Failure detection and identification</article-title>
          .
          <source>IEEE transactions on automatic control</source>
          ,
          <volume>34</volume>
          (
          <issue>3</issue>
          ):
          <fpage>316</fpage>
          -
          <lpage>321</lpage>
          ,
          <year>1989</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Paul</surname>
            <given-names>M Frank.</given-names>
          </string-name>
          <article-title>Fault diagnosis in dynamic systems using analytical and knowledge-based redundancy: A survey and some new results</article-title>
          .
          <source>Automatica</source>
          ,
          <volume>26</volume>
          (
          <issue>3</issue>
          ):
          <fpage>459</fpage>
          -
          <lpage>474</lpage>
          ,
          <year>1990</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>RJ</given-names>
            <surname>Patton</surname>
          </string-name>
          and
          <string-name>
            <given-names>J</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <article-title>Observer-based fault detection and isolation: Robustness and applications</article-title>
          .
          <source>Control Engineering Practice</source>
          ,
          <volume>5</volume>
          (
          <issue>5</issue>
          ):
          <fpage>671</fpage>
          -
          <lpage>682</lpage>
          ,
          <year>1997</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Christopher</surname>
            <given-names>Edwards</given-names>
          </string-name>
          ,
          <article-title>Sarah K Spurgeon,</article-title>
          and
          <string-name>
            <surname>Ron</surname>
          </string-name>
          J Patton.
          <article-title>Sliding mode observers for fault detection and isolation</article-title>
          .
          <source>Automatica</source>
          ,
          <volume>36</volume>
          (
          <issue>4</issue>
          ):
          <fpage>541</fpage>
          -
          <lpage>553</lpage>
          ,
          <year>2000</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>Chee</given-names>
            <surname>Pin</surname>
          </string-name>
          Tan and
          <string-name>
            <given-names>Christopher</given-names>
            <surname>Edwards</surname>
          </string-name>
          .
          <article-title>Sliding mode observers for robust detection and reconstruction of actuator and sensor faults</article-title>
          .
          <source>International Journal of Robust and Nonlinear Control</source>
          ,
          <volume>13</volume>
          (
          <issue>5</issue>
          ):
          <fpage>443</fpage>
          -
          <lpage>463</lpage>
          ,
          <year>2003</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>Rolf</given-names>
            <surname>Isermann</surname>
          </string-name>
          and
          <string-name>
            <given-names>Peter</given-names>
            <surname>Balle</surname>
          </string-name>
          .
          <article-title>Trends in the application of model-based fault detection and diagnosis of technical processes</article-title>
          .
          <source>Control Engineering Practice</source>
          ,
          <volume>5</volume>
          (
          <issue>5</issue>
          ):
          <fpage>709</fpage>
          -
          <lpage>719</lpage>
          ,
          <year>1997</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Robert</given-names>
            <surname>Hermann and Arthur Krener</surname>
          </string-name>
          .
          <article-title>Nonlinear controllability and observability</article-title>
          .
          <source>IEEE Transactions on automatic control</source>
          ,
          <volume>22</volume>
          (
          <issue>5</issue>
          ):
          <fpage>728</fpage>
          -
          <lpage>740</lpage>
          ,
          <year>1977</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Claudio De Persis and Alberto Isidori</surname>
          </string-name>
          .
          <article-title>A geometric approach to nonlinear fault detection and isolation</article-title>
          .
          <source>IEEE transactions on automatic control</source>
          ,
          <volume>46</volume>
          (
          <issue>6</issue>
          ):
          <fpage>853</fpage>
          -
          <lpage>865</lpage>
          ,
          <year>2001</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>Alberto</given-names>
            <surname>Isidori</surname>
          </string-name>
          .
          <source>Nonlinear control systems. Springer Science &amp; Business Media</source>
          ,
          <year>2013</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>Agostino</given-names>
            <surname>Martinelli</surname>
          </string-name>
          .
          <article-title>Nonlinear unknown input observability: The general analytic solution</article-title>
          .
          <source>arXiv preprint arXiv:1704.03252</source>
          ,
          <year>2017</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [34]
          <string-name>
            <surname>Peter S Maybeck</surname>
          </string-name>
          .
          <article-title>Multiple model adaptive algorithms for detecting and compensating sensor and actuator/surface failures in aircraft flight control systems</article-title>
          .
          <source>International Journal of Robust and Nonlinear Control</source>
          ,
          <volume>9</volume>
          (
          <issue>14</issue>
          ):
          <fpage>1051</fpage>
          -
          <lpage>1070</lpage>
          ,
          <year>1999</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Elgersma</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sonja</given-names>
            <surname>Glavaski</surname>
          </string-name>
          .
          <article-title>Reconfigurable control for active management of aircraft system failures</article-title>
          .
          <source>In American Control Conference</source>
          ,
          <year>2001</year>
          .
          <source>Proceedings of the 2001</source>
          , volume
          <volume>4</volume>
          , pages
          <fpage>2627</fpage>
          -
          <lpage>2639</lpage>
          . IEEE,
          <year>2001</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>JD</given-names>
            <surname>Boskovic</surname>
          </string-name>
          ,
          <string-name>
            <surname>S-M Li</surname>
          </string-name>
          ,
          <article-title>and Raman K Mehra</article-title>
          .
          <article-title>Intelligent spacecraft control using multiple models, switching, and tuning</article-title>
          .
          <source>In Intelligent Control/Intelligent Systems and Semiotics</source>
          ,
          <year>1999</year>
          .
          <source>Proceedings of the 1999 IEEE International Symposium on</source>
          , pages
          <fpage>84</fpage>
          -
          <lpage>89</lpage>
          . IEEE,
          <year>1999</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [37]
          <string-name>
            <surname>Jovan</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Boskovic</surname>
          </string-name>
          ,
          <string-name>
            <surname>Sai-Ming Li</surname>
          </string-name>
          , and
          <article-title>Raman K Mehra</article-title>
          .
          <article-title>On-line failure detection and identification (fdi) and adaptive reconfigurable control (ARC) in aerospace applications</article-title>
          . In American Control Conference,
          <year>2001</year>
          .
          <source>Proceedings of the 2001</source>
          , volume
          <volume>4</volume>
          , pages
          <fpage>2625</fpage>
          -
          <lpage>2626</lpage>
          . IEEE,
          <year>2001</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [38]
          <string-name>
            <surname>Jo</surname>
          </string-name>
          <article-title>´zsef Bokor and Zolta´n Szabo´. Fault detection and isolation in nonlinear systems</article-title>
          .
          <source>Annual Reviews in Control</source>
          ,
          <volume>33</volume>
          (
          <issue>2</issue>
          ):
          <fpage>113</fpage>
          -
          <lpage>123</lpage>
          ,
          <year>2009</year>
          . 1
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>SX</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P</given-names>
            <surname>Zhang</surname>
          </string-name>
          , A Naik, EL Ding, and
          <string-name>
            <given-names>B</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <article-title>Subspace method aided data-driven design of fault detection and isolation systems</article-title>
          .
          <source>Journal of Process Control</source>
          ,
          <volume>19</volume>
          (
          <issue>9</issue>
          ):
          <fpage>1496</fpage>
          -
          <lpage>1510</lpage>
          ,
          <year>2009</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>Jianfei</given-names>
            <surname>Dong</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michel</given-names>
            <surname>Verhaegen</surname>
          </string-name>
          .
          <article-title>Data driven fault detection and isolation of a wind turbine benchmark</article-title>
          .
          <source>IFAC Proceedings Volumes</source>
          ,
          <volume>44</volume>
          (
          <issue>1</issue>
          ):
          <fpage>7086</fpage>
          -
          <lpage>7091</lpage>
          ,
          <year>2011</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [41]
          <string-name>
            <surname>Xiao</surname>
            <given-names>He</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zidong</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Yang Liu, and
          <string-name>
            <given-names>DH</given-names>
            <surname>Zhou</surname>
          </string-name>
          .
          <article-title>Least-squares fault detection and diagnosis for networked sensing systems using a direct state estimation approach</article-title>
          .
          <source>IEEE Transactions on Industrial Informatics</source>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ):
          <fpage>1670</fpage>
          -
          <lpage>1679</lpage>
          ,
          <year>2013</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [42]
          <string-name>
            <surname>Andre</surname>
            <given-names>Lemos</given-names>
          </string-name>
          , Walmir Caminhas, and
          <string-name>
            <given-names>Fernando</given-names>
            <surname>Gomide</surname>
          </string-name>
          .
          <article-title>Adaptive fault detection and diagnosis using an evolving fuzzy classifier</article-title>
          .
          <source>Information Sciences</source>
          ,
          <volume>220</volume>
          :
          <fpage>64</fpage>
          -
          <lpage>85</lpage>
          ,
          <year>2013</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [43]
          <string-name>
            <surname>Jinane</surname>
            <given-names>Harmouche</given-names>
          </string-name>
          , Claude Delpha, and
          <string-name>
            <given-names>Demba</given-names>
            <surname>Diallo</surname>
          </string-name>
          .
          <article-title>Incipient fault detection and diagnosis based on kullback-leibler divergence using principal component analysis: Part i</article-title>
          .
          <source>Signal Processing</source>
          ,
          <volume>94</volume>
          :
          <fpage>278</fpage>
          -
          <lpage>287</lpage>
          ,
          <year>2014</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [44]
          <string-name>
            <surname>Shiyi</surname>
            <given-names>Bao</given-names>
          </string-name>
          , Lijia Luo, Jianfeng Mao, and
          <string-name>
            <given-names>Di</given-names>
            <surname>Tang</surname>
          </string-name>
          .
          <article-title>Improved fault detection and diagnosis using sparse global-local preserving projections</article-title>
          .
          <source>Journal of Process Control</source>
          ,
          <volume>47</volume>
          :
          <fpage>121</fpage>
          -
          <lpage>135</lpage>
          ,
          <year>2016</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [45]
          <string-name>
            <surname>Ian</surname>
            <given-names>Goodfellow</given-names>
          </string-name>
          , Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and
          <string-name>
            <given-names>Yoshua</given-names>
            <surname>Bengio</surname>
          </string-name>
          .
          <article-title>Generative adversarial nets</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>2672</fpage>
          -
          <lpage>2680</lpage>
          ,
          <year>2014</year>
          .
          <volume>2</volume>
          ,
          <issue>3</issue>
          ,
          <fpage>4</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [46]
          <string-name>
            <surname>Shibani</surname>
            <given-names>Santurkar</given-names>
          </string-name>
          , Ludwig Schmidt, and
          <string-name>
            <given-names>Aleksander</given-names>
            <surname>Madry</surname>
          </string-name>
          .
          <article-title>A classification-based perspective on GAN distributions</article-title>
          .
          <source>arXiv preprint arXiv:1711.00970</source>
          ,
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [47]
          <string-name>
            <surname>Ilya</surname>
            <given-names>O Tolstikhin</given-names>
          </string-name>
          , Sylvain Gelly, Olivier Bousquet,
          <string-name>
            <surname>Carl-Johann</surname>
          </string-name>
          Simon-Gabriel, and Bernhard Scho¨lkopf. Adagan:
          <article-title>Boosting generative models</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>5430</fpage>
          -
          <lpage>5439</lpage>
          ,
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [48]
          <string-name>
            <surname>Martin</surname>
            <given-names>Arjovsky</given-names>
          </string-name>
          , Soumith Chintala, and Le´on Bottou.
          <article-title>Wasserstein gan</article-title>
          .
          <source>arXiv preprint arXiv:1701.07875</source>
          ,
          <year>2017</year>
          .
          <volume>2</volume>
          ,
          <fpage>4</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>LI</given-names>
            <surname>Chongxuan</surname>
          </string-name>
          , Taufik Xu,
          <string-name>
            <given-names>Jun</given-names>
            <surname>Zhu</surname>
          </string-name>
          , and Bo Zhang.
          <article-title>Triple generative adversarial nets</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>4091</fpage>
          -
          <lpage>4101</lpage>
          ,
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>Min</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <source>Softmax GAN. arXiv:1704.06191</source>
          ,
          <year>2017</year>
          . 2 arXiv preprint
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [51]
          <article-title>Parametrizing filters of a CNN with a GAN, author=Kilcher, Yannic and Becigneul, Gary</article-title>
          and Hofmann, Thomas,
          <source>journal=arXiv preprint arXiv:1710</source>
          .11386,
          <issue>year</issue>
          =
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          [52]
          <article-title>How to train your DRAGAN, author=Kodali, Naveen and Abernethy, Jacob and Hays, James and Kira, Zsolt</article-title>
          ,
          <source>journal=arXiv preprint arXiv:1705</source>
          .07215,
          <issue>year</issue>
          =
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          [53]
          <string-name>
            <surname>Panpan</surname>
            <given-names>Zheng</given-names>
          </string-name>
          , Shuhan Yuan, Xintao Wu,
          <string-name>
            <given-names>Jun</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Aidong</given-names>
            <surname>Lu</surname>
          </string-name>
          .
          <article-title>One-class adversarial nets for fraud detection</article-title>
          .
          <source>arXiv preprint arXiv:1803.01798</source>
          ,
          <year>2018</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          [54]
          <string-name>
            <given-names>Tarik</given-names>
            <surname>Arici</surname>
          </string-name>
          and
          <string-name>
            <given-names>Asli</given-names>
            <surname>Celikyilmaz</surname>
          </string-name>
          .
          <article-title>Associative adversarial networks</article-title>
          .
          <source>arXiv preprint arXiv:1611.06953</source>
          ,
          <year>2016</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref54">
        <mixed-citation>
          [55]
          <string-name>
            <given-names>Yunus</given-names>
            <surname>Saatci</surname>
          </string-name>
          and Andrew G Wilson.
          <article-title>Bayesian GAN</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>3622</fpage>
          -
          <lpage>3631</lpage>
          ,
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          [56]
          <string-name>
            <surname>Alireza</surname>
            <given-names>Makhzani</given-names>
          </string-name>
          , Jonathon Shlens, Navdeep Jaitly, and
          <string-name>
            <given-names>Ian</given-names>
            <surname>Goodfellow</surname>
          </string-name>
          .
          <article-title>Adversarial autoencoders</article-title>
          .
          <source>In International Conference on Learning Representations</source>
          ,
          <year>2016</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          [57]
          <string-name>
            <surname>Dmitry</surname>
            <given-names>Ulyanov</given-names>
          </string-name>
          , Andrea Vedaldi, and
          <string-name>
            <given-names>Victor</given-names>
            <surname>Lempitsky</surname>
          </string-name>
          .
          <article-title>It takes (only) two: Adversarial generator-encoder networks</article-title>
          .
          <year>2018</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          [58]
          <string-name>
            <surname>Alex</surname>
            <given-names>Kurakin</given-names>
          </string-name>
          , Dan Boneh, Florian Tramr, Ian Goodfellow, Nicolas Papernot, and
          <string-name>
            <surname>Patrick McDaniel</surname>
          </string-name>
          .
          <article-title>Ensemble adversarial training: Attacks and defenses</article-title>
          .
          <year>2018</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref58">
        <mixed-citation>
          [59]
          <string-name>
            <surname>Aurko</surname>
            <given-names>Roy</given-names>
          </string-name>
          , Colin Raffel, Ian Goodfellow, and
          <string-name>
            <given-names>Jacob</given-names>
            <surname>Buckman</surname>
          </string-name>
          .
          <article-title>Thermometer encoding: One hot way to resist adversarial examples</article-title>
          .
          <year>2018</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref59">
        <mixed-citation>
          [60]
          <string-name>
            <surname>Shiwei</surname>
            <given-names>Shen</given-names>
          </string-name>
          , Guoqing Jin,
          <string-name>
            <given-names>Ke</given-names>
            <surname>Gao</surname>
          </string-name>
          , and Yongdong Zhang.
          <article-title>AE-GAN: adversarial eliminating with GAN</article-title>
          .
          <source>arXiv preprint arXiv:1707.05474</source>
          ,
          <year>2017</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref60">
        <mixed-citation>
          [61]
          <string-name>
            <surname>Pouya</surname>
            <given-names>Samangouei</given-names>
          </string-name>
          , Maya Kabkab, and
          <string-name>
            <given-names>Rama</given-names>
            <surname>Chellappa</surname>
          </string-name>
          .
          <article-title>Defense-gan: Protecting classifiers against adversarial attacks using generative models</article-title>
          .
          <source>arXiv preprint arXiv:1805.06605</source>
          ,
          <year>2018</year>
          . 2
        </mixed-citation>
      </ref>
      <ref id="ref61">
        <mixed-citation>
          [62]
          <string-name>
            <surname>Yoshua</surname>
            <given-names>Bengio</given-names>
          </string-name>
          , Pascal Lamblin, Dan Popovici, and
          <string-name>
            <given-names>Hugo</given-names>
            <surname>Larochelle</surname>
          </string-name>
          .
          <article-title>Greedy layer-wise training of deep networks</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , pages
          <fpage>153</fpage>
          -
          <lpage>160</lpage>
          ,
          <year>2007</year>
          . 3
        </mixed-citation>
      </ref>
      <ref id="ref62">
        <mixed-citation>
          [63]
          <string-name>
            <surname>Bernhard</surname>
            <given-names>E Boser</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isabelle M Guyon</surname>
          </string-name>
          , and
          <string-name>
            <surname>Vladimir N Vapnik</surname>
          </string-name>
          .
          <article-title>A training algorithm for optimal margin classifiers</article-title>
          .
          <source>In Proceedings of the Fifth Annual Workshop on Computational Learning Theory</source>
          , pages
          <fpage>144</fpage>
          -
          <lpage>152</lpage>
          . ACM,
          <year>1992</year>
          . 5
        </mixed-citation>
      </ref>
      <ref id="ref63">
        <mixed-citation>
          [64]
          <string-name>
            <given-names>Vladimir</given-names>
            <surname>Naumovich</surname>
          </string-name>
          Vapnik and
          <string-name>
            <given-names>Vlamimir</given-names>
            <surname>Vapnik</surname>
          </string-name>
          .
          <source>Statistical Learning Theory</source>
          , volume
          <volume>1</volume>
          . Wiley New York,
          <year>1998</year>
          . 5
        </mixed-citation>
      </ref>
      <ref id="ref64">
        <mixed-citation>
          [65]
          <string-name>
            <given-names>Nello</given-names>
            <surname>Cristianini and John Shawe-Taylor</surname>
          </string-name>
          .
          <article-title>An Introduction to Support Vector Machines</article-title>
          . Cambridge University Press, Cambridge, UK,
          <year>2000</year>
          . 5
        </mixed-citation>
      </ref>
      <ref id="ref65">
        <mixed-citation>
          [66]
          <string-name>
            <surname>Ke-Lin</surname>
            <given-names>Du</given-names>
          </string-name>
          and
          <string-name>
            <given-names>MNS</given-names>
            <surname>Swamy</surname>
          </string-name>
          .
          <article-title>Radial basis function networks</article-title>
          .
          <source>In Neural Networks and Statistical Learning</source>
          , pages
          <fpage>299</fpage>
          -
          <lpage>335</lpage>
          . Springer,
          <year>2014</year>
          . 5
        </mixed-citation>
      </ref>
      <ref id="ref66">
        <mixed-citation>
          [67]
          <string-name>
            <given-names>Silvere</given-names>
            <surname>Bonnabel</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rodolphe</given-names>
            <surname>Sepulchre</surname>
          </string-name>
          .
          <article-title>Riemannian metric and geometric mean for positive semidefinite matrices of fixed rank</article-title>
          .
          <source>SIAM Journal on Matrix Analysis and Applications</source>
          ,
          <volume>31</volume>
          (
          <issue>3</issue>
          ):
          <fpage>1055</fpage>
          -
          <lpage>1070</lpage>
          ,
          <year>2009</year>
          . 6
        </mixed-citation>
      </ref>
      <ref id="ref67">
        <mixed-citation>
          [68]
          <string-name>
            <surname>Arthur</surname>
            <given-names>Gretton</given-names>
          </string-name>
          , Kenji Fukumizu, Zaid Harchaoui, and
          <article-title>Bharath K Sriperumbudur. A fast, consistent kernel two-sample test</article-title>
          .
          <source>In Advances in Neural Information Processing systems</source>
          , pages
          <fpage>673</fpage>
          -
          <lpage>681</lpage>
          ,
          <year>2009</year>
          . 6
        </mixed-citation>
      </ref>
      <ref id="ref68">
        <mixed-citation>
          [69]
          <string-name>
            <surname>Tetsuya</surname>
            <given-names>Kaneko</given-names>
          </string-name>
          , Simone Fiori, and
          <string-name>
            <given-names>Toshihisa</given-names>
            <surname>Tanaka</surname>
          </string-name>
          .
          <article-title>Empirical arithmetic averaging over the compact Stiefel manifold</article-title>
          .
          <source>IEEE Transactions on Signal Processing</source>
          ,
          <volume>61</volume>
          (
          <issue>4</issue>
          ):
          <fpage>883</fpage>
          -
          <lpage>894</lpage>
          ,
          <year>2013</year>
          . 6
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>