<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Framework For Assessing Diagnostics Model Fidelity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gregory Provan</string-name>
          <email>g.provan@cs.ucc.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alex Feldman</string-name>
          <email>afeldman@parc.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computer Science Department, University College Cork</institution>
          ,
          <addr-line>Cork</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>PARC Inc.</institution>
          ,
          <addr-line>Palo Alto, CA 94304</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <fpage>127</fpage>
      <lpage>134</lpage>
      <abstract>
        <p>“All models are wrong but some are useful" [1]. We address the problem of identifying which diagnosis models are more useful than others. Models are critical to diagnostics inference, yet little work exists to be able to compare models. We define the role of models in diagnostics inference, propose metrics for models, and apply these metrics to a tank benchmark system. Given the many approaches possible for model metrics, we argue that only information-theoretic methods address how well a model mimics real-world data. We focus on some well-known information-theoretic modelling metrics, demonstrating the trade-offs that can be made on different models for a tank benchmark system.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>A core goal of Model-Based Diagnostics (MBD) is to
accurately diagnose a range of systems in real-world
applications. There has been significant progress in developing
algorithms for systems of increasing complexity. A key
area where further work is needed is scaling-up to
realworld models, as multiple-fault diagnostics algorithms are
currently limited by the size and complexity of the models
to which they can be applied. In addition, there is still a great
need for defining metrics to measure diagnostics accuracy,
and to measure the computational complexity of inference
and of the models’ contribution to inference complexity.</p>
      <p>This article addresses the modeling side of MBD: we
focus on methods for measuring the size and complexity of
MBD models. We explore the role that diagnostics model
fidelity can play in being able to generate accurate
diagnostics. We characterise model fidelity and examine the
tradeoffs of fidelity and inference complexity within the overall
MBD inference task.</p>
      <p>
        Model fidelity is a crucial issue in diagnostics [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]:
models that are too simple can be inaccurate, yet highly detailed
and complex models are expensive to create, have many
parameters that require significant amounts of data to estimate,
and are computationally intensive to perform inference on.
There is an urgent need to incorporate inference complexity
within modelling, since even relatively simple models, such
as some of the combinational ISCAS-85 benchmark models,
pose computational challenges to even the most advanced
solvers for multiple-fault tasks. In addition, higher-fidelity
models can actually perform worse than lower-fidelity
models on real-world data, as can be explained using over-fitting
arguments within a machine learning framework.
      </p>
      <p>To our knowledge, there is no theory within Model-Based
Diagnostics that relates notions of model complexity, model
accuracy, and inference complexity. To address these issues,
we explore several of the factors that contribute to model
complexity, as well as a theoretically sound approach for
selecting models based on their complexity and diagnostics
performance, i.e., their accuracy in diagnosing faults.</p>
      <p>Our contributions are as follows:
• We characterise the task of selecting a diagnosis model
of appropriate fidelity as an information-theoretic
model selection task.
• We propose several metrics for assessing the quality of
a diagnosis model, and derive approximation versions
of a subset of these metrics.
• We use a dynamical systems benchmark model to
demonstrate our compare how the metrics assess
models relative to the accuracy of diagnostics output based
on using the models.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>This section reviews work related to our proposed approach.</p>
      <p>Model-Based Diagnostics: There is some seminal work
on modelling principles within the Model-Based Diagnosis
(MBD) community, e.g., [2; 3]; this early work adopts an
approach based on logic or qualitative physics for model
specification. However, this work provides no means for
comparing models in terms of diagnostics accuracy. More
recent work ([4]) provides a logic-based specification of
model fidelity. There is also work specifying metrics for
diagnostics accuracy, e.g., [5].</p>
      <p>However, none of this work defines precise metrics for
computing both diagnostics accuracy and model
complexity, and their trade-offs. This article adopts a theoretically
well-founded approach for integrating multiple MBD
metrics.</p>
      <p>Multiple Fidelity Modeling There is limited work
describing the use of models of multiple levels of fidelity.
Examples of such work includes [6; 7; 8]. In this article we
focus on methods for evaluating multi-fidelity models and
their impact on diagnostics accuracy, as opposed to
developing methodoligies for modelling at multiple levels of
fidelity.</p>
      <sec id="sec-2-1">
        <title>Multiple-Mode Modeling One approach to MBD is to</title>
        <p>use a separate model for every failure mode, rather than to
define a model containing all failure modes. Examples of
this approach include [9; 10; 11; 12]. Note that this work
does not specify metrics for computing both diagnostics
accuracy and model complexity, or their trade-offs.</p>
        <p>
          Model- Selection The metrics that we adopt and extend
have been used extensively to compare different models,
e.g., [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The metrics are used to compare simulation
performance of models only. In contrast, we extend this
framework to examine diagnostics performance. In the process,
we explore the use of multiple loss functions for penalising
models, in addition to the standard penalty functions based
on number of model parameters.
        </p>
        <p>
          Model-Order Reduction Model-Order reduction [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]
aims to reduce the complexity of a model with an aim to
limit the performance losses of the reduced model. The
reduction methods are theoretically well-founded, although
they are highly domain-specific. In contrast to this
approach, we assume a model-composition approach from a
component library containing hand-constructed models of
multiple levels of fidelity.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Diagnostics Modeling and Inference</title>
      <p>This section formalises the notion of diagnostics model
within the process of diagnostics inference. We first
introduce the task, and then define it more precisely.</p>
      <sec id="sec-3-1">
        <title>3.1 Diagnosis Task</title>
        <p>Assume that we have a system S that can operate in a
nominal state, ξN , or a faulty state, ξF , where Ξ is the set of
possible states of S. We further assume that we have a
discrete vector of measurements, Y˜ = {y˜1, ..., y˜n} observed
at times t = {1, ..., n} that summarizes the response of
the system S to control variables U = {u1, ..., un}. Let
Yφ = {y1, ..., yn} denote the corresponding predictions
from a dynamic (nonlinear) model, φ, with parameter values
θ: this can be represented by Yφ = φ(x0, θ, ξ, U˜ ), where x0
signifies the initial states of the system att0.</p>
        <p>We assume that we have a prior probability distribution
P (Ξ) over the states Ξ of the system. This distribution
denotes the likelihood of the failure states of the system.</p>
        <p>We define a residual vectorR(Y˜ , Yφ) to capture the
difference between the actual and model-simulated system
behaviour. An example of a residual vector is the
meansquared-error (MSE). We assume a fixed diagnosis task T
throughout this article, e.g., computing the most likely
diagnosis, or a deterministic multiple-fault diagnosis.</p>
        <p>The classical definition of diagnosis is as a state
estimation task, whose objective is to identify the system state that
minimises the residual vector:
ξ∗ = argmin R(Y˜ , Yφ)
ξ∈Ξ
(1)
Since this is a minimisation task, we typically need to
run multiple simulations over the space of parameters and
modes to compute ξ∗. We can abstract this process as
performing model-inversion, i.e., computing some ξ∗ =
φ−1(x0, θ, ξ, U˜ ) that minimises R(Y˜ , Yφ).</p>
        <p>During this diagnostics inference task, a model φ can play
two roles: (a) simulating a behaviour to estimate R(Y˜ , Yφ);
(b) enabling the computation of ξ∗ = φ−1(x0, θ, ξ, U˜ ). It
is clear that diagnostics inference requires a model that has
good fidelity and is computationally efficient for performing
these two roles.</p>
        <p>We generalise that notion to incorporate inference
efficiency as well as accuracy. We can define an inference
complexity measure as C(Y˜ , φ). We can then define our
diagnosis task as jointly minimising a function g that incorporates
the accuracy (based on the residual function) and the
inference complexity:
ξ∗ = argmin g
ξ∈Ξ</p>
        <p>R(Y˜ , Yφ), C(Y˜ , φ) .</p>
        <p>(2)
Here g specifies a loss or penalty function that induces a
non-negative real-valued penalty based on the lack of
accuracy and computational cost.</p>
        <p>
          In forward simulation, a model φ, with parameters θ, can
generate multiple observations Y˜ = {y˜1, ..., y˜n}. The
diagnostics task involves performing the inverse operation on
these observations. Our objective thus involves optimising
the state estimation task over a future set of observations,
Y˜ = {Y˜1, ..., Y˜n}. Our model φ and inference algorithm
A have different performance based on Y˜i, i = 1, ..., n: for
example, [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] shows that both inference-accuracy and -time
vary based on the fault cardinality . As a consequence, to
compute ξ∗ we want to optimise the mean performance over
future observations. This notion of mean performance
optimisation has been characterised using the Bayesian model
selection approach, which we examine in the following
section.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Diagnosis Model</title>
        <sec id="sec-3-2-1">
          <title>We specify a diagnosis model as follows:</title>
          <p>Definition 1 (Diagnosis Model). We characterise a
Diagnosis Model φ using the tuple hV , θ, Ξ, E i, where
• V is a set of variables, consisting of variables denoting
the system state (X), control (U ), and observations
(Y ).
• θ is a set of parameters.
• Ξ is a set of system modes.
• E is a set of equations, with a subset Eξ ⊆ E for each
mode ξ ∈ Ξ.</p>
          <p>We will assume that we can use a physics-based approach
to hand-generate a set E of equations to specify a model.
Obtaining good diagnostics accuracy, given a fixed E ,
entails estimating the parameters θ to optimise that accuracy.
3.3</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Running Example: Three-Tank Benchmark</title>
        <p>In this paper, we use the three-tank system shown in Fig. 1
to illustrate our approach. The three tanks are denoted as T1,
T2, and T3. Each tank has the same area A1 = A2 = A3.
For i = 1, 2, 3, tank Ti has height hi, a pressure sensor pi,
and a valve Vi, i = 1, 2, 3 that controls the flow of liquid
out of Ti. We assume that gravity g = 10 and the liquid has
density ρ = 1.</p>
        <p>Tank T1 gets filled from a pipe, with measured flow q0.
Using Torricelli’s law, the model can be described by the
following non-linear equations:
dh1
dt
dh2
dt
dh3
dt
=
=
=
1 h</p>
        <p>In eq. 3, the coefficientκ1 denotes a parameter that
captures the product of the cross-sectional area of the tank
A1, the area of the drainage hole, a gravity-based constant
(√2g), and the friction/contraction factor of the hole. κ2
and κ3 can be defined analogously.</p>
        <p>Finally, the pressure at the bottom of each tank is obtained
from the height: pi = g hi, where i is the tank index (i ∈
{1, 2, 3}).</p>
        <p>We emphasize the use of the κi, i = 1, 2, 3 because we
will use these parameter-values as a means for
“diagnosing” our system in term of changes in κi, i = 1, 2, 3.
Consider a physical valve R1 between T1 and T2 that constraints
the flow between the two tanks. We can say that the valve
changes proportionally the cross-sectional drainage area of
q1 and hence κ1. The diagnostic task will be to compute the
true value of κ1, given p1, and from κ1 we can compute the
actual position of the valve R1.</p>
        <p>We now characterise our nominal model in terms of
Definition 1:
• variables V consist of variables denoting
the system state (X = {h1, h2, h3}),
control (U = {q0, V1, V2, V3}), and observations
(Y = {p1, p2, p3}).
• θ = {{A1, A2, A3}, {κ1, κ2, κ3}} is the set of
parameters.
• Ξ consists of a single nominal mode.
• E is a set of equations, given by equations 3 through 5.
Note that this model has a total of 6 parameters.</p>
        <p>Fault Model In this article we focus on valve faults,
where a valve can have a blockage or a leak. We model
this class of faults by including in equations 3 to 5 an
additive parameter β, which is applied to the parameter κ, i.e., as
1
κi(1+βi), i = 1, 2, 3, where −1 ≤ βi ≤ κi −1, i = 1, 2, 3.
β &gt; 0 corresponds to a leak, such that β ∈ (0, 1/κ − 1];
β &lt; 0 corresponds to a blockage, such that β ∈ [−1, 0).
The fault equations can be written as:
dh1</p>
        <p>The fault equations allow faults for any combination of
the valves {V1, V2, V3}, resulting in system modes Ξ =
{ξN , ξ1, ξ2, ξ3, ξ12, ξ13, ξ23, ξ123}, where ξN is the nominal
mode, and ξ· is the mode where · denotes the combination
of valves (taken from a combination of {1, 2, 3}) which are
faulty. This fault model has 9 parameters.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Modelling Metrics</title>
      <p>This section describes the metrics that can be applied to
estimate properties of a diagnosis model. We describe two types
of metrics, dealing with accuracy (fidelity) and complexity.
4.1</p>
      <sec id="sec-4-1">
        <title>Model Accuracy</title>
        <p>Model accuracy concerns the ability of a model to mimic a
real system. From a diagnostics perspective, this translates
to the use of a model to simulate behaviours that distinguish
nominal and faulty behaviours sufficiently well that
appropriate fault isolation algorithms can identify the correct type
of fault when it occurs. As such, a diagnostics model needs
to be able to simulate behaviours for multiple modes with
“appropriate" fidelity.</p>
        <p>Note that we distinguish model accuracy from diagnosis
inference accuracy. As noted above, model accuracy
concerns the ability of a model to mimic a real system through
simulation, and to assist in diagnostics isolation. Diagnosis
inference accuracy concerns being able to isolate the true
fault given an observation and the simulation output of a
model.</p>
        <p>A significant challenge for a diagnosis model is the need
to simulate behaviours for multiple modes. Two approaches
that have been taken are to use a single model with multiple
modes explicitly defined (a multi-mode approach), or to use
multiple models [9; 16; 17], each of which is optimised for
a single or small set of modes (a multi-model approach).</p>
        <p>
          The AI-based MBD approach typically uses a single
model φ with multiple modes explicitly defined [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], or a
single model with just nominal behaviour [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. From a
diagnostics perspective, accuracy must be defined with respect
to the task T . We adopt here the task of computing the
mostlikely diagnosis.
        </p>
        <p>Given evidence suggesting that model fidelity for a
multimode approach varies depending on the mode, it is
important to explicitly consider the mean performance of φ over
the entire observation space Y (the space of possible
observations of the system).</p>
        <p>In this article we adopt the expected residual approach,
i.e., given a space Y = {Y˜1, ..., Y˜n} of observations, the
expected residual is the average over the n observations, e.g.,
as given by: R¯ = n1 Pn</p>
        <p>i=1 R(Y˜i, Yφ).
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Model Complexity</title>
        <p>At present, there is no commonly-accepted definition of
model complexity, whether the model is used purely for
simulation or if it is used for diagnostics or control.
Defining the complexity of a model is inherently tricky, due to the
number of factors involved.</p>
        <p>
          Less complex models are often preferred either due to
their low computational simulation costs [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], or to
minimise model over-fitting given observed data[21; 22]. Given
the task of simulating a variable of interest conditioned by
certain future values of input (control) variables, overfitting
can lead to high uncertainty in creating accurate simulations.
Overfitting is especially severe when we have limited
observation variables for generating a model representing the
underlying process dynamics. In contrast, models with low
parameter dimensionality (i.e. fewer parameters) are
considered less complex and hence are associated with low
prediction uncertainty [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ].
        </p>
        <p>
          Several approaches have been used, based on issues like
(a) number of variables [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ], (b) model structure [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ], (c)
number of free parameters [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], (d) number of parameters
that the data can constrain [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], (e) a notion of model weight
[
          <xref ref-type="bibr" rid="ref27">27</xref>
          ], or (f) type and order of equations for a non-linear
dynamical model [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], where type corresponds to non-linear,
linear, etc.; e.g., order for a non-linear model is such that a
k-th order system has k-th derivates in E .
        </p>
        <p>Factors that contribute to the true cost of a model include:
(a) model-generation; (b) parameter estimation; and (c)
simulation complexity, i.e., the computational expense (in terms
of CPU-time and memory) needed to simulate the model
given a set of initial conditions Rather than try to
formulate this notion in terms of the number of model variables or
parameters, or a notion of model structural complexity, we
specify model complexity in terms of a measure based on
parameter estimation, and inference complexity, assuming a
construction cost of zero.</p>
        <p>
          A thorough analysis of model complexity will need to
take into consideration the model equation class, since
model complexity is class-specific. For example, for
nonlinear dynamical models, complexity is governed by the
type and order of equations [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. In contrast, for linear
dynamical models, which have only matrices and variables in
equations (no derivatives), it is the order of the matrices that
determines complexity. In this article, we assume that
models are of appropriate complexity, and hence do not address
Model order reduction techniques [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], which aim to
generate lower-dimensional systems that trade off fidelity for
reduced model complexity.
4.3
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>Diagnostics Model Selection Task</title>
        <p>The model in this model selection problem corresponds to
a system with a single mode. Given a space Φ of possible
models, we can define this model selection task as follows:
φ∗ = argmin g1
φ∈Φ</p>
        <p>R(Y˜ , Yφ)
+ g2 C(Y˜ , φ) ,
(7)
adopting the simplifying assumption that our loss function
g is additively decomposable.
4.4</p>
      </sec>
      <sec id="sec-4-4">
        <title>Information-Theoretic Model Complexity</title>
        <p>The Information-Theoretic (or Bayesian) model
complexity approach, which is based on the model likelihood,
measures whether the increased “complexity" of a model with
more parameters is justified by the data. The
InformationTheoretic approach chooses a model (and a model structure)
from a set of competing models (from the set of
corresponding model structures, respectively) such that the value of a
Bayesian criterion is maximized (or prediction uncertainty
in choosing a model structure is minimized).</p>
        <p>
          The Information-Theoretic approach addresses prediction
uncertainty by specifying an appropriate likelihood
function. In other words, it specifies the probability with which
the observed values of a variable of interest are generated
by a model. The marginal likelihood of a model structure,
which represents a class of models capturing the same
processes (and hence have the same parameter
dimensionality), is obtained by integrating over the prior distribution of
model parameters; this measures the prediction uncertainty
of the model structure [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ].
        </p>
        <p>Statistical model selection is commonly based on
Occam’s parsimony principle (ca.1320), namely that
hypotheses should be kept as simple as possible. In statistical terms,
this is a trade-off between bias (distance between the
average estimate and truth) and variance (spread of the estimates
around the truth).</p>
        <p>The idea is that by adding parameters to a model we
obtain improvement in fit, but at the expense of making
parameter estimates “worse"’ because we have less data (i.e.,
information) per parameter. In addition, the computations
typically require more time. So the key question is how to
identify how complex a model works best for a given
problem.</p>
        <p>If the goal is to compute the likelihood of a given model
φ(x0, θ, ξ, U ), then θ and U are nuisance parameters.
These parameters affect the likelihood calculation but are
not what we want to infer. Consequently, these parameters
should be eliminated from the inference. We can remove
nuisance parameters by assigning them prior probabilities
and integrating them out to obtain the marginal probability
of the data given only the model, that is, the model
likelihood (also called integrative, marginal, or predictive
likelihood). In equational form, this looks like: P (Y |φ) =
Rθ RU P (φ|Y , θ, U )P (θ, U |φ)dθdU . However, this
multidimensional integral can be very difficult to compute, and it
is typically approximated using computationally intensive
techniques like Markov chain Monte Carlo (MCMC).</p>
        <p>Rather than try to solve such a computationally
challenging task, we adopt an approximation to the
multidimensional integral. In the statistics literature several
decomposable approximations have been proposed.</p>
        <p>
          Spiegelhalter et al. [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] have proposed a well-known
such decomposable framework, termed the Deviance
Information Criterion (DIC), which measures the number of
model parameters that the data can constrain.: DI C =
D + pD, where D is a measure of fit (expected deviance),
and pD is a complexity measure, the effective number of
parameters. The Akaike Information Criterion (AIC) [29;
30] is another well-known measure: AI C = −2L(θˆ) + 2k,
where θˆ is the Maximum Likelihood Estimate (MLE) of θ
and k is the number of parameters.
        </p>
        <p>To compensate for small sample size n, a variant of AIC,
termed AICc, is typically used:</p>
        <p>AI Cc = −2L(θˆ) + 2k +
2k(k + 1)
(n − k − 1)
(8)</p>
        <p>
          Another computationally more tractable approach is the
Bayesian Information Criterion (BIC) [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]: BI C =
−2L(θˆ) + klogn, where k is the number of estimable
parameters, and n is the sample size (number of observations).
BIC was developed as an approximation to the log marginal
likelihood of a model, and therefore, the difference between
two BIC estimates may be a good approximation to the
natural log of the Bayes factor. Given equal priors for all
competing models, choosing the model with the smallest BIC is
equivalent to selecting the model with the maximum
posterior probability. BIC assumes that the (parameters’) prior is
the unit information prior (i.e., a multivariate normal prior
with mean at the maximum likelihood estimate and variance
equal to the expected information matrix for one
observation).
        </p>
        <p>
          Wagenmakers [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ] shows that one can convert the BIC
metric to
        </p>
        <p>BIC = n log</p>
        <p>+ k logn,
SSE</p>
        <p>SStotal
where SSE is the sum of squares for the error term. In our
experiments, we assume that the non-linear model is the
“correct" model (or the null hypothesis H0), and either the
linear or qualitative models are the competing model (or
alternative hypothesis H1). Hence what we do is use BIC to
compare the non-linear to each of the competing models.</p>
        <p>Suppose that we obtain the BIC values for the alternative
and the correct models, using the relevant SS terms. When
computing ΔBIC = BIC(H1) − BIC(H0), note that both
the null (H0) and the alternative hypothesis (H1) models
share the same SStotal term (both models attempt to explain
the same collection of scores), although they differ with
respect to SSE. The SStotal term common to both BIC values
cancels out in computing ΔBIC , producing
ΔBIC = n log SSE1 + (k1 − k0)logn,</p>
        <p>SSE0
(9)
where SSE1 and SSE0 are the sum of squares for the
error terms in the alternative and the null hypothesis models,
respectively.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experimental Design</title>
      <p>This section compares three tank benchmark models
according to various model-selection measures. We adopt as our
“correct" model the non-linear model. We will examine the
fidelity and complexity tradeoffs of two simpler models over
a selection of failure scenarios.</p>
      <p>The diagnostic task will be to compute the fault state
of the system, given an injected fault, which is one of
(ξN , ξB, ξP ), denoting nominal blocked and passing valves,
respectively. This translates to different tasks given the
different models.
non-linear model estimate the true value of κ1 given p1,
which corresponds to a most-likely failure mode
assignment of one of (ξN , ξB, ξP ).
linear model estimate the true value of κ1 given p1, which
corresponds to a most-likely failure mode assignment
of one of (ξN , ξB, ξP ).
qualitative model estimate the failure mode assignment of
one of (ξN , ξB, ξP ).
5.1</p>
      <sec id="sec-5-1">
        <title>Alternative Models</title>
        <p>This section describes the two alternative models that we
compare to the non-linear model, a linear and a qualitative
model.</p>
      </sec>
      <sec id="sec-5-2">
        <title>Linear Model</title>
        <p>
          We compare the non-linear model with a linearised version.
We can perform this linearised process in a variety of ways
[
          <xref ref-type="bibr" rid="ref33">33</xref>
          ]. In this simple tank example, we can perform the
linearisation directly through replacement of non-linear and
linear operators, as shown below.
        </p>
        <p>Nominal Model We can linearise the the non-linear
3-tank model by replacing the non-linear sub-function
phi − hj with the linear sub-function γij (hi − hj ), where
γij is a parameter (to be estimated) governing the flow
between tanks i and j. The linear model has 4 parameters,
γ12, γ12, γ23, γ3.</p>
        <p>Fault Model The fault model introduces a parameter βi
associated with κi, i.e., we replace κi with κi(1 + βi), i =
1
1, 2, 3, where −1 ≤ βi ≤ κi − 1, i = 1, 2, 3. This model
has 7 parameters, adding parameters β1, β2, β3.</p>
      </sec>
      <sec id="sec-5-3">
        <title>Qualitative Model</title>
        <p>
          Nominal Model For the model we replace the non-linear
sub-function phi − hj with the qualitative sub-function
M +(hi − hj ), where M + is the set of reasonable functions
f such that f 0 &gt; 0 on the interior of its domain [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ].
        </p>
        <p>The tank-heights are constrained to be non-negative, as
are the parameters κi. As a consequence, we can discretize
the hi to take on values {+, 0}, which means that M +(hi −
hj ) can take on values {+, 0, −}. The domain for ddht1 must
be {+, 0, −}, since the qualitative version of q0, Q is
nonnegative (domain of {+, 0}) and each M +(hi − hj ) can
take on values {+, 0, −}. We see that this model has no
parameters to estimate.</p>
        <p>Fault Model</p>
        <p>The qualitative fault model has different M + functions
for the modes where the valve is passing and blocked. We
derive these functions as follows. From a qualitative
perspective, the domain of βi is {0,+} for a passing valve, and
{-,0} for a blocked valve. To create a new M + function for
the cases of passing and blocked valve, we qualitatively
apply these corresponding domains to the standard M +
function with domain {-,0,+} to obtain fault-based M +
functions : MP+(hi − hj ) denotes the M + function when the
valve is passing, and MB+(hi − hj ) denotes the M +
function when the valve is blocked.
5.2</p>
      </sec>
      <sec id="sec-5-4">
        <title>Simulation Results</title>
        <p>We have compared the simulation performance of the
models under nominal and faulty conditions, considering faults
to individual valves V1, V2 and V3, as well as double-fault
combinations of the valves. In the following we present
some plots for simulations of faults and fault-isolation for
different model types.</p>
        <p>Figure 2 shows the results from a single-fault scenario,
where valve V1 is stuck at 50%) at t = 250, based on the
non-linear model. The plot from this simulation show that
at the time of the fault injection, the water level in tank T1
starts increasing while the water level at tanks T2 and T3
start decreasing due to the lower inflow.</p>
        <p>0
100
400</p>
        <sec id="sec-5-4-1">
          <title>Nominal</title>
          <p>V1-fault</p>
          <p>In contrast, Figure 4 shows the diagnostic accuracy and
isolation time with a linear model. First, note that there is
a false-positive identified early in the simulation, and the
model incorrectly identifies both valves 2 and 3 as being
faulty. This linear model thus delivers both poor
diagnostic accuracy (classification errors) and poor isolation time
(there is a lag between when the fault occurs and when
the model identifies the fault). After the fault injection at
t = 250 [s], the predictive accuracy improves and the
correct fault becomes the most likely fault.</p>
          <p>R_1
R_2</p>
          <p>R_3
100
400
500</p>
          <p>Figure 5 depicts the diagnostic performance with a mixed
linear/non-linear model (T1 is non-linear, while T2 and T3
are linear). The diagnostic accuracy is almost the same as
that of the non-linear model (cf. Figure 3), except for a
false-positive detection at the beginning of the scenario.
6</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Experimental Results</title>
      <p>This section describes our experimental results,
summarising the data first and then discussing the implications of the
results.</p>
      <sec id="sec-6-1">
        <title>6.1 Model Comparisons</title>
        <p>We have empirically compared the diagnostics performance
of several multi-tank models. In our first set of experiments,
we ran a simulation over 500 seconds, and induced a fault
(valve V1 at 50%) after 250 s. The model combinations
involved a non-linear (NL) model, a model (denoted M) with
tank T1 being linear (and other tanks non-linear), a fully
linear model (denoted L), and a Qualitative model (denoted
Q).</p>
        <p>
          To compare the relative performance of the models, we
compute a measure of diagnostics error (or loss), using the
difference between the true fault (which is known for each
simulation) and the computed fault. We denote the true fault
existing at time t using the pair (ω, t); the computed fault at
time t is denoted using the pair (ωˆ, tˆ). The inference system
that we use, LNG [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], computes an uncertainty measure
associated with each computed fault, denoted P (ωˆ). Hence,
we define a measure of diagnostics error over a time window
[0, T ] using
        </p>
        <p>T
γ1D = X X |P (ωˆt) − ωt|,
t=0 ξ∈Ξ
(10)
where Ξ is the set of failure modes for the model, and ωt
denotes ω at time t.</p>
        <p>Our second metric covers the fault latency, i.e., how
ˆ
quickly the model identifies the true fault(ω, t): γ2 = t − t.</p>
        <p>Table 2 summarises our results. The first columns
compare the number of parameters for the different models,
followed by comparisons of the error (γ1) and the CPU-time
(γ2). The data show that the error (γ1) does not grow very
much as we increase model size, but it increases as we
decrease model fidelity from non-linear through to qualitative
models. In contrast, the CPU-time (a) increases as we
increase model size, and (b) is proportional to model fidelity,
i.e., it decreases as we decrease model fidelity from
nonlinear through to qualitative models.</p>
        <p>In a second set of experiments, we focused on multiple
model types for a 3-tank system, with simulations running
over 50s, and we induced a fault (valve V1 at 50%) after 25 s.
The model combinations involved a non-linear (NL) model,
a model with tank 3 linear (and other tanks non-linear), a
model with tanks 2 and 3 linear and tank 1 non-linear, a fully
linear model, and a qualitative model. Table 3 summarises
our results.</p>
        <p>The data show that, as model fidelity decreases, the
error γ1 increases significantly and the inference timesγ2
decrease modestly. If we examine the outputs from AICc, we
see that the best model is the mixed model (T3-linear). BIC</p>
        <sec id="sec-6-1-1">
          <title>Tanks # Parameters</title>
          <p>γ1
γ2</p>
          <p>NL
M
L
Q
NL
M
L
Q
NL
M
L
Q
indicates the qualitative model as the best; it is worth noting
that BIC typically will choose the simplest model.
• The error γ1 increases with fault cardinality.
• The CPU-time γ2 increases with model size (i.e.,
number of tanks).</p>
          <p>This article has introduced a framework that can be used
to trade off the different factors governing MBD “accuracy".
We have shown how one can extend a set of
informationtheoretic metrics to combine these competing factors in
diagnostics model selection. Further work is necessary
to identify how best to extend the existing
informationtheoretic metrics to suit the needs of different diagnostics
applications, as it is likely that the “best" model may be
domain- and task-specific.</p>
          <p>It is important to note that we conducted experiments with
un-calibrated models, and we have ignored the cost of
calibration in this article. The literature suggests that linear
models can be calibrated to achieve good performance,
although performance inferior to that of calibrated non-linear
models. This class of qualitative models does not possess
calibration factors, so calibration will not improve their
performance.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Conclusions</title>
      <p>This article has presented a framework for evaluating the
competing properties of models, namely fidelity and
computational complexity. We have argued that model
performance needs to be evaluated over a range of future
observations, and hence we need a framework that considers the
expected performance. As such, information-theoretic
methods are well suited.</p>
      <p>We have proposed some information-theoretic metrics for
MBD model evaluation, and conducted some preliminary
experiments to show how these metrics may be applied.
This work thus constitutes a start to a full analysis of model
performance. Our intention is to initiate a more formal
analysis of modeling and model evaluation, since there is no
framework in existence for this task. Further, the
experiments are only preliminary, and are meant to demonstrate
how a framework can be applied to model comparison and
evaluation.</p>
      <p>Significant work remains to be done, on a range of fronts.
In particular, a thorough empirical investigation is needs on
diagnostics modeling. Second, the real-world utility of our
proposed framework needs to be determined. Third, a
theoretical study of the issues of mode-based parameter
estimation and its use for MBD is necessary.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>George EP Box. Statistics and science. J Am Stat Assoc</source>
          ,
          <volume>71</volume>
          :
          <fpage>791</fpage>
          -
          <lpage>799</lpage>
          ,
          <year>1976</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Peter</given-names>
            <surname>Struss</surname>
          </string-name>
          .
          <article-title>What's in SD? Towards a theory of modeling for diagnosis. Readings in model-based diagnosis</article-title>
          , pages
          <fpage>419</fpage>
          -
          <lpage>449</lpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Peter</given-names>
            <surname>Struss</surname>
          </string-name>
          .
          <article-title>Qualitative modeling of physical systems in AI research</article-title>
          .
          <source>In Artificial Intelligence and Symbolic Mathematical Computing</source>
          , pages
          <fpage>20</fpage>
          -
          <lpage>49</lpage>
          . Springer,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Nuno</given-names>
            <surname>Belard</surname>
          </string-name>
          , Yannick Pencolé, and
          <string-name>
            <given-names>Michel</given-names>
            <surname>Combacau</surname>
          </string-name>
          .
          <article-title>Defining and exploring properties in diagnostic systems</article-title>
          .
          <source>System</source>
          ,
          <volume>1</volume>
          :
          <fpage>R2</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Feldman</surname>
          </string-name>
          , Tolga Kurtoglu,
          <string-name>
            <given-names>Sriram</given-names>
            <surname>Narasimhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Scott</given-names>
            <surname>Poll</surname>
          </string-name>
          , and
          <string-name>
            <given-names>David</given-names>
            <surname>Garcia</surname>
          </string-name>
          .
          <article-title>Empirical evaluation of diagnostic algorithm performance using a generic framework</article-title>
          .
          <source>International Journal of Prognostics and Health Management</source>
          ,
          <volume>1</volume>
          :
          <fpage>24</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Steven</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Eppinger</surname>
            , Nitin R Joglekar, Alison Olechowski, and
            <given-names>Terence</given-names>
          </string-name>
          <string-name>
            <surname>Teo</surname>
          </string-name>
          .
          <article-title>Improving the systems engineering process with multilevel analysis of interactions</article-title>
          .
          <source>Artificial Intelligence for Engineering Design, Analysis and Manufacturing</source>
          ,
          <volume>28</volume>
          (
          <issue>04</issue>
          ):
          <fpage>323</fpage>
          -
          <lpage>337</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Sanjay</surname>
            <given-names>S Joshi</given-names>
          </string-name>
          and Gregory W Neat.
          <article-title>Lessons learned from multiple fidelity modeling of ground interferometer testbeds</article-title>
          .
          <source>In Astronomical Telescopes &amp; Instrumentation</source>
          , pages
          <fpage>128</fpage>
          -
          <lpage>138</lpage>
          .
          <source>International Society for Optics and Photonics</source>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Roxanne A Moore</surname>
          </string-name>
          ,
          <article-title>David A Romero, and Christiaan JJ Paredis. A rational design approach to gaussian process modeling for variable fidelity models</article-title>
          .
          <source>In ASME 2011 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference</source>
          , pages
          <fpage>727</fpage>
          -
          <lpage>740</lpage>
          . American Society of Mechanical Engineers,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Peter</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Hanlon</surname>
          </string-name>
          and
          <string-name>
            <surname>Peter S Maybeck</surname>
          </string-name>
          .
          <article-title>Multiplemodel adaptive estimation using a residual correlation Kalman filter bank</article-title>
          .
          <source>Aerospace and Electronic Systems</source>
          , IEEE Transactions on,
          <volume>36</volume>
          (
          <issue>2</issue>
          ):
          <fpage>393</fpage>
          -
          <lpage>406</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Redouane</surname>
            <given-names>Hallouzi</given-names>
          </string-name>
          , Michel Verhaegen, Robert Babuška, and
          <string-name>
            <given-names>Stoyan</given-names>
            <surname>Kanev</surname>
          </string-name>
          .
          <article-title>Model weight and state estimation for multiple model systems applied to fault detection and identification</article-title>
          .
          <source>In IFAC Symposium on System Identification (SYSID)</source>
          , Newcastle, Australia ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Amardeep</surname>
            <given-names>Singh</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Afshin</given-names>
            <surname>Izadian</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Sohel</given-names>
            <surname>Anwar</surname>
          </string-name>
          .
          <article-title>Fault diagnosis of Li-Ion batteries using multiplemodel adaptive estimation</article-title>
          .
          <source>In Industrial Electronics Society, IECON 2013-39th Annual Conference of the IEEE</source>
          , pages
          <fpage>3524</fpage>
          -
          <lpage>3529</lpage>
          . IEEE,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Amardeep</given-names>
            <surname>Singh</surname>
          </string-name>
          <string-name>
            <surname>Sidhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Afshin</given-names>
            <surname>Izadian</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Sohel</given-names>
            <surname>Anwar</surname>
          </string-name>
          .
          <article-title>Nonlinear Model Based Fault Detection of Lithium Ion Battery Using Multiple Model Adaptive Estimation</article-title>
          . In World Congress, volume
          <volume>19</volume>
          , pages
          <fpage>8546</fpage>
          -
          <lpage>8551</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Aki</surname>
            <given-names>Vehtari</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Janne</given-names>
            <surname>Ojanen</surname>
          </string-name>
          , et al.
          <article-title>A survey of bayesian predictive methods for model assessment, selection and comparison</article-title>
          .
          <source>Statistics Surveys</source>
          ,
          <volume>6</volume>
          :
          <fpage>142</fpage>
          -
          <lpage>228</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Athanasios</surname>
            <given-names>C Antoulas</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Danny C Sorensen</surname>
            , and
            <given-names>Serkan</given-names>
          </string-name>
          <string-name>
            <surname>Gugercin</surname>
          </string-name>
          .
          <article-title>A survey of model reduction methods for large-scale systems</article-title>
          .
          <source>Contemporary mathematics</source>
          ,
          <volume>280</volume>
          :
          <fpage>193</fpage>
          -
          <lpage>220</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Alexander</surname>
            <given-names>Feldman</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gregory M Provan</surname>
          </string-name>
          , and Arjan JC van Gemund.
          <article-title>Computing observation vectors for maxfault min-cardinality diagnoses</article-title>
          .
          <source>In AAAI</source>
          , pages
          <fpage>919</fpage>
          -
          <lpage>924</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Amardeep</surname>
            <given-names>Singh</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Afshin</given-names>
            <surname>Izadian</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Sohel</given-names>
            <surname>Anwar</surname>
          </string-name>
          .
          <article-title>Nonlinear model based fault detection of lithium ion battery using multiple model adaptive estimation</article-title>
          .
          <source>In 19th IFAC World Congress, Cape Town</source>
          , South Africa,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Youmin</given-names>
            <surname>Zhan</surname>
          </string-name>
          and
          <string-name>
            <given-names>Jin</given-names>
            <surname>Jiang</surname>
          </string-name>
          .
          <article-title>An interacting multiplemodel based fault detection, diagnosis and faulttolerant control approach</article-title>
          .
          <source>In Decision and Control</source>
          ,
          <source>1999. Proceedings of the 38th IEEE Conference on</source>
          , volume
          <volume>4</volume>
          , pages
          <fpage>3593</fpage>
          -
          <lpage>3598</lpage>
          . IEEE,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Peter</given-names>
            <surname>Struss</surname>
          </string-name>
          and
          <string-name>
            <given-names>Oskar</given-names>
            <surname>Dressler</surname>
          </string-name>
          .
          <article-title>" physical negation" integrating fault models into the general diagnostic engine</article-title>
          .
          <source>In IJCAI</source>
          , volume
          <volume>89</volume>
          , pages
          <fpage>1318</fpage>
          -
          <lpage>1323</lpage>
          ,
          <year>1989</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Johan De Kleer</surname>
          </string-name>
          ,
          <article-title>Alan K Mackworth,</article-title>
          and
          <string-name>
            <given-names>Raymond</given-names>
            <surname>Reiter</surname>
          </string-name>
          .
          <article-title>Characterizing diagnoses and systems</article-title>
          .
          <source>Artificial Intelligence</source>
          ,
          <volume>56</volume>
          (
          <issue>2</issue>
          ):
          <fpage>197</fpage>
          -
          <lpage>222</lpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Elizabeth</surname>
            <given-names>H Keating</given-names>
          </string-name>
          ,
          <article-title>John Doherty, Jasper A Vrugt, and Qinjun Kang. Optimization and uncertainty assessment of strongly nonlinear groundwater models with high parameter dimensionality</article-title>
          .
          <source>Water Resources Research</source>
          ,
          <volume>46</volume>
          (
          <issue>10</issue>
          ),
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Saket</surname>
            <given-names>Pande</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mac McKee</surname>
          </string-name>
          , and
          <article-title>Luis A Bastidas. Complexity-based robust hydrologic prediction</article-title>
          .
          <source>Water resources research</source>
          ,
          <volume>45</volume>
          (
          <issue>10</issue>
          ),
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            G
            <surname>Schoups</surname>
            , NC Van de
            <given-names>G</given-names>
            iesen
          </string-name>
          , and
          <string-name>
            <given-names>HHG</given-names>
            <surname>Savenije</surname>
          </string-name>
          .
          <article-title>Model complexity control for hydrologic prediction</article-title>
          .
          <source>Water Resources Research</source>
          ,
          <volume>44</volume>
          (
          <issue>12</issue>
          ),
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S</given-names>
            <surname>Pande</surname>
          </string-name>
          ,
          <string-name>
            <surname>L Arkesteijn</surname>
          </string-name>
          ,
          <article-title>HHG Savenije,</article-title>
          and
          <string-name>
            <given-names>LA</given-names>
            <surname>Bastidas</surname>
          </string-name>
          .
          <article-title>Hydrological model parameter dimensionality is a weak measure of prediction uncertainty</article-title>
          .
          <source>Natural Hazards and Earth System Sciences Discusions</source>
          ,
          <volume>11</volume>
          ,
          <year>2014</year>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Martin</surname>
            <given-names>Kunz</given-names>
          </string-name>
          , Roberto Trotta, and
          <string-name>
            <surname>David R Parkinson.</surname>
          </string-name>
          <article-title>Measuring the effective complexity of cosmological models</article-title>
          . Physical
          <string-name>
            <surname>Review</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <volume>74</volume>
          (
          <issue>2</issue>
          ):
          <fpage>023503</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Gregory</surname>
            <given-names>M Provan</given-names>
          </string-name>
          and
          <string-name>
            <given-names>Jun</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <article-title>Automated benchmark model generators for model-based diagnostic inference</article-title>
          .
          <source>In IJCAI</source>
          , pages
          <fpage>513</fpage>
          -
          <lpage>518</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>David J Spiegelhalter</surname>
          </string-name>
          , Nicola G Best,
          <article-title>Bradley P Carlin,</article-title>
          and
          <string-name>
            <surname>Angelika Van Der Linde</surname>
          </string-name>
          .
          <article-title>Bayesian measures of model complexity and fit</article-title>
          .
          <source>Journal of the Royal Statistical Society: Series B (Statistical Methodology)</source>
          ,
          <volume>64</volume>
          (
          <issue>4</issue>
          ):
          <fpage>583</fpage>
          -
          <lpage>639</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>Jing</given-names>
            <surname>Du</surname>
          </string-name>
          .
          <article-title>The “weight" of models and complexity</article-title>
          .
          <source>Complexity</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Jasper</surname>
            <given-names>A Vrugt</given-names>
          </string-name>
          and
          <article-title>Bruce A Robinson. Treatment of uncertainty using ensemble methods: Comparison of sequential data assimilation and bayesian model averaging</article-title>
          .
          <source>Water Resources Research</source>
          ,
          <volume>43</volume>
          (
          <issue>1</issue>
          ),
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>Hirotugu</given-names>
            <surname>Akaike</surname>
          </string-name>
          .
          <article-title>A new look at the statistical model identification</article-title>
          .
          <source>Automatic Control</source>
          , IEEE Transactions on,
          <volume>19</volume>
          (
          <issue>6</issue>
          ):
          <fpage>716</fpage>
          -
          <lpage>723</lpage>
          ,
          <year>1974</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Hirotugu</given-names>
            <surname>Akaike</surname>
          </string-name>
          .
          <article-title>Likelihood of a model and information criteria</article-title>
          .
          <source>Journal of econometrics</source>
          ,
          <volume>16</volume>
          (
          <issue>1</issue>
          ):
          <fpage>3</fpage>
          -
          <lpage>14</lpage>
          ,
          <year>1981</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>G.</given-names>
            <surname>Schwarz.</surname>
          </string-name>
          <article-title>Estimating the dimension of a model</article-title>
          . Ann. Statist.,
          <volume>6</volume>
          :
          <fpage>461</fpage>
          -
          <lpage>466</lpage>
          ,
          <year>1978</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Eric-Jan Wagenmakers</surname>
          </string-name>
          .
          <article-title>A practical solution to the pervasive problems ofp values</article-title>
          .
          <source>Psychonomic bulletin &amp; review</source>
          ,
          <volume>14</volume>
          (
          <issue>5</issue>
          ):
          <fpage>779</fpage>
          -
          <lpage>804</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>Pol</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Spanos</surname>
          </string-name>
          .
          <article-title>Linearization techniques for non-linear dynamical systems</article-title>
          .
          <source>PhD thesis</source>
          , California Institute of Technology,
          <year>1977</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>Benjamin</given-names>
            <surname>Kuipers</surname>
          </string-name>
          and
          <string-name>
            <given-names>Karl</given-names>
            <surname>Åström</surname>
          </string-name>
          .
          <article-title>The composition and validation of heterogeneous control laws</article-title>
          .
          <source>Automatica</source>
          ,
          <volume>30</volume>
          (
          <issue>2</issue>
          ):
          <fpage>233</fpage>
          -
          <lpage>249</lpage>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <surname>Alexander</surname>
            <given-names>Feldman</given-names>
          </string-name>
          , Helena Vicente de Castro, Arjan van Gemund,
          <string-name>
            <given-names>and Gregory</given-names>
            <surname>Provan</surname>
          </string-name>
          .
          <article-title>Model-based diagnostic decision-support system for satellites</article-title>
          .
          <source>In Proceedings of the IEEE Aerospace Conference</source>
          , Big Sky, Montana, USA, pages
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          ,
          <year>March 2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>