<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>VIAL-AD: Visual Interactive Labelling for Anomaly Detection - An approach and open research questions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andreas Theissler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anna-Lena Kraft</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Max Rudeck</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabian Erlenbusch</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aalen University of Applied Sciences</institution>
          ,
          <addr-line>73430 Aalen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>IT-Designers Gruppe</institution>
          ,
          <addr-line>73730 Esslingen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>84</fpage>
      <lpage>89</lpage>
      <abstract>
        <p>In anomaly detection problems the available data is often not or not fully labelled. This leads to results that are usually significantly worse than in balanced classification problems. In this short paper VIAL-AD is proposed, which addresses this problem with a sequence of unsupervised, semi-supervised and supervised machine learning models allowing a user to interactively label data points. This allows to move towards supervised anomaly detection, starting with unlabelled data. The approach is introduced and identified open research questions are discussed.</p>
      </abstract>
      <kwd-group>
        <kwd>visual interactive labelling</kwd>
        <kwd>VIAL</kwd>
        <kwd>anomaly detection</kwd>
        <kwd>human-centered machine learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        This work addresses machine learning-based anomaly detection (AD) [
        <xref ref-type="bibr" rid="ref1 ref6">6, 1</xref>
        ], where
the aim is to classify data points as either normal or anomaly based on a set of
features f . This can be achieved by training AD models on data that is (a)
unlabelled, (b) contains labelled normal data, or (c) contains labelled normal data
and anomalies. Applications of AD can be found in system health monitoring,
intrusion detection, fraud detection, and the analysis of medical data. One main
application field is data-driven fault detection, e.g. addressed in [
        <xref ref-type="bibr" rid="ref20 ref21">20, 21</xref>
        ].
      </p>
      <p>This work is motivated by the question of how we can compensate for the
lack of a labelled and representative data set in AD problems by incorporating
human knowledge in order to move to supervised AD. While there are statistical
or unsupervised ML methods to identify outliers, only a human expert can
decide whether a data point is a true anomaly for a given application. Therefore, it
suggests itself to incorporate the user in the process. We argue that for anomaly
detection, this is indeed even more crucial than for balanced classification
problems.</p>
      <p>
        In this paper visual interactive labelling for anomaly detection (VIAL-AD)
is proposed which – starting with unlabelled data – allows to iteratively move
from unsupervised to supervised anomaly detection [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This is achieved by a
© 2020 for this paper by its authors. Use permitted under CC BY 4.0.
combination of (1) a sequence of machine learning (ML) models with different
levels of supervision and (2) the incorporation of the user to interactively label
data. The idea is to use unsupervised AD to address the so-called cold-start
problem [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] in order to obtain an initial set of tentative labels. A sequence of
AD models is used to suggest labels and a human expert confirms or overrules
suggestions and labels data or regions in the feature space. We believe that
in AD, where it is unlikely to have a representative and labelled training set,
the user-in-the-loop is key to allow for the use of ML and move towards an
accuracy that allows for productive use. In order to validate the idea, a prototype
was implemented. Preliminary results are promising, however a number of open
research questions were uncovered and are discussed in Section 3.
      </p>
      <p>
        In the following, related work is briefly reviewed. Holzinger et al. showed
how a user in-the-loop with ML models can improve the overall performance of
a system [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. A generic process for visual interactive labelling was proposed by
Bernard et al. under the name of VIAL [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] it was shown that VIAL can
outperform pure active learning – specifically for two-class problems. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] AD
models were used for interactive labelling, however not with the aim to label
an AD data set. Trittenbach et al. discuss open research challenges for one-class
active learning [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], e.g. the cold-start problem.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>The approach: VIAL-AD</title>
      <p>
        VIAL-AD consists of the steps unsupervised, semi-supervised, and supervised AD
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] (see Fig. 1(a)), where in each step model and user collaborate as follows: The
model classifies the data and suggests the labels normal and anomaly. The user
inspects the data points and their labels and (a) adjusts model hyperparameters,
or (b) confirms/overrules the labels proposed by the model, or (c) visually labels
data points or regions. Following that, the user decides to move to the follow-up
step or to refine the labelling in the current step.
1. unsupervised AD: An unsupervised model (LOF [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] in current
implementation) suggests initial labels which are confirmed or overruled by the user.
Confirmed anomalies are moved to the two-class data set for the supervised
step and are excluded for the current step. The model continues to report
anomalies which are again evaluated by the user. In addition, regions in the
feature space can be marked as normal or anomaly resulting in the creation
of artificial data points. The result of the unsupervised step is a tentatively
labelled train set.
2. semi-supervised AD: The reduced data set from the unsupervised step is
used to train a one-class classifier (current prototype uses a OC-SVM [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]).
For each data point the classifier suggests the labels normal or anomaly.
These are processed analogously to the first step. A promising alternative is
to make use of the anomalies from step 1 using methods like SVDDneg [
        <xref ref-type="bibr" rid="ref10 ref18">18,
10</xref>
        ]. This step’s output is a labelled train set of normal data and anomalies,
where the anomaly class is, however, not likely to be representative.
3. supervised AD: As VIAL-AD is used on real data, more and more
anomalies are detected, so one can move to a supervised scenario. In addition to
the normal data, the previously labelled anomalies are used to form a
twoclass train set that becomes increasingly representative. Hence, a variety of
common ML models becomes applicable. In case of high class imbalance,
sampling methods should be applied [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. In the prototype ν-SVM is used.
      </p>
      <p>
        The central element of interaction is an interactive 2D-scatter plot as in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
(see Fig. 1(b)). This does however not limit the approach to two-dimensional
data – higher dimensional data can be projected [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] onto two dimensions.
Alternatively, visualisations for multi-dimensional data could be used. However,
they introduce a higher complexity for the user.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Open research questions</title>
      <p>
        Potential disruption caused by subsequent models: Different AD
models have differing underlying assumptions [
        <xref ref-type="bibr" rid="ref18 ref6">6, 18</xref>
        ] about anomalies. Some work
with probabilistic distributions, others with distances, densities (e.g. LOF [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]),
reconstruction errors (autoencoders), or the adaption of the maximum-margin
assumption to the one-class case (one-class SVMs [
        <xref ref-type="bibr" rid="ref16 ref18">18, 16</xref>
        ]). Hence, the use of
different models in subsequent steps can induce disruptions in the way labels are
suggested. A subsequent model could come up with a different labelling, which
is confusing for the user. This disruption is to be minimised.
      </p>
      <p>
        Visualisation-vs.-model dilemma: For data with &gt; 2 dimensions the user
is presented projected data, while the model may work on the original or on
projected data. However, a projection with the aim to optimally visualise the
data is not necessarily the optimal projection for the ML model to work on
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. The original space or alternative projection methods might be more
appropriate. Visualising and classifying different representations of the data can,
however, induce undesired effects. Wenskovitch et al. give an overview and
potential solutions are discussed in [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
      </p>
      <p>
        Problem of non-interpretability of projected space: In AD problems,
typically no representative set of anomalies exists. To compensate for that,
entire regions could be marked as anomalous based on expert knowledge. In the
original feature space these would be outlier values that can be clearly specified
by experts. As discussed, the potentially high-dimensional data can be projected
onto a lower dimensional space, the user can interact with. However, for many
projection methods the relation between the visualization and the original input
space is not obvious. This makes it difficult or even impossible for the user to
label unoccupied regions in the feature space. Projection methods like t-SNE
[
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] aim to preserve the neighbourhood between data points, however do not
preserve the properties in unpopulated regions. This creates a dilemma trying
to mark unpopulated regions as normal or anomaly : in contrast to working on
the original feature space, users do not have an intuition about where anomaly
regions in the projected space are, as the projected feature space can be distorted
and hardly interpretable [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. This problem can be addressed in several ways:
1. Avoid projections by using visualization methods for high-dimensional data,
which however makes interaction with the data more complex and does not
scale well for a high number of dimensions.
2. Show original data objects for selected data points. Data types, where single
data objects can be intuitively presented due to some order within the data
are predestined for that, e.g. images, time series, or text. High-dimensional
data in the form of independent feature vectors can, however, not easily be
represented in an intuitive way.
3. Investigate projection methods and interaction facilities in order to allow for
a user-friendly interaction [
        <xref ref-type="bibr" rid="ref8 ref9">9, 8</xref>
        ]
4. Let the user explore different projections, e.g. as proposed in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Problem of highly imbalanced data: In AD, the distribution of the classes
is typically highly imbalanced towards the normal class. As a consequence (a)
this poses particular challenges for the projection methods, and (b) it raises the
question if users will label the data accordingly.
      </p>
      <p>
        In projections, anomalies should be positioned well separated from normal
instances. Hence, an interesting issue is the sensitivity of projection methods
to outliers. Bernard et al. evaluated different projection methods in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. While
PCA’s sensitivity to outliers is considered promising [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] it is stated that
users prefer t-SNE [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. The appropriateness of a projection method can be
evaluated with a user study or using metrics specifying the readability of the
projections. In [
        <xref ref-type="bibr" rid="ref13 ref17">17, 13</xref>
        ] ways to measure this readability are discussed.
      </p>
      <p>The second question raised by the class imbalance is if users will label the
data accordingly: On the one hand, users might run into the risk of overlooking
anomalies due to their rareness. On the other hand, it might be that users
overestimate the anomaly class, labelling too many data points as anomalous.
Risk of manual overfitting: Furthermore, an identified challenge is the risk of
what we call “manual overfitting”. In supervised ML, overfitting is addressed e.g.
with regularization terms – preventing the model from too naively overfitting the
data. However, with the user in-the-loop and with direct control over class labels,
such a naive overfitting may take place: The user might be tempted to process the
data set in such a way to achieve optimal accuracy as opposed to strictly applying
domain knowledge to distinguish between normal or anomalous data points or
regions in the feature space. This could be addressed with a – potentially high
number – of blind test sets. Even after testing, this data should not be made
available to the expert in order not to overfit towards the test set. Ideally, a test
set should only be used once to evaluate performance. Another option would be
the introduction of some regularization method, putting reasonable constraints
on the user actions.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>This paper discussed how the incorporation of humans can compensate for the
lack of labelled data in anomaly detection. The proposed approach uses
unsupervised AD on an initially unlabelled data set and lets the user confirm or overrule
decisions. After having interactively processed the data set in collaboration with
unsupervised AD models, the user can move to semi-supervised or supervised
models. The key benefit of VIAL-AD is, that it allows to move towards
supervised ML where it was previously not applicable due to the lack of labelled data.
This is a problem often encountered in industry where data is recorded for a
different purpose and the opportunities of applying ML are discovered later. While
preliminary experiments indicate the applicability of VIAL-AD, open research
questions were identified which will be addressed in future work. Following that,
the goal is to evaluate VIAL-AD in a systematic user study and to apply it in a
real-world case study.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>1. An experimental evaluation of novelty detection methods</article-title>
          .
          <source>Neurocomputing</source>
          <volume>135</volume>
          ,
          <fpage>313</fpage>
          -
          <lpage>327</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bernard</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dobermann</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fellner</surname>
            ,
            <given-names>D.W.:</given-names>
          </string-name>
          <article-title>Combining cluster and outlier analysis with visual analytics</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bernard</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hutter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zeppelzauer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fellner</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Comparing Visual-Interactive Labeling with Active Learning: An Experimental Study</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>24</volume>
          (
          <issue>1</issue>
          ),
          <fpage>298</fpage>
          -
          <lpage>308</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bernard</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zeppelzauer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aigner</surname>
            ,
            <given-names>W.:</given-names>
          </string-name>
          <article-title>VIAL: a unified process for visual interactive labeling</article-title>
          .
          <source>The Visual Computer</source>
          <volume>34</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1189</fpage>
          -
          <lpage>1207</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Breunig</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kriegel</surname>
            ,
            <given-names>H.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
          </string-name>
          , R.T.,
          <string-name>
            <surname>Sander</surname>
            ,
            <given-names>J.: LOF</given-names>
          </string-name>
          :
          <article-title>Identifying Density-Based Local Outliers</article-title>
          .
          <source>In: SIGMOD Conference</source>
          . pp.
          <fpage>93</fpage>
          -
          <lpage>104</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Chandola</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banerjee</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Anomaly detection: A survey</article-title>
          .
          <source>ACM Computing Surveys</source>
          <volume>41</volume>
          (
          <issue>3</issue>
          ),
          <volume>15</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          :
          <fpage>58</fpage>
          (Jul
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Cutura</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holzer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aupetit</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>VisCoDeR: A tool for visually comparing dimensionality reduction algorithms</article-title>
          .
          <source>In: 26th European Symposium on Artificial Neural Networks, ESANN</source>
          <year>2018</year>
          , Bruges, Belgium,
          <source>April 25-27</source>
          ,
          <year>2018</year>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Endert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fiaux</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , North,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Semantic interaction for sensemaking: Inferring analytical reasoning for model steering</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>18</volume>
          (
          <issue>12</issue>
          ),
          <fpage>2879</fpage>
          -
          <lpage>2888</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Faust</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glickenstein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scheidegger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>DimReader: Axis lines that explain non-linear projections</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ),
          <fpage>481</fpage>
          -
          <lpage>490</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. G¨ornitz, N.,
          <string-name>
            <surname>Kloft</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rieck</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brefeld</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Toward supervised anomaly detection</article-title>
          .
          <source>J. Artif. Int. Res</source>
          .
          <volume>46</volume>
          (
          <issue>1</issue>
          ),
          <fpage>235</fpage>
          -
          <lpage>262</lpage>
          (
          <year>Jan 2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Holzinger</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Plass</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kickmeier-Rust</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holzinger</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Cri¸san, G.C.,
          <string-name>
            <surname>Pintea</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palade</surname>
            ,
            <given-names>V.:</given-names>
          </string-name>
          <article-title>Interactive machine learning: experimental evidence for the human in the algorithmic loop</article-title>
          .
          <source>Applied Intelligence</source>
          <volume>49</volume>
          (
          <issue>7</issue>
          ),
          <fpage>2401</fpage>
          -
          <lpage>2414</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Krawczyk</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Learning from imbalanced data: open challenges and future directions</article-title>
          .
          <source>Progress in Artificial Intelligence</source>
          pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Micallef</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmas</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oulasvirta</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weinkauf</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Towards Perceptual Optimization of the Visual Design of Scatterplots</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>23</volume>
          (
          <issue>6</issue>
          ),
          <fpage>1588</fpage>
          -
          <lpage>1599</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Sacha</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peltonen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weiskopf</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , North,
          <string-name>
            <given-names>S.C.</given-names>
            ,
            <surname>Keim</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.A.</surname>
          </string-name>
          :
          <article-title>What you see is what you can change: Human-centered machine learning by interactive visualization</article-title>
          .
          <source>Neurocomputing</source>
          <volume>268</volume>
          ,
          <fpage>164</fpage>
          -
          <lpage>175</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Sacha</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peltonen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weiskopf</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , North,
          <string-name>
            <given-names>S.C.</given-names>
            ,
            <surname>Keim</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.A.</surname>
          </string-name>
          :
          <article-title>Visual Interaction with Dimensionality Reduction: A Structured Literature Analysis</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>23</volume>
          (
          <issue>1</issue>
          ),
          <fpage>241</fpage>
          -
          <lpage>250</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. Scho¨lkopf,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Platt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.C.</given-names>
            ,
            <surname>Shawe-Taylor</surname>
          </string-name>
          , J.C.,
          <string-name>
            <surname>Smola</surname>
            ,
            <given-names>A.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williamson</surname>
          </string-name>
          , R.C.
          <article-title>: Estimating the support of a high-dimensional distribution</article-title>
          .
          <source>Neural Computation</source>
          <volume>13</volume>
          ,
          <fpage>1443</fpage>
          -
          <lpage>1471</lpage>
          (
          <year>July 2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Sedlmair</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tatu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Munzner</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tory</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A taxonomy of visual cluster separation factors</article-title>
          .
          <source>Computer Graphics Forum</source>
          <volume>31</volume>
          (
          <issue>3pt4</issue>
          ),
          <fpage>1335</fpage>
          -
          <lpage>1344</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Tax</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          :
          <article-title>One-class classification. Concept-learning in the absence of counterexamples</article-title>
          .
          <source>Ph.D. thesis</source>
          , Delft University of Technology (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Theissler</surname>
          </string-name>
          , A.:
          <article-title>Multi-class novelty detection in diagnostic trouble codes from repair shops</article-title>
          .
          <source>In: 2017 IEEE 15th International Conference on Industrial Informatics (INDIN)</source>
          . pp.
          <fpage>1043</fpage>
          -
          <lpage>1049</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Theissler</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Detecting anomalies in multivariate time series from automotive systems</article-title>
          .
          <source>Ph.D. thesis</source>
          , Brunel University London (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Theissler</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Detecting known and unknown faults in automotive systems using ensemble-based anomaly detection</article-title>
          .
          <source>Knowledge-Based Systems 123(C)</source>
          ,
          <volume>163</volume>
          -
          <fpage>173</fpage>
          (May
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Trittenbach</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Englhardt</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , B¨ohm, K.:
          <article-title>Validating one-class active learning with user studies - A prototype and open challenges</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>van der Maaten</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
          </string-name>
          , G.:
          <article-title>Visualizing Data using t-SNE</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          <volume>9</volume>
          (
          <issue>11</issue>
          ),
          <fpage>2579</fpage>
          -
          <lpage>2605</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Wenskovitch</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crandell</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramakrishnan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>House</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , North,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Towards a systematic combination of dimension reduction and clustering in visual analytics</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>24</volume>
          (
          <issue>1</issue>
          ),
          <fpage>131</fpage>
          -
          <lpage>141</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>