<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>EasyMiner - Short History of Research and Current Development</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tomáš Kliegr</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jaroslav Kucharˇ</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stanislav Vojírˇ</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Václav Zeman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information and Knowledge Engineering, Faculty of Informatics and Statistics, University of Economics</institution>
          ,
          <addr-line>Prague, W. Churchill Sq. 4, Prague 3</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Web Intelligence Research Group, Faculty of Information Technology, Czech Technical University</institution>
          ,
          <addr-line>Thákurova 9, 160 00, Prague 6</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <volume>1885</volume>
      <fpage>235</fpage>
      <lpage>239</lpage>
      <abstract>
        <p>EasyMiner (easyminer.eu) is an academic data mining project providing data mining of association rules, building of classification models based on association rules and outlier detection based on frequent pattern mining. It differs from other data mining systems by adapting the “web search” paradigm. It is web-based, providing both a REST API and a user interface, and puts emphasis on interactivity, simplicity of user interface and immediate response. This paper will give an overview of research related to the EasyMiner project.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>In this paper, we present the history of research and
development of the EasyMiner project http://easyminer.
eu. EasyMiner is an academic data mining project
providing data mining of association rules, building of
classification models based on association rules and outlier
detection based on frequent pattern mining.</p>
      <p>EasyMiner was to our knowledge the first interactive
web-based data mining system that supported the
complete machine learning process. While today there are
several web-based machine learning systems on the market1,
owing to continuous development EasyMiner provides
distinct user experience. While most existing machine
learning systems offer versatile user interfaces, where the
user has to in some way for each task compose a new
machine learning workflow, in EasyMiner the user interface is
crafted to provide the “web search” experience. The user
visually constructs a query against the data, and the
system responds with a set of interesting patterns (presented
as rules) or a classifier (Figure 1).</p>
      <p>Over the years of development, EasyMiner served as
a testbed for a number of new technologies and research
ideas. The purpose of this paper is to give a brief overview
of this research.</p>
      <p>This paper is organized as follows. Section 2 is focused
on SEWEBAR-CMS, the predecessor of EasyMiner, used
in research on the use of domain knowledge in data
mining. Section 3 focuses on association rule discovery.
Section 4 presents the adaptation of EasyMiner for learning
business rules and Section 5 consequently for association
rule classification. Section 6 presents the current focus on
outlier detection. The architecture of the system is
presented in Section 7. Since the beginnings, the research
was accompanied with standardization efforts, which are
presented in Section 8. The current development efforts
focus also on distributed computation platforms – this is
covered in Section 9. Section 10 provides an overview of
the features that were at some point in time developed as
well as of those that are supported by the current version
of EasyMiner. Finally, the conclusions present a case for
using EasyMiner as a component in new project requiring
data mining functionality and refers the interested reader
to other publications regarding comparison with other
machine learning as a service (MLaaS) systems.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Handling of Domain Knowledge</title>
      <p>
        EasyMiner evolved from the SEWEBAR (SEmantic-WEB
Analytical Reports) project, which focused on
semantically readable machine learning. In [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], we presented
SEWEBAR-CMS as a set of extensions for the Joomla!
content management system (CMS) that extends it with
functionality required to serve as a communication
platform between the data analyst, domain expert and the
report user. The system later supported elicitation of
domain knowledge from the analyst [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Association rules
discovered from data with the LISp-Miner system (http:
//lispminer.vse.cz) were stored in a semantic form in
the SEWEBAR-CMS system. The background knowledge
was used to help answer user search queries, for example,
to find rules that are contradicting existing domain
knowledge [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Another novel element in the system was the use
of ontology for representation of the data mining domain.
      </p>
      <p>
        Related research focused on improving semantic
capabilities of content management systems [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and on
designing ontologies and schemata for representation of
background knowledge [
        <xref ref-type="bibr" rid="ref11 ref8">8, 11</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Association Rule Discovery</title>
      <p>
        In its first release, EasyMiner provided a web-based
interface for the LISp-Miner system, which was used for
association rule mining [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. EasyMiner interacted with
LISp-Miner using its LM-Connect component, which is a
web application providing the functionality of LISp-Miner
through REST API.
      </p>
      <p>EasyMiner with LISp-Miner backend offered several
unique features: 1. negation on attributes, 2. disjunction
between attributes, 3. subpatterns allowing for scoping
logical connectives, 4. multiple interest measures (called
quantifiers in GUHA), 5. mines directly on multivalued
attributes, no need to create "items", 6. dynamic binning
operators (called coefficients in GUHA), 7. PMML-based
import and export, 8. grid support.</p>
      <p>Since LM-Connect component is no longer developed
and maintained, the integration of the current version of
EasyMiner and LISp-Miner is thus currently not working.2</p>
      <p>
        The current version of EasyMiner primarily relies on the
R arules package [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], which wraps a C implementation of
the apriori association rule mining algorithm [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Learning Business Rules</title>
      <p>
        One of the first use cases for EasyMiner was learning
business rules. In [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] we presented a software module for
2It should be noted that all the features list above can be used directly
from the LISp-Miner system.
      </p>
      <p>
        EasyMiner, which allows to export selected rules to
Business Rules Management System (BRMS) Drools,
transforming the output of association rule learning into the
DRL format supported by Drools. We found that the main
obstacles for a straightforward use of association rules as
candidate business rules are the excessive number of rules
discovered even on small datasets, and the fact that
contradicting rules are generated. In [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] we propose that a
potential solution to these problems is provided by the seminal
association rule classification algorithm CBA [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
we presented a software module for EasyMiner, which
allows the domain expert to edit the discovered rules.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Association Rule Based Classification</title>
      <p>
        In [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] we started to use the CBA algorithm for
postprocessing association rule learning results into a classifier.
In [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] we presented an extension for EasyMiner for
building of classification models. A benchmark against
standard symbolic classification algorithms on a news
recommender task was presented in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>Outlier Detection</title>
      <p>
        The most recent addition of new tasks supported by
EasyMiner is frequent pattern-based anomaly (outlier)
detection. The main idea of the approach is that if an instance
contains more frequent patterns, it is unlikely to be an
anomaly. The presence or absence of the frequent patterns
is then used to assign the deviation level [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] we
present extension of EasyMiner REST API with our
innovated outlier detection algorithm called Frequent Pattern
Isolation (FPI)[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] that is inspired by an existing
algorithm called Isolation Forests (IF) [
        <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
        ]. Since PMML
does not yet support outlier (anomaly) detection, in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]
we present our proposal for a new PMML outlier model.
The goal of our work was to design modular solution that
would support broader range of anomaly detection
algorithms including our FPI method.
7
      </p>
    </sec>
    <sec id="sec-7">
      <title>EasyMiner Architecture</title>
      <p>During the development of EasyMiner system, its
architecture was transformed to multiple reusable web services.
A schema of the architecture is shown in Figure 2. All the
services are fully documented in Swagger.
using EasyMiner-Data using user-defined preprocessing
methods. The attributes for data mining are created from
uploaded data fields using one of these preprocessing
algorithms: each value-one bin, enumeration of intervals,
enumeration of nominal values, equidistant intervals,
equifrequent intervals, equisized intervals (by minimal support of
every interval). The preprocessing algorithms as well as
data storage are independent of the selected data mining
algorithm. The implemented web services support
hashing functionality to avoid potentially problems with
special characters in attribute names and its values. The
mining following services work on the “safe” datasets with
hashed values.</p>
      <p>The main data mining functionality is provided by the
service EasyMiner-Miner. This web service provides
association rule learning, prunning of discovered association
rule sets and building of classification models and outlier
detection. EasyMiner-Miner initializes execution of used
R packages and another algorithms.</p>
      <p>EasyMiner-Scorer is a web service for testing of
classification models based on association rules.
8</p>
    </sec>
    <sec id="sec-8">
      <title>Distributed Backend: Spark/Hadoop</title>
      <p>
        As laid out in the previous section, EasyMiner is
modular in terms of mining backends. In addition to the default
mining backend provided by the arules and rCBA
packages, EasyMiner supports an alternate one built on top of
Apache Spark/Hadoop introduced in [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
      </p>
      <p>The Spark backend is suitable for larger datasets, which
can benefit from parallel computation distributed over
multiple machines. The Spark backend also uses
FPGrowth frequent pattern mining algorithm instead of
apriori. FP-Growth is generally considered as faster than
apriori. However, for smaller datasets using apriori with the
R backend is recommended as it provides faster response
times, due to the ability of the implementation to provide
intermediate results as the mining progresses.</p>
      <p>The central component (service) is EasyMinerCenter.
This component integrates the functionality of other
services and provides the main graphical web interface and
REST API for end users. Internally, this component
provides user account and task management, stores
discovered association rules and works as authentication service
for other components.</p>
      <p>For storing and preparing data before mining, the
system uses services EasyMiner-Data and
EasyMinerPreprocessing. EasyMiner-Data is a web services for
management of data sources. It supports upload of data
files in CSV and RDF and stores them into databases as
the set of transactions. EasyMiner-Preprocessing
service supports creation of datasets from data sources stored
Already the earliest research related to EasyMiner was
linked to work on standardization efforts. While
association rules were supported already in the early versions
of PMML, the industry standard format for exchange of
data mining models, the GUHA method that was initially
used did not comply to this standard, since it produced
rules containing number of constructs not supported by
PMML. Since our research involved background
knowledge elicited from domain experts, definition of data
format supporting this type of knowledge was also required.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] we proposed a topic map-based ontology for
association rule learning, which was based on the GUHA
method and in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] an extension of this approach that
dealt with domain knowledge. An extension of PMML for
GUHA-based models was presented in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and for
handling of background knowledge [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Neither of these
efforts was successful – the ISO Topic Maps standard waded
in favour of the W3C RDF/OWL stack. The industry was
not concerned with exchange of background knowledge at
the time, and support of GUHA method, implemented
essentially only by the LISp-Miner system, increased
complexity of the models as opposed to the existing PMML
association rule models.3 Our latest standardization effort
is related to outlier detection [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and targets PMML. This
proposal is closes industry adoption as it was included into
a roadmap for the next release of PMML.
10
      </p>
    </sec>
    <sec id="sec-9">
      <title>Features in the EasyMiner Version 2.4</title>
      <p>Table 1 presents an overview of the most salient
features that were in some for published between 2009, when
the first paper on EasyMiner’s predecessor
SEWEBARCMS appeared, and 2017, when the current version of
EasyMiner was released. As follows from the table, a
number of features is not supported in the current release.
11</p>
    </sec>
    <sec id="sec-10">
      <title>Using EasyMiner in Your Project</title>
      <p>During the years of development, EasyMiner was
extensively used by over thousand of students at the Faculty of
Informatics and Statistics to complete their assignments in
association rule learning. The software has also been used
in several applied research projects. For example, within
the linkedtv.eu project EasyMiner was used to analyze
user preferences and within the openbudgets.eu project
to analyze budgetary data.</p>
      <p>
        The full project is based on composition of components
and services with fully documented REST APIs. Most
of the components and services4 are available under open
source Apache License, Version 2.0. This is an
important factor which differentiates EasyMiner from the
commercial MLaaS offerings. For a more detailed comparison
with other machine learning systems refer to [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
      </p>
      <p>In addition to the visual web-based interface, the project
exposes a REST API. This API provides full functionality
of EasyMiner, including also functions, which are not yet
available in the GUI. It is possible to use this API to
extend your own project by data mining functionality. It is
suitable for building of mashup applications or data
processing using script languages. An example of data
mining using API is available at http://www.easyminer.
eu/api-tutorial.</p>
      <p>EasyMiner can also be extended with new algorithms
- rule mining, outlier detection or scorer service. For this
purpose, the integration component EasyMinerCenter
provides documented interfaces in PHP.</p>
      <p>3Currently, EasyMiner supports export of association rule models in
formats GUHA PMML also as in standard form PMML 4.3 Association
Rules.</p>
      <p>4The main services were presented in section 7.</p>
    </sec>
    <sec id="sec-11">
      <title>Acknowledgment</title>
      <p>This paper was supported by IGA grant 29/2016 of the
University of Economics, Prague.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Rakesh</given-names>
            <surname>Agrawal</surname>
          </string-name>
          , Tomasz Imielinski, and
          <string-name>
            <surname>Arun</surname>
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Swami</surname>
          </string-name>
          .
          <article-title>Mining association rules between sets of items in large databases</article-title>
          .
          <source>In SIGMOD</source>
          , pages
          <fpage>207</fpage>
          -
          <lpage>216</lpage>
          . ACM Press,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Hahsler</surname>
          </string-name>
          , Sudheer Chelluboina, Kurt Hornik, and
          <string-name>
            <given-names>Christian</given-names>
            <surname>Buchta</surname>
          </string-name>
          .
          <article-title>The arules r-package ecosystem: analyzing interesting patterns from large transaction data sets</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>12</volume>
          (Jun):
          <fpage>2021</fpage>
          -
          <lpage>2025</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Andrej</given-names>
            <surname>Hazucha</surname>
          </string-name>
          , Jakub Balhar, and
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          .
          <article-title>A PHP library for Ontopia-CMS integration</article-title>
          .
          <source>In TMRA 2010</source>
          . University of Leipzig,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Z.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Deng.</surname>
          </string-name>
          FP-outlier:
          <article-title>Frequent pattern based outlier detection</article-title>
          .
          <source>Computer Science and Information Systems/ComSIS</source>
          , 2(
          <issue>1</issue>
          ):
          <fpage>103</fpage>
          -
          <lpage>118</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jaroslav</surname>
            <given-names>Kucharˇ</given-names>
          </string-name>
          , Davide Sottara, and
          <article-title>Stanislav Vojírˇ. Learning business rules with association rule classifiers</article-title>
          . In Antonis Bikakis, Paul Fodor, and Dumitru Roman, editors,
          <source>Rules on the Web. From Theory to Applications: 8th International Symposium, RuleML</source>
          <year>2014</year>
          ,
          <article-title>Co-located with the 21st</article-title>
          <source>European Conference on Artificial Intelligence, ECAI</source>
          <year>2014</year>
          , Prague, Czech Republic,
          <source>August 18-20</source>
          ,
          <year>2014</year>
          . Proceedings, pages
          <fpage>236</fpage>
          -
          <lpage>250</lpage>
          , Cham,
          <year>2014</year>
          . Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          , Andrej Hazucha, and
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Marek</surname>
          </string-name>
          .
          <article-title>Instant feedback on discovered association rules with PMMLbased query-by-example</article-title>
          .
          <source>In Web Reasoning and Rule Systems</source>
          . Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          and
          <article-title>Jaroslav Kucharˇ. Benchmark of rulebased classifiers in the news recommendation task</article-title>
          .
          <source>In Josiane Mothe</source>
          , Jacques Savoy, Jaap Kamps,
          <string-name>
            <surname>Karen</surname>
            <given-names>PinelSauvagnat</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gareth</surname>
            <given-names>J. F.</given-names>
          </string-name>
          <string-name>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <surname>Eric</surname>
            <given-names>SanJuan</given-names>
          </string-name>
          , Linda Cappellato, and Nicola Ferro, editors,
          <source>Experimental IR Meets Multilinguality</source>
          , Multimodality, and Interaction - 6th
          <source>International Conference of the CLEF Association, CLEF</source>
          <year>2015</year>
          , Toulouse, France, September 8-
          <issue>11</issue>
          ,
          <year>2015</year>
          , Proceedings, volume
          <volume>9283</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>130</fpage>
          -
          <lpage>141</lpage>
          . Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          , Marek Ovecˇka, and Jan Zemánek.
          <article-title>Topic maps for association rule mining</article-title>
          .
          <source>In Proceedings of TMRA 2009</source>
          . University of Leipzig,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          , Martin Ralbovský, Vojteˇch Svátek, Milan Šimunek, Vojteˇch Jirkovský, Jan Nemrava, and
          <string-name>
            <given-names>Jan</given-names>
            <surname>Zemánek</surname>
          </string-name>
          .
          <article-title>Semantic analytical reports: A framework for post-processing data mining results</article-title>
          .
          <source>In ISMIS'09: 18th International Symposium on Methodologies for Intelligent Systems</source>
          , pages
          <fpage>453</fpage>
          -
          <lpage>458</lpage>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          and
          <string-name>
            <given-names>Jan</given-names>
            <surname>Rauch</surname>
          </string-name>
          .
          <article-title>An XML format for association rule models based on the GUHA method</article-title>
          .
          <source>In Proceedings of the 2010 International Conference on Semantic Web Rules</source>
          ,
          <source>RuleML'10</source>
          , pages
          <fpage>273</fpage>
          -
          <lpage>288</lpage>
          , Berlin, Heidelberg,
          <year>2010</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Tomáš</surname>
            <given-names>Kliegr</given-names>
          </string-name>
          , Vojteˇch Svátek, Milan Šim˚unek, Daniel Štastný, and
          <string-name>
            <given-names>Andrej</given-names>
            <surname>Hazucha</surname>
          </string-name>
          .
          <article-title>An XML schema and a topic map ontology for formalization of background knowledge in data mining</article-title>
          .
          <source>In IRMLeS-2010, 2nd ESWC Workshop on Inductive Reasoning and Machine Learning for the Semantic Web</source>
          , Heraklion, Crete, Greece,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Tomáš</surname>
            <given-names>Kliegr</given-names>
          </string-name>
          , Vojteˇch Svátek, Milan Šim˚unek, and
          <string-name>
            <given-names>Martin</given-names>
            <surname>Ralbovský</surname>
          </string-name>
          .
          <article-title>Semantic analytical reports: A framework for post-processing of data mining results</article-title>
          .
          <source>Journal of Intelligent Information Systems</source>
          ,
          <volume>37</volume>
          (
          <issue>3</issue>
          ):
          <fpage>371</fpage>
          -
          <lpage>395</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Tomáš</surname>
            <given-names>Kliegr</given-names>
          </string-name>
          , Stanislav Vojírˇ, and Jan Rauch.
          <article-title>Background knowledge and PMML: first considerations</article-title>
          .
          <source>In Proceedings of the 2011 workshop on Predictive markup language modeling</source>
          ,
          <source>PMML '11</source>
          , pages
          <fpage>54</fpage>
          -
          <lpage>62</lpage>
          , New York, NY, USA,
          <year>2011</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Jaroslav</surname>
            <given-names>Kucharˇ</given-names>
          </string-name>
          , Adam Ashenfelter, and
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          .
          <article-title>Outlier (anomaly) detection modelling in PMML</article-title>
          .
          <source>In RuleML 2017 Poster and Challenge Proceedings. CEURWS</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Jaroslav</given-names>
            <surname>Kucharˇ and Vojteˇch Svátek</surname>
          </string-name>
          .
          <article-title>Spotlighting anomalies using frequent patterns</article-title>
          .
          <source>In KDD 2017 Workshop on Anomaly Detection in Finance</source>
          , Halifax, Nova Scotia, Canada,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Bing</surname>
            <given-names>Liu</given-names>
          </string-name>
          , Wynne Hsu, and Yiming Ma.
          <article-title>Integrating classification and association rule mining</article-title>
          .
          <source>In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, KDD'98</source>
          , pages
          <fpage>80</fpage>
          -
          <lpage>86</lpage>
          . AAAI Press,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>F. T.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. M.</given-names>
            <surname>Ting</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z. H.</given-names>
            <surname>Zhou</surname>
          </string-name>
          .
          <article-title>Isolation forest</article-title>
          .
          <source>In Proceedings of the 8th IEEE International Conference on Data Mining (ICDM'08)</source>
          , pages
          <fpage>413</fpage>
          -
          <lpage>422</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Fei</given-names>
            <surname>Tony</surname>
          </string-name>
          <string-name>
            <given-names>Liu</given-names>
            , Kai Ming Ting, and
            <surname>Zhi-Hua Zhou</surname>
          </string-name>
          .
          <article-title>Isolationbased anomaly detection</article-title>
          .
          <source>ACM Trans. Knowl. Discov. Data</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ):3:
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          :
          <fpage>39</fpage>
          ,
          <string-name>
            <surname>March</surname>
          </string-name>
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Stanislav</surname>
            <given-names>Vojírˇ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaroslav</surname>
            <given-names>Kucharˇ</given-names>
          </string-name>
          , Václav Zeman, and
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          .
          <article-title>Using easyminer API for financial data analysis in project openbudgets.eu</article-title>
          .
          <source>In RuleML 2017 Poster and Challenge Proceedings. CEUR-WS</source>
          ,
          <year>2017</year>
          . To appear.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Stanislav</surname>
            <given-names>Vojírˇ</given-names>
          </string-name>
          , Prˇemysl Václav Duben, and
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          .
          <article-title>Business rule learning with interactive selection of association rules</article-title>
          .
          <source>RuleML Challenge</source>
          ,
          <year>2014</year>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Stanislav</surname>
            <given-names>Vojírˇ</given-names>
          </string-name>
          , Tomáš Kliegr, Andrej Hazucha, Radek Skrabal, and
          <article-title>Milan Šim˚unek. Transforming association rules to business rules: EasyMiner meets Drools</article-title>
          . In Paul Fodor, Dumitru Roman, Darko Anicic, Adam Wyner, Monica Palmirani, Davide Sottara, and Francois Lévy, editors,
          <source>RuleML-2013 Challenge</source>
          , volume
          <volume>1004</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Stanislav</surname>
            <given-names>Vojírˇ</given-names>
          </string-name>
          , Václav Zeman, Jaroslav Kucharˇ, and Tomáš Kliegr. Easyminer/R preview:
          <article-title>Towards a web interface for association rule learning and classification in R</article-title>
          . In Challenge+ DC@ RuleML,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Radek</surname>
            <given-names>Škrabal</given-names>
          </string-name>
          , Milan Šim˚unek,
          <string-name>
            <surname>Stanislav</surname>
            <given-names>Vojírˇ</given-names>
          </string-name>
          , Andrej Hazucha, Tomáš Marek, David Chudán,
          <string-name>
            <given-names>and Tomáš</given-names>
            <surname>Kliegr</surname>
          </string-name>
          .
          <article-title>Association rule mining following the web search paradigm</article-title>
          .
          <source>In Peter A. Flach</source>
          , Tijl Bie, and Nello Cristianini, editors,
          <source>Machine Learning and Knowledge Discovery in Databases</source>
          , volume
          <volume>7524</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>808</fpage>
          -
          <lpage>811</lpage>
          . Springer Berlin Heidelberg,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Václav</given-names>
            <surname>Zeman</surname>
          </string-name>
          .
          <article-title>Analýza cloudového rˇešení akademického nástroje pro dolování pravidel z databází</article-title>
          .
          <source>Systémová Integrace</source>
          ,
          <volume>23</volume>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Václav</surname>
            <given-names>Zeman</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stanislav</surname>
            <given-names>Vojírˇ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaroslav</surname>
            <given-names>Kucharˇ</given-names>
          </string-name>
          , and Tomáš Kliegr.
          <article-title>Využití cloudu pro dolování asociacˇních pravidel z velkých dat prˇes webové rozhraní</article-title>
          .
          <source>In WIKT/DaZ</source>
          <year>2016</year>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>