<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Hybrid Multi-Agent System for Metalearning in Data Mining</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jakub Sˇm´ıd</string-name>
          <email>jakub.smid@ktiml.mff.cuni.cz</email>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Charles University in Prague, Faculty of Mathematics and Physics</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Computer Science, Academy of Sciences of the</institution>
          <country country="CZ">Czech Republic</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, a multi-agent system for metalearning in the data mining domain is presented. The system provides a user with intelligent features, such as recommendation of suitable data mining techniques for a new dataset, parameter tuning of such techniques, and building up a metaknowledge base. The architecture of the system, together with different user scenarios, and the way they are handled by the system, are described.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Ondrˇej Kaz´ık</title>
      <p>and</p>
    </sec>
    <sec id="sec-2">
      <title>Roman Neruda</title>
      <p>
        1
2
Lately, data mining — an automated process of gaining information
from datasets — has become an issue of interest in the artificial
intelligence. This interest have been whetted by the progress in the
computational technology, such as high performance machine clusters or
large storage devices, but most importantly by the possibility of an
access to enormous amount of data that are collected on daily basis.
The datasets vary in many factors as they origin in different areas of
human or nature activities. It is hard even for a data mining expert
to choose from the wide range of machine learning methods that are
used in data mining and to set its parameters to the values that would
produce a reasonable results for the specific dataset. Tools that ease
up the parameter set up can significantly boost up the productivity of
data mining process. Moreover, the automation of the whole process
would help those researchers, who are not data mining experts, to
enjoy the benefits of this research line. This is where the metalearning
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] comes into play.
      </p>
      <p>Metalearning over data mining methods and datasets is a very
demanding task, especially with respect to computational performance
as it uses results of data mining methods applied on various datasets
as its training/testing data. The software that is capable of both data
mining and metalearning is by definition a large and complex
system. To design the architecture of our system, we have chosen the
agent-based approach as it brings many advantages to this complex
task. The main one being its distributed and parallel nature — the
system can spread over computer networks and be accessed by many
users who only by using the system and running their experiments
provide the data needed for metalearning algorithms. It also supplies
a fast parallel execution of performance demanding tasks. The
interconnection of different parts of the system (i.e. the communication
among agents) is done only by sending messages which results in
an easy extensibility and re-usability of the parts of the system —
agents. It enables researchers to easily add their own components
(e.g. custom data mining methods) and to re-use the implemented
components in different situations.</p>
      <p>
        We have designed and implemented a multi-agent system (MAS)
which is capable of executing simple data mining tasks as well as
complex metalearning problems (involving not only recommending
of data mining methods but also setting their parameters), and it
provides all the mechanisms necessary for experimenting with different
metalearning approaches. The system is hybrid — it employs
combination of different artificial intelligence methods [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        We use JADE [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] — the multi-agent framework, as a base for our
agents; most of the computational agents in our system use Weka [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
data mining methods. The extensibility of our system is assured by
the use of the structured ontology language and following the FIPA
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] international standards of agents’ communication.
2
      </p>
    </sec>
    <sec id="sec-3">
      <title>Scenarios</title>
      <p>To propose an appropriate architecture of our computational MAS,
we have considered the following basic scenarios for processing a
dataset. In the most simple case the user knows which method and
what parameters of this method she would like to use. In the other
two basic scenarios, the system uses its intelligent meta-learning
features: If the user knows what method to use but does not know how to
set its parameters, the system is able to search the parameter space of
the method and find a setting that provides good results. In the third
case, the user does not even know what method to use and lets the
system decide by itself. In this case the system recommends the best
possible method or provides a ranking of the methods based on
predicted errors and duration. These simple scenarios can be extended
into more complex ones — e.g. it is also possible to combine the
recommendation of the best method the with parameter space search,
when the recommender chosen by the user recommends an interval
of the parameter’s values.</p>
      <p>As a positive side effect, the metaknowledge base for metalearning
purposes is being build up by each experiment.
3</p>
    </sec>
    <sec id="sec-4">
      <title>Role-based Architecture</title>
      <p>
        In order to effectively design our system, we have chosen the
organization-centered formalism AGR (Agent-Group-Role). The role
is a set of capabilities and responsibilities that the agent accepts by
handling the role. Group — the building block of a MAS — is a
set of agents with allowed roles and interactions, defined by a group
structure. The multi-agent system then consists of multiple groups
which can overlap by agents belonging to more than one group. In
this formalism, we abstract from the algorithmic details and inner
logic of the agents in the MAS. In our previous work [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], we have
Data-management
      </p>
      <p>Group</p>
      <sec id="sec-4-1">
        <title>DDAATTAA</title>
      </sec>
      <sec id="sec-4-2">
        <title>MMAANNAAGGEERR</title>
        <p>RREECCOOMMMMEENNDDEERR</p>
      </sec>
      <sec id="sec-4-3">
        <title>OOPPTTIIOONNSS</title>
      </sec>
      <sec id="sec-4-4">
        <title>MMAANNAAGGEERR</title>
      </sec>
      <sec id="sec-4-5">
        <title>SSEEAARRCCHH</title>
      </sec>
      <sec id="sec-4-6">
        <title>AAGGEENNTT</title>
        <p>used the ontological formalism of OWL-DL to describe the
organizational model.</p>
        <p>The following group structures were defined according to the
aforementioned scenarios: administrative group structure,
computational group structure, search group structure, recommendation
group structure, data group structure and data-management group
structure.</p>
        <p>Our MAS is composed of groups that are instances of these group
structures. The architecture is depicted in the Figure 1.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Metalearning</title>
      <p>The key parts of our system are those providing intelligent
metalearning behavior, i.e. agents that provide parameter space search methods
and recommender agents. These agents are intended to (at least
partially) replace a human expert. They make use of the previous
experience gathered by the system, which is captured in the
metaknowledge base. It contains results of machine learning experiments and
metadata — general features of the datasets.</p>
      <p>
        The MAS-based solution allows a flexibility in choice of the
parameter space search algorithms, each of these is encapsulated in a
search agent. General tabulation, random search, simulated
annealing [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], or parallel methods, such as evolutionary algorithms [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], are
implemented in our system. Another great benefit of the agent-based
approach is the natural capability of parallel execution of
computations with various parameters which significantly decreases the time
needed for the execution of the parameter space search process.
      </p>
      <p>
        One of essential features of our MAS is its capability of
recommending a suitable computational method for a new dataset,
according to datasets similarity and previously gathered experience. The
choice of the similar dataset(s) is based on various previously
proposed metrics [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], which measure the similarity of their metadata.
Our database contains over two million records, that are used to
suggest the proper method (including its parameters) and estimate its
performance on a new dataset.
      </p>
      <p>The latest version of our MAS contains the following types of
recommenders, which differ in the metric used and in the number of
recommended methods they provide:
Search
Group
5</p>
      <p>
        Conclusions
• Basic recommender chooses a method based on the single closest
dataset using the unweighted metadata metric.
• Clustering Based Classification [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] chooses the whole cluster of
similar datasets and the corresponding methods, using different
sets of metadata features.
• Evolutionary Optimized Recommenders are similar to the two
above described recommender types, using different weighted
metrics, optimized by an evolutionary algorithm.
• Interval Recommender recommends intervals of suitable
parameter values and leaves their fine-tuning to the parameter space
search methods.
      </p>
      <p>Another functionality of our system is a multi-objective
optimization of data mining configurations. The search algorithm is employed
in order to find beneficial combinations of pre-processings and
machine learning methods to the presented data. The minimization is
performed in error-rate as well as run-time criteria.</p>
      <p>In this paper, we presented a multi-agent system for metalearning
in data mining, which includes solving of the most important and
challenging metalearning tasks – the recommendation of a suitable
method for a new dataset, and the tuning of parameters of such
methods. We have proposed the systems architecture and proved its
usability by an implementation that is used by our research team on a
regular basis to conduct metalearning and data mining experiments.
The role-based multi-agent approach brings in many advantages into
a complex task of metalearning, the main benefit being its easy
extensibility. The multi-agent parallel nature of the system speeds up
the time consuming tasks significantly.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>Jakub Sˇm´ıd and Kla´ra Pesˇkova´ have been supported by the Charles
University Grant Agency project no. 610214, R. Neruda has been
supported by the Ministry of Education of the Czech Republic
project COST LD 13002.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] The Foundation for Intelligent Physical Agents (FIPA)</article-title>
          . http://www.fipa.org/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Java</given-names>
            <surname>Agent</surname>
          </string-name>
          <article-title>DEvelopment framework</article-title>
          . http://jade.tilab.com/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Pavel</given-names>
            <surname>Brazdil</surname>
          </string-name>
          , Christophe G.
          <string-name>
            <surname>Giraud-Carrier</surname>
            ,
            <given-names>Carlos</given-names>
          </string-name>
          <string-name>
            <surname>Soares</surname>
          </string-name>
          , and Ricardo Vilalta, Metalearning - Applications to Data Mining.,
          <string-name>
            <surname>Cognitive</surname>
            <given-names>Technologies</given-names>
          </string-name>
          , Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Oscar</given-names>
            <surname>Castillo</surname>
          </string-name>
          , Patricia Melin, Janusz Kacprzyk, and Witold Pedrycz,
          <source>Hybrid Intelligent Systems</source>
          , Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Agoston</surname>
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Eiben</surname>
            and
            <given-names>J. E.</given-names>
          </string-name>
          <string-name>
            <surname>Smith</surname>
          </string-name>
          , Introduction to Evolutionary Computing, SpringerVerlag,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hall</surname>
          </string-name>
          et al.,
          <source>'The weka data mining software: An update.', SIGKDD Explorations</source>
          ,
          <volume>11</volume>
          , (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Ondˇrej</given-names>
            <surname>Kaz</surname>
          </string-name>
          <article-title>´ık and Roman Neruda, 'Ontological modeling of meta learning multi-agent systems in OWL-DL'</article-title>
          ,
          <source>IAENG International Journal of Computer Science</source>
          ,
          <volume>39</volume>
          (
          <issue>4</issue>
          ),
          <fpage>357</fpage>
          -
          <lpage>362</lpage>
          , (
          <year>Dec 2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Ondrej</given-names>
            <surname>Kaz</surname>
          </string-name>
          <article-title>´ık, Kla´ra Pesˇkova´, Jakub Sˇ m´ıd, and Roman Neruda, 'Clustering based classification in data mining method recommendation'</article-title>
          ,
          <source>in ICMLA '13: Proceedings of the 2013 12th International Conference on Machine Learning and Applications</source>
          , pp.
          <fpage>356</fpage>
          -
          <lpage>361</lpage>
          , Washington, DC, USA, (
          <year>2013</year>
          ). IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kirkpatrick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Gelatt</surname>
          </string-name>
          , Jr., and
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Vecchi</surname>
          </string-name>
          , '
          <article-title>Optimization by simulated annealing'</article-title>
          ,
          <source>Science</source>
          ,
          <volume>220</volume>
          ,
          <fpage>671</fpage>
          -
          <lpage>680</lpage>
          , (
          <year>1983</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>