<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Category-based Inductive Learning in Shared NeMuS</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ana Carolina Melik Schramm</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Edjard de Souza Mota</string-name>
          <email>edjard@icomp.ufam.edu.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jacob M. Howe</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Artur S. d'Avila Garcez</string-name>
          <email>a.garcez@city.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>City, University of London</institution>
          ,
          <addr-line>London, EC1V 0HB</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Shared NeMuS Approach to Train Problem</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Universidade Federal do Amazonas, Instituto de Computação, Campus Setor Norte Coroado - Manaus - AM - Brasil CEP: 69080-900</institution>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        One of the main objectives of cognitive science is to use abstraction to create
models that accurately represent the cognitive processes that constitute learning,
such as categorisation. Relational knowledge is important in this task, since it is
through the reasoning processes of induction and analogy over relations that the
mind "creates" categories (it later establishes causal relations between them by
using induction and abduction), and analogies exemplify crucial properties of
relational processing, such as structure-consistent mapping [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Given the complexity of the task, no model to date has accomplished it
completely. The associationist/connectionist approach represents those processes
through associations between different pieces of information, implemented with
artificial neural networks. However, it faces a major obstacle: the idea (called
propositional fixation) that neural networks cannot represent relational knowledge. A
recent attempt to tackle symbolic extraction from artificial neural networks
was proposed in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        The cognitive agent Amao uses a shared Neural Multi-Space (Shared NeMuS)
of coded first-order expressions to model the various aspects of logical formulae
as separate spaces, with importance vectors of different sizes. Amao [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] uses
inverse unification as the generalization mechanism for learning from a set of
logically connected expressions of the Herbrand Base (HB). Here we present an
experiment that uses this learning mechanism to model a simplified version of the train
set from Michalski's train problem [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
In Michalski's train problem, there are 10 trains: 5 eastbound and 5 westbound.
      </p>
      <p>Whether a train is going east or west is determined by its properties. Using these
trains, a simple knowledge base has been created, taking into account the size of the train
wagons (short or not) and whether these wagons are closed or not. The number
of wheels, wagon format and other attributes have been ignored in order to make
the base simpler.</p>
      <p>Copyright © 2017 for this paper by its authors. Copying permitted for private and academic purposes.</p>
      <p>All the eastbound trains have at least one wagon which is both short and
closed. That is what determines whether a train is eastbound or westbound. The
idea is to use the shared NeMuS structure to induce the rule for eastbound, knowing
that t1 (the first train) is going east. Given that information, we can directly
retrieve all predicate instances, called bindings, which have t1 as an attribute.</p>
      <p>They are the following:
train(t1).
car(t1, c1_t1).
car(t1, c2_t1).
car(t1, c3_t1).
car(t1, c4_t1).
short(c1_t1).
closed(c1_t1).</p>
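      <p>The retrieval of bindings can be pictured with a minimal Python sketch. It is not Amao's implementation: the names FACTS and build_bindings and the tuple encoding are assumptions made for illustration, and a plain dictionary index stands in for the shared NeMuS. Only the ground facts are taken from the list above.</p>
      <preformat><![CDATA[
```python
# Hypothetical encoding of the ground facts listed in the text.
# Each fact is a (predicate, arguments) pair.
FACTS = [
    ("train", ("t1",)),
    ("car", ("t1", "c1_t1")),
    ("car", ("t1", "c2_t1")),
    ("car", ("t1", "c3_t1")),
    ("car", ("t1", "c4_t1")),
    ("short", ("c1_t1",)),
    ("closed", ("c1_t1",)),
]


def build_bindings(facts):
    """Index every fact under each constant that occurs in it."""
    index = {}
    for predicate, args in facts:
        for constant in args:
            index.setdefault(constant, []).append((predicate, args))
    return index


BINDINGS = build_bindings(FACTS)

# Direct lookup of all predicate instances that have t1 as an attribute.
t1_bindings = BINDINGS["t1"]
for predicate, args in t1_bindings:
    print(f"{predicate}({', '.join(args)}).")
```
]]></preformat>
      <p>Under this encoding the lookup for t1 returns train(t1) and the four car facts. The short and closed facts are indexed under c1_t1 instead, which is where the next step of the text picks them up.</p>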
      <p>The predicate car links t1 to all its wagons (or carriages), so car(t1, c1_t1)
means that c1_t1 is a wagon that belongs to t1. Taking the first instance of the
predicate car, we now know that t1 has a wagon named c1_t1. Amao, through
its shared NeMuS, accesses c1_t1's bindings and, using a polynomial search, finds
both occurrences of c1_t1, in short and in closed, as seen above. This mechanism
is called linkage pattern in Amao's learning mechanism.</p>
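      <p>A rough sketch of the linkage step follows, again in Python and again with hypothetical names (linked_constants, bindings_of). A linear scan over the fact list is used here as a simple stand-in for the polynomial search mentioned above; how Amao actually traverses the shared NeMuS is not specified in this text.</p>
      <preformat><![CDATA[
```python
# Hypothetical encoding of the facts from the text as (predicate, args) pairs.
FACTS = [
    ("train", ("t1",)),
    ("car", ("t1", "c1_t1")),
    ("car", ("t1", "c2_t1")),
    ("car", ("t1", "c3_t1")),
    ("car", ("t1", "c4_t1")),
    ("short", ("c1_t1",)),
    ("closed", ("c1_t1",)),
]


def bindings_of(constant, facts):
    """All facts in which the given constant occurs as an attribute."""
    return [(pred, args) for pred, args in facts if constant in args]


def linked_constants(constant, facts, link_predicate="car"):
    """Constants reached from `constant` through the binary link predicate."""
    return [args[1] for pred, args in facts
            if pred == link_predicate and args[0] == constant]


# Follow the first car fact from t1 to its wagon, then read the wagon's bindings.
wagons = linked_constants("t1", FACTS)
first_wagon = wagons[0]
properties = [pred for pred, args in bindings_of(first_wagon, FACTS)
              if len(args) == 1]
print(first_wagon, properties)
```
]]></preformat>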
      <p>At this point t1 is a train that has c1_t1 as a wagon, and this wagon is not
closed. Amao also has the linkage predicate connecting both c1_t1 and t1. Thus,
a candidate hypothesis generated would look like eastbound(X) ← car(X, Y)
∧ short(Y) ∧ closed(Y). However, this may not be the only possible
hypothesis, so the other wagons carried by t1 need to be considered.</p>
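      <p>The move from linked ground facts to a candidate rule can be sketched as a consistent replacement of constants by variables. This is only a simplified stand-in for Amao's inverse unification, and the function name generalise and the variable-naming scheme are assumptions. The input atoms are the ones listed earlier for t1 and c1_t1.</p>
      <preformat><![CDATA[
```python
def generalise(head_predicate, example, atoms):
    """Replace each constant by a variable, consistently across all atoms.

    A simplified illustration of generalisation; it is not claimed to be
    the inverse unification procedure used by Amao.
    """
    names = iter("XYZUVW")
    mapping = {example: next(names)}
    body = []
    for predicate, args in atoms:
        variables = []
        for constant in args:
            if constant not in mapping:
                mapping[constant] = next(names)
            variables.append(mapping[constant])
        body.append(f"{predicate}({', '.join(variables)})")
    return f"{head_predicate}({mapping[example]}) <- " + " ^ ".join(body) + "."


# Ground atoms linked through the wagon c1_t1, as listed in the text.
linked_atoms = [
    ("car", ("t1", "c1_t1")),
    ("short", ("c1_t1",)),
    ("closed", ("c1_t1",)),
]

rule = generalise("eastbound", "t1", linked_atoms)
print(rule)
```
]]></preformat>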
      <p>short(c2_t1). short(c3_t1). short(c4_t1).
closed(c2_t1). closed(c3_t1). closed(c4_t1).</p>
      <p>Among the possible hypotheses that may define a train as being eastbound,
we have:
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).</p>
      <p>Adding negative examples, we can reduce the number of possible hypotheses.
In this case, the simplest way to do that is to use the 10th train, t10, as a negative
example. Using the same method as explained above, the structure can select
all predicates that have t10 as an attribute:
car(t10, c1_t10).
car(t10, c2_t10).</p>
      <p>Then, all the predicates that have t10's wagons as attributes:
short(c1_t10).
closed(c1_t10).
short(c2_t10).
closed(c2_t10).</p>
      <p>Thus, the hypotheses that definitely do not define a train as being eastbound
are:</p>
      <p>Both hypotheses are among the possible options defined above. Excluding them,
the correct option remains. The target eastbound(X) can be defined by:
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).</p>
      <p>Formalizing what was explained above:
1. With the positive example (t1), get all predicates (bindings) that have t1
as an attribute;
2. Access the bindings of the attributes linked to t1 using polynomial search (linkage
pattern);
- in this case, the attributes are c1_t1, c2_t1, c3_t1 and c4_t1
3. Repeat the first two steps for the negative example (t10);
- in this case, the attributes linked to t10 are c1_t10 and c2_t10
4. If there are hypotheses generated by using the positive example that are
repeated in the negative example, they will not be in the list of possible
hypotheses.
- some of the hypotheses generated by using only the positive example
are:
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y).
However, using only the negative example, the first and third hypotheses
would also be generated. By using both examples, these two do not make
it into the list of possible hypotheses, and the correct one, which is
eastbound(X) ← car(X, Y) ∧ short(Y) ∧ closed(Y), remains.</p>
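      <p>The four steps can be put together in a short Python sketch. Two points are assumptions and should be read as such: the per-wagon truth values below are invented for illustration, chosen only so that the positive example yields three candidate bodies and the negative example two, and the names TRAINS, wagon_patterns, render and induce are hypothetical. The sketch shows the elimination idea only, not Amao's search over the shared NeMuS.</p>
      <preformat><![CDATA[
```python
ATTRIBUTES = ("short", "closed")

# ASSUMPTION: these truth values are illustrative, not data from the paper.
TRAINS = {
    "t1": {
        "c1_t1": {"short": True, "closed": False},
        "c2_t1": {"short": True, "closed": True},
        "c3_t1": {"short": False, "closed": False},
        "c4_t1": {"short": True, "closed": False},
    },
    "t10": {
        "c1_t10": {"short": True, "closed": False},
        "c2_t10": {"short": False, "closed": False},
    },
}


def wagon_patterns(train):
    """Steps 1-2: one candidate rule body per distinct wagon description."""
    patterns = []
    for properties in TRAINS[train].values():
        pattern = tuple(properties[name] for name in ATTRIBUTES)
        if pattern not in patterns:
            patterns.append(pattern)
    return patterns


def render(pattern):
    """Write a pattern as a rule, using 'not' for a negated literal."""
    literals = [f"{name}(Y)" if value else f"not {name}(Y)"
                for name, value in zip(ATTRIBUTES, pattern)]
    return "eastbound(X) <- car(X, Y) ^ " + " ^ ".join(literals) + "."


def induce(positive, negative):
    """Steps 3-4: drop every candidate that the negative example also yields."""
    rejected = wagon_patterns(negative)
    return [p for p in wagon_patterns(positive) if p not in rejected]


surviving = induce("t1", "t10")
for pattern in surviving:
    print(render(pattern))
```
]]></preformat>
      <p>With these assumed values, the first and third candidates from t1 also arise from t10 and are discarded, leaving only the body in which the wagon is both short and closed.</p>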
    </sec>
    <sec id="sec-2">
      <title>Concluding Remarks</title>
      <p>The knowledge base created is only a simplification of the original train problem.
As explained before, many attributes such as number of wheels, wagon format,
load shape and roof shape have been ignored. Had they been included, more
hypotheses could have been generated through Amao's inductive learning
mechanism over the shared NeMuS. One current limitation is that it cannot deal
with predicate invention, which would allow categories to be created automatically by
means of abstraction/new predicates.</p>
      <p>One possible road to explore is to take advantage of shared NeMuS weights
to integrate a neural network classification method to help identify categories. In
the train set, we know which trains are eastbound, but the rule that defines the
eastbound category is not known before using Amao to define it. Understanding
what makes a train eastbound or not can help us categorize any train that might
be added to the set in the future.</p>
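      <p>As a small illustration of that last point, the induced rule can be applied to a train that was not in the original set. The two trains below are hypothetical, as is the function name is_eastbound. The rule itself is the one stated in the introduction: a train is eastbound if it has at least one wagon that is both short and closed.</p>
      <preformat><![CDATA[
```python
def is_eastbound(wagons):
    """Apply the induced rule: some wagon is both short and closed."""
    return any(w["short"] and w["closed"] for w in wagons)


# Hypothetical trains added after learning; not part of the original set.
new_train = [
    {"short": False, "closed": True},
    {"short": True, "closed": True},
]
other_train = [
    {"short": True, "closed": False},
    {"short": False, "closed": True},
]

print(is_eastbound(new_train))
print(is_eastbound(other_train))
```
]]></preformat>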
      <p>Another goal we aim to pursue is to make use of weights to implement neural
mechanisms. We expect to devise more efficient heuristics to guide hypothesis
generation, improving Amao's learning mechanism.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Franca</surname>
            ,
            <given-names>M.V.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>D'Avila Garcez</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaverucha</surname>
          </string-name>
          , G.:
          <article-title>Relational knowledge extraction from neural networks</article-title>
          .
          <source>In: Proceedings of the 2015th International Conference on Cognitive Computation: Integrating Neural and Symbolic Approaches</source>
          - Volume
          <volume>1583</volume>
          . pp.
          <fpage>146</fpage>
          –
          <lpage>154</lpage>
          . COCO'15,
          <string-name>
            <surname>CEUR-WS</surname>
          </string-name>
          .org, Aachen, Germany (
          <year>2015</year>
          ), http://dl.acm.org/citation.cfm?id=2996831.2996849
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Halford</surname>
            ,
            <given-names>G.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wilson</surname>
            ,
            <given-names>W.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Phillips</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Relational knowledge: the foundation of higher cognition</article-title>
          .
          <source>Trends in Cognitive Sciences</source>
          <volume>14</volume>
          ,
          <fpage>497</fpage>
          –
          <lpage>505</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Larson</surname>
            ,
            <given-names>J.B.</given-names>
          </string-name>
          , Michalski, R.S.:
          <source>Inductive Inference of VL Decision Rules</source>
          <volume>14</volume>
          ,
          <fpage>16</fpage>
          –
          <lpage>20</lpage>
          (
          <year>1977</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mota</surname>
          </string-name>
          , E.d.S.,
          <string-name>
            <surname>Howe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          : In: Besold,
          <string-name>
            <given-names>T.</given-names>
            R.,
            <surname>d'Avila Garcez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Noble</surname>
          </string-name>
          , I. (eds.) To appear
          <source>NeSy 2017 Neural-Symbolic Learning and Reasoning (July)</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>