<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Determining the Acceptability of Abstract Arguments with Graph Convolutional Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lars MALMQVIST</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tommy YUAN</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter NIGHTINGALE</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Suresh MANANDHAR</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>NAAMI</institution>
          ,
          <addr-line>Kathmandu</addr-line>
          ,
          <country country="NP">Nepal</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of York</institution>
          ,
          <addr-line>York</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <fpage>47</fpage>
      <lpage>56</lpage>
      <kwd-group>
        <kwd />
        <kwd>argumentation</kwd>
        <kwd>abstract argumentation</kwd>
        <kwd>solvers</kwd>
        <kwd>graph convolutional networks</kwd>
        <kwd>deep learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>volutional Networks (GCN) [6] that had mixed results depending on the dataset under
consideration.</p>
      <p>There are, however, major benefits to approximate approaches such as Graph
Neural Networks in this area that makes them worth considering in some cases. In
particular, they have greatly superior run-time performance and are more easily integrated into
heavily parallelised Cloud-based architectures, although this comes at the cost of large
memory consumption and GPU support. In this paper, we compare only solver accuracy
and runtime performance as memory consumption metrics are not available.</p>
      <p>This paper contributes to realising this goal by setting a new and improved state of
the art for the use of Deep Neural Networks to solve abstract argumentation problems.
This improvement applies both to the overall accuracy and to the balance between
positive and negative inference. Furthermore, this paper is the first to consider both
credulous and sceptical acceptability. Finally, this paper introduces a new randomised training
approach that may have broader applicability to improving the training of GCN
architectures.</p>
      <p>The rest of the paper will be structured as follows. First, the relevant background
material will be presented. Then we will detail the specific approach and architecture we
have used to approach the problem. We will, then, present our results and offer some
concluding remarks.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <sec id="sec-2-1">
        <title>2.1. Abstract Argumentation</title>
        <p>Extensions are subsets of argumentation frameworks that are collectively acceptable.
Extensions are evaluated based on semantics that define rules for what arguments can be
accepted together. Almost all semantics require the property of conflict-freeness.
Definition 2.3 (Conflict-Freeness) Given an argumentation framework F = (args; atts),
a given subset, S, of this argumentation framework, is said to be conflict-free iff there
does not exist (A; B) 2 atts with A, B 2 S:</p>
        <p>Dung’s original paper [7] defined four semantics, but for the current paper, only the
preferred semantic concerns us as that is the one used in the experiments.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Definition 2.4 (Preferred Extension) An extension, pr args, is a preferred extension</title>
        <p>iff it has the property of conflict-freeness, all its arguments are acceptable with respect
to pr and the extension is maximal with respect to set inclusion.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.2. Convolutional Graph Neural Networks</title>
        <p>Convolutional graph neural networks (CGNNs) draw on the popularity and success of
traditional CNN’s in particular in computer vision. There are, however, different ways
of defining the convolutional operation when it is applied to graphs, which gives rise to
different types of CGNNs. The most common approach bases itself on the digital signal
processing where convolution is seen effectively as a noise removal operation. The
difference between most variants in this approach including the seminal GCN architecture
by Kipf and Welling [8], ChebNet [9], and CaleyNet [10] consists mainly in how they
represent, approximate, and simplify the filter operations used in the convolution of the
graph to achieve computational improvements. The second main approach to CGNNs
stays closer to the conventional CNN definition by considering convolution based on a
node’s spatial relationships[11]. That means spatial based methods in some way
aggregates information from a node’s neighbourhood. This can be seen for instance in the
Message Passing Neural Network [12] that explicitly defines a framework for looking at
graph convolution as a message-passing process.</p>
        <p>There has been one previous paper that has applied GCN methods to Abstract
Argumentation [6]. In this paper, the authors conducted a feasibility study by applying Kipf’s
approach to a range of datasets but using random and real world argumentation
frameworks with limited success. Graph Neural Networks have also been used more
successfully in the related fields of Automated Theorem Proving [13] and the Graph Colouring
Problem [14].</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. GCN for Abstract Argumentation</title>
      <sec id="sec-3-1">
        <title>3.1. GCN Architecture</title>
        <p>The architecture used in this paper builds and extends on the seminal approach
introduced by Kipf and Welling [8], but extends it in a number of areas. In the original
formulation, the GCN consisted of an input layer, two hidden layers with RELU
nonlinearities inserted in between, and ending with an output layer. Node embeddings were
generated using a propagation rule following a first-order approximation of spectral graph
convolutions. We follow the same basic pattern, but add a number of features to allow
for greater depth and to tailor the approach to abstract argumentation graphs that do not
have node-level features.</p>
        <p>The core components of the GCN architecture used in this paper include the
following elements:
1. Graph Embeddings generated using DeepWalk [15], a random walk based
method, fed as input features along with the adjacency matrix of the
argumentation framework. For the experiments in this paper the dimension of the
embedding was 64
2. An input layer receiving these inputs
3. 4 to 6 repeating blocks of a GCN layer [8] and a Dropout layer [16]. For all
experiments the Dropout rate was set to 0.5 and the size of the GCN layers 128
4. Residual connections feeding the original features and the normalised adjacency
matrix as additional input at each block
5. A Sigmoid output layer generating a probability for the acceptability of each
argument in the framework
The model was trained using Adam [17] with Binary Cross-Entropy as the loss function.
The training regime is described in subsequent sections.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Randomised Training Batches</title>
        <p>Real-world abstract argumentation frameworks tend to have a skewed distribution
between acceptable and non-acceptable arguments both for credulous and sceptical
acceptance. In particular, there tends to be a large preponderance of non-acceptable arguments.
In the argumentation frameworks used for the experiments in this paper, the percentage
of non-acceptable arguments ranges from 69.5% to 99.95%. This affects GCN training
as the neural net will by default learn to predict a negative outcome even in cases where
it is incorrect. This problem was also noted by Kuhlmann and Thimm [6], who
generated balanced training data to attempt to address it, but this approach does not seem to
have generalised well. Furthermore, a traditional split into fixed training, validation, and
test sets for training does not work well for this problem. When using such an approach,
the GCN could not effectively generalise to unseen data as confirmed by preliminary
experiments.</p>
        <p>To overcome these limitations, we have devised a randomised training scheme that
generates random training batches at the start of each epoch. The overall training scheme
feeds multiple argumentation frameworks to the neural network as a single graph with
each argumentation framework represented by a single connected component. To
preserve the maximal amount of structural information for learning the entire graph is fed to
each network layer. The output layer of the neural network is a prediction of the
acceptability of the argument. The randomised training operates by generating a new mask of
outcomes to be predicted at the start of each new epoch, essentially asking the neural net
to fill in the blanks. This is done to encourage the network to learn to generalise based
on structural properties of the graphs.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Dynamic Balancing and Outlier Exclusion</title>
        <p>Two additional measures were taken to address the problems related to imbalanced
training data and poor generalisation performance. First, the training mask created to facilitate
the generalisation of the neural network was developed to have the option of dynamically
balancing the training mask to include equal amounts of acceptable and non-acceptable
arguments. This has the intention to avoid the skew caused by unbalanced training data,
but also has the unfortunate side effect of reducing the amount of data used for training.
This mode is therefore not used in all experiments described in the results section.</p>
        <p>A second optimisation was added to handle extremely skewed argumentation
frameworks. Some frameworks have no or almost no acceptable arguments and when included
tend to skew the training disproportionately. These frameworks have been excluded from
the training set using a z-score test with a threshold of 3.5.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Deep Residual Connections</title>
        <p>The original formulation of Graph Convolutional Networks suffers from major
performance degradation with an increase of depth beyond a certain limit. Kipf’s [8] original
GCN, for instance, used only 2-layers in the model. In practice, as the depth of the GCN
increases beyond this limit the model stops responding to training data and instead
converges to a fix-point. This problem is known as the suspended animation problem and
the limit as the suspended animation limit.</p>
        <p>Several approaches have been applied to overcome this limit and allow greater depth
in GCN architectures. Among the most fruitful approaches have been those that adapt the
notion of residual connections to the GCN context[18] by feeding in the graph structure
and node features across layers in a variety of ways.</p>
        <p>In this paper, we follow a similar approach by adapting the graph-raw residual
defined by Zhang and Meng[18]. They define the residual term as the multiplication of the
normalised adjacency matrix and the raw input features. This residual term is fed as input
to each layer in the model, which achieves the aim of extending the suspended animation
limit.</p>
        <p>The only difference in our approach is that the normalised adjacency matrix and raw
input features are fed to each layer separately rather than as a unit, largely for reasons
related to the implementation approach.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experiments</title>
      <sec id="sec-4-1">
        <title>4.1. Dataset and Experimental Setup</title>
        <p>The experiments were run against a dataset consisting of 900 argumentation frameworks
selected from the ICCMA 20172 Benchmark datasets. The selection includes frameworks
from benchmark sets A, B, and C including all 5 difficulty categories. Except for the
exclusion of a small number of very large argumentation frameworks that could not be
processed in the computational environment used for the experiment because of
memory limitations, no systematic exclusions were made. Input frameworks contained up to
15,605 arguments and 6,250,500 attacks with an average number of arguments of 1,595
and average number of attacks of 187,542. We divided the dataset into 9 folds to facilitate
the training process and the neural net was trained sequentially on each fold. Training
and validation sets were generated using a random mask as described above and 10% of
graphs were held out as a test set for final evaluation.</p>
        <p>The GCN model has been evaluated using two separate tasks: credulous
acceptability and sceptical acceptability under the preferred semantic. For an argument to be
credulously acceptable in the context of a given argumentation framework it must belong to
one of the extensions of that argumentation framework; to be sceptically acceptable, it
must belong to all. To be able to frame the problem for supervised learning, we generated
ground truth solutions using the Pyglaf argumentation solver [19] for all argumentation
frameworks in the dataset.</p>
        <p>The experiments were run on a single K80 GPU instance with 12GB ram. Each
model took between 4 and 12 hours to train. Separate models were trained for each
experiment listed in the results below. The following table details what elements were
used for each model.</p>
        <p>Model
4-Layers Modified GCN
5-Layers Modified GCN
6-Layers Modified GCN
Mod GCN with Balanced Data
Mod GCN with Fixed Batches</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Results</title>
      </sec>
      <sec id="sec-4-3">
        <title>4.2.1. Credulous Acceptability</title>
        <p>The results for Credulous Acceptability improve by 30 percentage points on the
previous baseline set by Kuhlmann and Thimm. While the best performing model in terms
of overall accuracy is the 4-Layer Modified GCN, the better performance on positive
acceptability coupled with the very slight decrease in overall accuracy means that the
5-Layer Modified GCN is the overall best performing model.</p>
        <p>Excepting the Modified GCN with Balanced Data, all the other models have a
disparity in favour of negative acceptability, which is consistent with the previous findings
by Kuhlmann and Thimm. The Modified GCN with Balanced Data inverts the pattern
with a similar disparity between positive and negative in favour of the positive side,
which demonstrates that there is nothing inherent about negative acceptability that makes
it easier to learn. If anything, the opposite would seem to be the case as the Modified
GCN with Balanced Data will see roughly equal amounts of positive and negative
training instances. The poorer overall performance of this model could be due to the lower
amounts of training instances seen because of dynamic balancing.</p>
        <p>The extremely poor performance when predicting positive acceptability of the
Modified GCN with Fixed Batches demonstrates the key contribution of the randomised
training regime in obtaining good results in this domain.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.2.2. Sceptical Acceptability</title>
        <p>The results for Sceptical Acceptability seem to reflect the much starker imbalance
between negative and positive instances in this setting. By the nature of the problem there
will tend to be much fewer positive instances in the sceptical setting. This is reflected in
all the models having a large discrepancy between positive and negative accuracy.</p>
        <p>Similar to the credulous results, the Modified GCN with Fixed Batches is the worst
performer. It is, however, even more extreme for the sceptical case, possibly for the
same reason as before. For all intents and purposes the model is incapable of correctly
predicting positive acceptability and only gets good overall accuracy from the extreme
skew of the underlying dataset.</p>
        <p>In contrast to the credulous setting, the Modified GCN with Balanced Data is the
clear overall winner under the sceptical setting. It has both the best overall accuracy
and is vastly superior in predicting positive acceptability with a relatively minor hit to
negative accuracy.</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.3. Ablation Studies</title>
      </sec>
      <sec id="sec-4-6">
        <title>4.3.1. Effects of Depth</title>
        <p>For both credulous and sceptical acceptability, three separate models were trained
differing only in the number of layers they possess3. Based on these results the effect of
depth is inconclusive, but at best minor. Under the credulous setting the 5-Layer
Modified GCN outperforms the other two slightly. Under the sceptical setting the results are
not significantly different enough to assess which model is superior.</p>
        <p>The reasons why greater depth does not seem to lead to better performance are
unclear at this stage and will require further research. Four layers may already be adequate
to capture what can be learned from the training data. It may also be related to the well
known issues experienced by other GCN models that have tried to increase depth.
However, unlike some of these models, we are not seeing a decline in performance with the
addition of layers, but a more or less stationary set of results.</p>
      </sec>
      <sec id="sec-4-7">
        <title>4.3.2. Effects of Randomisation and Balancing</title>
        <p>The largest and most substantial difference in this study is between the models that
include the randomised training regime and those that do not, including both models from
Kuhlmann and Thimm and the Modified GCN with Fixed Batches for both the
credulous and the sceptical case. That points towards this training regime being the largest
contributor to increased model performance.</p>
        <p>The effects of balancing are more mixed, but also substantial. Kuhlmann and Thimm
found a small improvement from using a balanced dataset. In our model under the
credulous setting, the Model GCN with Balanced Data performed substantially worse than
the other models that also use the randomised training regime. However, for the sceptical
setting it was hands-down the best performing model. This may be due to a relationship
with the extent to which the underlying training data is skewed. One could speculate that
the more skewed the training data, the greater the value of dynamic balancing.</p>
        <p>3Note that the tables for the ablation studies do not contain additional results, but are constructed based on
the previous models.</p>
      </sec>
      <sec id="sec-4-8">
        <title>4.4. Runtime Performance</title>
        <p>Kuhlmann and Thimm [6] report an average runtime of 0.000007 seconds per argument
measured over their entire test set compared against a mean runtime for their reference
solver CoQuiAAS of 0.119561 seconds. Our runtime performance is consistent with this
achieving an average runtime of 0.000005 seconds per argument over the complete test
set.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The results in this article improve substantially on the previous baseline and while the
accuracy achieved is perhaps not yet sufficient for most real-world applications, it does
in some cases come close enough for consideration. Problems, however, remain around
the disparity between positive and negative accuracy, although this in some cases is less
marked than others. This problem would need to be addressed prior to any real-world
deployment.</p>
      <p>The reasonable success of this approach seems to be largely down to the use of the
randomised training regime, which may have broader applicability beyond the domain
of abstract argumentation especially in addressing other NP-hard graph structured
problems. The effects of depth and balancing, while they do contribute in some models, are
more mixed.</p>
      <p>Finding ways to improve on this baseline may necessitate solving the question of
why a deeper model does not in this case lead to better results or alternately combining
this model with additional layers of a different type or a different kind of input features
to improve learnability.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Charwat</surname>
            <given-names>G</given-names>
          </string-name>
          , Dvorˇa´k W,
          <string-name>
            <surname>Gaggl</surname>
            <given-names>SA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallner</surname>
            <given-names>JP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woltran</surname>
            <given-names>S</given-names>
          </string-name>
          .
          <article-title>Methods for solving reasoning problems in abstract argumentation - A survey</article-title>
          .
          <source>Artificial Intelligence</source>
          .
          <year>2015</year>
          ;
          <volume>220</volume>
          :
          <fpage>28</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Woltran</surname>
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Abstract Argumentation - All Problems</surname>
          </string-name>
          Solved ? In
          <source>: ECAI</source>
          <year>2014</year>
          ;
          <year>2014</year>
          . p.
          <fpage>1</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Cerutti</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dunne</surname>
            <given-names>PE</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giacomin</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vallati</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Computing preferred extensions in abstract argumentation: A SAT-based approach</article-title>
          .
          <source>In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          . vol.
          <volume>8306</volume>
          LNAI;
          <year>2014</year>
          . p.
          <fpage>176</fpage>
          -
          <lpage>193</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Cerutti</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaggl</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thimm</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallner</surname>
            <given-names>J</given-names>
          </string-name>
          .
          <article-title>Foundations of implementations for formal argumentation</article-title>
          .
          <source>IfCoLog Journal of Logics and their Applications</source>
          .
          <year>2017</year>
          ;
          <volume>4</volume>
          (
          <issue>8</issue>
          ):
          <fpage>2623</fpage>
          -
          <lpage>2705</lpage>
          . Available from: http://argumentationcompetition.org.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Thimm</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Stochastic local search algorithms for abstract argumentation under stable semantics</article-title>
          .
          <source>Frontiers in Artificial Intelligence and Applications</source>
          .
          <year>2018</year>
          ;
          <volume>305</volume>
          :
          <fpage>169</fpage>
          -
          <lpage>180</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Kuhlmann</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thimm</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Using Graph Convolutional Networks for Approximate Reasoning with Abstract Argumentation Frameworks: A Feasibility Study</article-title>
          .
          <source>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          .
          <year>2019</year>
          ;
          <volume>11940 LNAI</volume>
          :
          <fpage>24</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Dung</given-names>
            <surname>PM</surname>
          </string-name>
          .
          <article-title>On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming and n-person games</article-title>
          .
          <source>Artificial Intelligence</source>
          .
          <year>1995</year>
          :
          <fpage>321</fpage>
          -
          <lpage>357</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Kipf</surname>
            <given-names>TN</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Welling</surname>
            <given-names>M</given-names>
          </string-name>
          .
          <article-title>Semi-supervised classification with graph convolutional networks</article-title>
          .
          <source>In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings; 2019.</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Available</surname>
          </string-name>
          from: http://arxiv.org/abs/1609.02907.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Monti</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boscaini</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masci</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodola</surname>
          </string-name>
          ` E,
          <string-name>
            <surname>Svoboda</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bronstein</surname>
            <given-names>MM</given-names>
          </string-name>
          .
          <article-title>Geometric deep learning on graphs and manifolds using mixture model CNNs</article-title>
          .
          <source>In: Proceedings - 30th IEEE Conference on Computer Vision</source>
          and Pattern Recognition,
          <string-name>
            <surname>CVPR</surname>
          </string-name>
          <year>2017</year>
          . vol. 2017-Janua;
          <year>2017</year>
          . p.
          <fpage>5425</fpage>
          -
          <lpage>5434</lpage>
          . Available from: https://arxiv.org/pdf/1611.08402.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Levie</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monti</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bresson</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bronstein</surname>
            <given-names>MM</given-names>
          </string-name>
          .
          <article-title>CayleyNets: Graph Convolutional Neural Networks with Complex Rational Spectral Filters</article-title>
          .
          <source>IEEE Transactions on Signal Processing</source>
          . 2019 may;
          <volume>67</volume>
          (
          <issue>1</issue>
          ):
          <fpage>97</fpage>
          -
          <lpage>109</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Available</surname>
          </string-name>
          from: http://arxiv.org/abs/1705.07664.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>IEEE Transactions on Neural Networks and Learning Systems</source>
          .
          <year>2020</year>
          :
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Riba</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischer</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <article-title>Llado´ s J, Forne´s A. Learning Graph Distances with Message Passing Neural Networks</article-title>
          .
          <source>In: Proceedings - International Conference on Pattern Recognition</source>
          . vol. 2018-Augus. IEEE;
          <year>2018</year>
          . p.
          <fpage>2239</fpage>
          -
          <lpage>2244</lpage>
          . Available from: https://ieeexplore.ieee.org/document/8545310/.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            <given-names>J</given-names>
          </string-name>
          .
          <article-title>Premise selection for theorem proving by deep graph embedding</article-title>
          .
          <source>In: Advances in Neural Information Processing Systems</source>
          . vol. 2017-Decem;
          <year>2017</year>
          . p.
          <fpage>2787</fpage>
          -
          <lpage>2797</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Lemos</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prates</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Avelar</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamb</surname>
            <given-names>L</given-names>
          </string-name>
          .
          <article-title>Graph Colouring Meets Deep Learning: Effective Graph Neural Network Models for Combinatorial Problems</article-title>
          .
          <year>2019</year>
          . Available from: http://arxiv.org/abs/
          <year>1903</year>
          .04598.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Perozzi</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Al-Rfou</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skiena</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>DeepWalk: Online learning of social representations</article-title>
          .
          <source>Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          .
          <year>2014</year>
          :
          <fpage>701</fpage>
          -
          <lpage>710</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Available</surname>
          </string-name>
          from: https://arxiv.org/pdf/1403.6652.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Srivastava</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krizhevsky</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salakhutdinov</surname>
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Dropout</surname>
          </string-name>
          :
          <article-title>A Simple Way to Prevent Neural Networks from Overfitting</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          .
          <year>2014</year>
          ;
          <volume>15</volume>
          (
          <issue>56</issue>
          ):
          <fpage>1929</fpage>
          -
          <lpage>1958</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Available</surname>
          </string-name>
          from: http://jmlr.org/papers/v15/srivastava14a.html.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Kingma</surname>
            <given-names>DP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ba</surname>
            <given-names>JL</given-names>
          </string-name>
          .
          <article-title>Adam: A method for stochastic optimization</article-title>
          .
          <source>In: ICLR</source>
          <year>2015</year>
          ;
          <year>2015</year>
          . p.
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Available</surname>
          </string-name>
          from: https://arxiv.org/pdf/1412.6980.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Zhang</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meng</surname>
            <given-names>L</given-names>
          </string-name>
          .
          <article-title>GResNet: Graph Residual Network for Reviving Deep GNNs from Suspended Animation</article-title>
          .
          <year>2019</year>
          ;(
          <year>2016</year>
          ):
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          . Available from: http://arxiv.org/abs/
          <year>1909</year>
          .05729.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Alviano</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>The pyglaf argumentation reasoner</article-title>
          . OpenAccess Series in Informatics.
          <year>2018</year>
          ;
          <volume>58</volume>
          :
          <fpage>2</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>