<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Siamese Network applied to Authorship Verification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Emir Araujo-Pino</string-name>
          <email>emiraraujoing@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Helena Gómez-Adorno</string-name>
          <email>helena.gomez@iimas.unam.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gibran Fuentes-Pineda</string-name>
          <email>gibranfp@unam.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas Universidad Nacional Autónoma de México</institution>
          ,
          <addr-line>Mexico City</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Posgrado en Ciencia e Ingeniería de la Computación Universidad Nacional Autónoma de México</institution>
          ,
          <addr-line>Mexico City</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <abstract>
        <p>In this paper, we present our approach to the Authorship task at PAN 2020. The task consists in comparing two documents and automatically determine if they are written by the same author. To solve this task, we introduce a Siamese network architecture that is trained on character n-grams of the document pairs to be compared. We experimented with different hyperparameters when training the model on a large and a small dataset. Our best model achieved an overall evaluation of 0.804, which is the average of AUC, c@1, f_05_u, and F1 scores.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        PAN 3 is a CLEF 4 Lab on uncovering plagiarism, authorship, and social software
misuse. PAN [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] has a series of shared tasks of text forensics and stylometry, this year’s
campaign focuses on authorship verification, celebrity profiling, profiling fake news
Spreaders on Twitter, and Style Change Detection. In this paper, we describe our
approach to the authorship verification task, which aims to identify if both texts belong to
the same author or not. From the machine learning perspective, this task can be seen as
a binary classification problem.
      </p>
      <p>
        Nowadays, there are many unknown authorship letters such as email Fraud [24],
suicide [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], and terrorism [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for which it is necessary to verify the authorship.
Currently, to solve authorship verification problems there are different approaches such as
distance based [23], machine learning based [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], and impostors [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] which have shown
great results in previous PAN tasks. Besides, there are four different approaches to solve
these problems and can be separated by instance or profile based and intrinsic or
extrinsic based [20]. This year’s shared task consists of closed set problems, which we solved
by an instance-intrinsic based approach.
      </p>
      <p>
        As far as we know, there are few implementations of deep learning based methods
to solve this problem. Lopez-Velazco 2016 [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] introduced a siamese architecture [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
with character embeddings as input, to the authorship verification problem. Exploring
architectures and methods we finally train a deep learning siamese network on character
3-grams to solve the PAN 2020 authorship verification shared task.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Previous Work</title>
      <p>
        Authorship identification related problems still have challenges to overcome, there are
many methods to solve specific problems [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Last year, PAN 2019 shared the cross
domain authorship attribution task [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] which consisted of identifying the author of a
given document among a closed set of authors. Most of the submitted solutions of PAN
2019 were SVM variations and no even one participant used a deep learning based
approach. The main differences between a traditional machine learning and a deep
learning application from the development perspective are the hardware requirements, the
amount of data to handle, and the ability to handle raw data as input [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Conventional
machine learning techniques need extensive feature engineering to transform the raw
data in order to use it for training and testing a model.
      </p>
      <p>
        Deep learning algorithms were used in PAN 18 authorship attribution shared task [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
but as far as we know there is no much research on Siamese networks to solve these
problems. In [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], a Siamese architecture composed of a character embeddings input
layer, a convolution [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], and LSTM [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] networks was tested on an English books
corpus. This architecture achieved good performance on a Gutenberg [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] project corpus
specifically designed to test authorship verification methods. The main differences with
our architecture are the input which consists of n-grams, the way of extracting the
embedded vector from each input, and the compare method which originally uses
euclidean distance.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Datasets</title>
      <p>The PAN 2020 authorship verification organization provided two datasets: large and
small. The purpose of these two corpus was to train two models, one for each dataset,
to compare a hungry data method as a deep learning model with a symbolic machine
learning approach. As it is known, one of the main problems for training a neural
network is to get enough data to achieve good performance. So, we decided to train two
models with the aim to compare how much the amount of data affects the performance
of our method.</p>
      <p>Table 1 shows the results of a basic data analysis over both datasets. The Samples
column shows the number of documents pairs, the Different Texts column shows the
number of different documents in the corpus, and the last three columns show the
characters statistics of the documents. The small dataset corresponds to 19% of the size
Dataset
Large
Small</p>
      <p>
        Samples
of the large dataset. Most of the documents are around 21000 characters and there are
almost no duplicated documents. The details of the datasets are available in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Methodology</title>
      <p>In the machine learning literature, there is a wide variety of deep learning architectures,
as far as we know, conventionally most of these receive as input raw data. In other
words, deep learning models learn to extract relevant features from raw data to reach
good performance. Despite this, our neural architecture approach receives character
n-grams as inputs. So in order to train our network, the first step is to transform the
texts dataset into character n-grams (with n varying from 1 to 3) frequency vectors (the
dimensions of the vectors correspond to the frequency of each n-gram).
4.1</p>
      <sec id="sec-4-1">
        <title>Features Extraction</title>
        <p>In order to obtain the n-grams of the documents pairs, we use the Scikit Learn Python
module [19]. It extracts all the n-grams (with n varying from 1 to 3) within the
documents and produces a vector representation with the frequency of such n-grams. With
the purpose of optimizing time and memory resources and without compromising the
classifier performance we performed basic hyperparameters tuning during the n-gram
features extraction process. The hyperparameters we tune are the minimum document
frequency and the maximum document frequency. To archive this we train the same
Siamese architecture on different n-grams sets. Due to time restriction, we were not
able to perform a complete grid search over all hyperparameters. Our best model on the
training dataset receives as input n-grams (with n varying from 1 to 3) that appear at
least in the 1% of all documents (min document frequency) and with no restriction on
the maximum document frequency. Finally, `1 normalization was performed over the
n-grams vectors.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Network Architecture and Training</title>
        <p>
          A siamese network can be seen generally as a comparison network, its main
components are a set of identical subnetworks and a final layer to contrast the subnetworks
outputs [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. The main idea behind this architecture is to extract a set of features from
each input using a subnetwork and in the same way train another network with the aim
to compare the two outputs of the subnetworks.
        </p>
        <p>Figure 1 shows the network architecture. The architecture has only one extracting
subnetwork which receives each input n-gram vector separately from each document to
compare. The first layers of the extracting subnetwork are Batch normalization (BN),
Gaussian Noise (GN), and Dropout (DP). These layers were set at the input with the
aim to speed up the training and achieve a better validation score.</p>
        <p>
          Then, a residual network converts the input into a feature space of 512 dimensions.
The first dense layer inside the residual network acts as an adapter, it allows to connect
the input to the residual network. This residual network has a depth of 8 and all the
internal layers have a feature space of 512. Some differences from the ResNet [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] are
the connections inside the residual network which perform a subtraction instead of an
addition, and the use of dense layers instead of convolutional layers. Finally, the
classifier network (Dense network with binary output) takes the absolute difference of both
subnetworks outputs and performs a binary classification with a 3 layers dense network.
In order to speed up training, we also use BN between the dense layers of the
classifier network. We use the elu [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] activation, and Radam [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] optimization to train the
complete Siamese network.
        </p>
        <p>We trained two models, one with the small dataset and the other with the large
one. In order to speed up the n-gram extraction process, we only used the first 10000
(ten thousand) characters from each document. Both datasets were split on training and
validation with 70 and 30 percent of samples respectively. To train the network we used
the default value of the alpha learning rate which is 10 3 and the n-grams range was set
from 1 to 3. Table 2 shows the examined hyperparameters of the complete architecture.
For our final submission, we used the parameters that achieved the best classification
performance in terms of AUC on 30% of both large and small training datasets. The
final hyperparameters are shown in Table 2 in bold.</p>
        <p>Both models were trained on a 2070super GPU with 32bits configuration, 64GB
RAM. The training time was around 6 and 14 hours on the small and large datasets
respectively. We used the pipelines of TensorFlow5.
5 https://www.tensorflow.org
We trained a model for each dataset (large and small). Figure 2 shows the loss and
validation loss obtained by training. We set the same hyperparameters for training,
preprocessing, and architecture in both models. Figure 2a (Large dataset model) is smoother
due to the amount of data available for training the model. It can be observed that the
model trained on the large dataset converges faster.</p>
        <p>(a) Large dataset
(b) Small dataset</p>
        <p>Table 3 shows the validation scores of the models trained on the small and large
datasets. Both datasets were split in Training and Validation following a 70%-30%
proportion. In this way, we evaluated two times each model, first on the same dataset but
with different examples ( 30% validation set extracted from the same dataset), second,
with a different dataset (30% validation set comes from the other dataset). In this sense,
we obtain four validation scores from the two models and the two validation sets. It can
be observed that the AUC score is 1.0 when the Training and Validation sets come from
the same dataset. On the other hand, the small dataset model achieves only 0.823 AUC
on the validation set that comes from the large training set. We believe that the reason
for this behavior is that the same document can appear in both training and validation
sets because we only divided the document pairs samples.</p>
        <p>Finally, Table 4 shows the final results of the PAN 2020 authorship verification
task evaluated on the TIRA platform [21]. It can be observed, that the overall score
shows that the small dataset model achieves better results than the large dataset model,
even though the performance of both models are very similar, for example, the AUC
difference between both results is 0.015.
In this paper, we presented a siamese network approach to solve the PAN 2020
authorship verification problem. The network receives document pairs as character n-grams
(with n varying from 1 to 3) as input and learns to identify if these documents are
written by the same author. The network hyperparameters were adjusted on the two
datasets provided by the task organizers. The final configuration submitted to the
conference achieved the best performance during training time on both, large and small
datasets. Our experiments showed that both models achieve similar results when
training on different data and amount of examples.</p>
        <p>Even while using as few as 19% of the size of the large dataset, the model trained
on the small dataset achieved similar results on the validation set. On the other hand,
the AUC score reached 1.0 at validation if the training and validation sets come from
the same dataset. In the case of PAN 2020 test dataset, the small dataset model
outperformed the large dataset model but only by 0.015 of AUC score. The validation of
the small dataset model on the large validation set achieves a score near to the score
obtained on the PAN 2020 test dataset.</p>
        <p>We demonstrated that this architecture can achieve good results at the PAN 2020
verification task, even when trained on a small dataset and using only 10000 characters
per document. Besides, our approach does not implement score corrections and we only
tune our models with the AUC score as reference. As future research directions, we
plan to perform a better tuning of the described hyperparameters using a genetic-based
parameter tuning [22].</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Aknowledgements</title>
      <p>This work has been partially supported by PAPIIT-UNAM projects TA100520.
19. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M.,
Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D.,
Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: Machine learning in Python. Journal
of Machine Learning Research 12, 2825–2830 (2011)
20. Potha, N., Stamatatos, E.: Improving author verification based on topic modeling (2019)
21. Potthast, M., Gollub, T., Wiegmann, M., Stein, B.: TIRA Integrated Research Architecture.</p>
      <p>In: Ferro, N., Peters, C. (eds.) Information Retrieval Evaluation in a Changing World.</p>
      <p>Springer (Sep 2019)
22. Sanchez-Perez, M.A., Gelbukh, A., Sidorov, G., Gómez-Adorno, H.: Plagiarism detection
with genetic-based parameter tuning. International Journal of Pattern Recognition and
Artificial Intelligence 32(01), 1860006 (2018)
23. Teahan, W.J., Harper, D.J.: Using compression-based language models for text
categorization (2003)
24. Zaharia, A.: 300+ terrifying cybercrime and cybersecurity statistics &amp; trends [2020 edition]
(2020),
https://www.comparitech.com/vpn/cybersecurity-cyber-crime-statistics-facts-trends/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Organization</surname>
          </string-name>
          (
          <year>2020</year>
          ), https://pan.webis.de/
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Project gutenberg (
          <year>2020</year>
          ), www.
          <source>gutenberg.org. urldate = 2020-06-29</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Abbasi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
          </string-name>
          , H.:
          <article-title>Applying authorship analysis to extremist-group web forum messages</article-title>
          .
          <source>In: Homeland Security</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bromley</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guyon</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , LeCun,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Sickinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Shah</surname>
          </string-name>
          , R.:
          <article-title>Signature verification using a "siamese" time delay neural network</article-title>
          . In: American Telephone and Telegraph Company papers (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Clevert</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Unterthiner</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hochreiter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Fast and accurate deep network learning exponential linear units (elus) (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>He</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ren</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
          </string-name>
          , J.:
          <article-title>Deep residual learning for image recognition (</article-title>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Hochreiter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidhuber</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Long short-term memory (</article-title>
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jasim-Basim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Author attribution in suicide notes: Evidence from applied linguistics</article-title>
          .
          <source>In: Comparative Legilinguistics</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Juola</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Especial Problems of Linguistic Forensics (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kestemont</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manjavacas</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bevendorff</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiegmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamatatos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Overview of the Cross-Domain Authorship Verification Task at PAN 2020</article-title>
          . In: Cappellato,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Eickhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Névéol</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (eds.)
          <article-title>CLEF 2020 Labs and Workshops, Notebook Papers</article-title>
          .
          <source>CEUR-WS.org (Sep</source>
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kestemont</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamatatos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manjavacas</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Daelemans</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Overview of the cross-domain authorship attribution task at pan</article-title>
          <year>2019</year>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Kestemont</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tschuggnall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamatatos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Daelemans</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Specht</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Overview of the Author Identification Task at PAN-2018 cross-domain authorship attribution and style change detection (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Koppel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winter</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Determining if two documents are written by the same author</article-title>
          .
          <source>In: Journal of the American society for information science and technology (</source>
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>LeCun</surname>
          </string-name>
          , Y.,
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
          </string-name>
          , G.:
          <article-title>Deep learning (</article-title>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>LeCun</surname>
          </string-name>
          , Y.,
          <string-name>
            <surname>Boser</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denker</surname>
            ,
            <given-names>J.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henderson</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Howard</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hubbard</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jackel</surname>
          </string-name>
          , L.D.:
          <article-title>Backpropagation applied to handwritten zip code recognition</article-title>
          .
          <source>In: Neural computation</source>
          (
          <year>1989</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Pengcheng</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.C.</given-names>
            ,
            <surname>Xiaodong</surname>
          </string-name>
          <string-name>
            <surname>Liu</surname>
          </string-name>
          , J.G., Han,
          <string-name>
            <surname>J</surname>
          </string-name>
          .:
          <article-title>On the variance of the adaptive learning rate and beyond (</article-title>
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Lopez-Velasco</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Verificacíon de autoría en textos mediante redes neuronales convolucionales y recurrentes (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Martín-Del-Campo-Rodríguez</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gómez-Adorno</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sidorov</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batyrshin</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Cic-gil approach to cross-domain authorship attribution: Notebook for pan at clef 2018</article-title>
          . In: PAN working notes (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>