<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>More Data and New Tools. Advances in Parsing the Index Thomisticus Treebank</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Federica Gamba</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Passarotti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Rufolo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CIRCSE Research Centre - Università Cattolica del Sacro Cuore</institution>
          ,
          <addr-line>Largo A. Gemelli 1 - 20123 Milan -</addr-line>
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Istituto Universitario di Studi Superiori (IUSS)</institution>
          ,
          <addr-line>Palazzo del Broletto, Piazza della Vittoria 15, 27100 Pavia -</addr-line>
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <fpage>108</fpage>
      <lpage>122</lpage>
      <abstract>
        <p>This paper investigates the recent advances in parsing the Index Thomisticus Treebank, which encompasses Medieval Latin texts by Thomas Aquinas. The research focuses on two types of variables. On the one hand, it examines the impact that a larger dataset has on the results of parsing; on the other hand, performances of new parsers are analysed with respect to less recent tools. Term of comparison to determine the efective parsing advances are the results in parsing the Index Thomisticus Treebank described in a previous work. First, the best performing parser among those concerned in that study is tested on a larger dataset than the one originally used. Then, some parser combinations that were developed in the same study are evaluated as well, assessing that more training data result in more accurate performances. Finally, to examine the impact that newly available tools have on parsing results, we train, test, and evaluate two neural parsers chosen among those best performing in the CoNLL 2018 Shared Task. Our experiments reach the highest accuracy rates achieved so far in automatic syntactic parsing of the Index Thomisticus Treebank and of Latin overall.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;dependency parsing</kwd>
        <kwd>Latin</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        the overall accuracy rates of diferent parsers, as they tend to provide higher accuracy rates
on those texts that resemble the specific textual variety they were trained on (cf. [
        <xref ref-type="bibr" rid="ref23">22</xref>
        ]).
      </p>
      <p>
        This paper describes a study aimed to improve the performances of automatic dependency
parsing for the IT-TB in its native annotation scheme, taking as a benchmark the research
described by Ponti and Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ], who, after testing diferent parsers, individuated DeSR
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] as the best performing one. After building a new feature model for DeSR specifically suited
for the IT-TB, Ponti and Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ] applied a post-processing combination technique and
showed that combining parsers using diferent types of algorithms returned better parsing
results than plain DeSR.
      </p>
      <p>
        Recent years have seen many steps forward for what concerns both the size and the type of
the available linguistic resources for Latin, as well as the performances of probabilistic tools
for natural language processing (NLP) purposes. As for the IT-TB, the size of the treebank
has grown remarkably since the study of Ponti and Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ], thus making it possible
to evaluate what impact a larger training set can have on the performances of probabilistic
tools in parsing the IT-TB. As for the NLP tools, across the very last years new techniques
and tools have been developed that exploit the ever growing amount of available training data,
thus making it possible to prove whether the most recent tools turn out to be more efficient
than less recent ones. This paper presents the results obtained by investigating the impact
that these two variables, namely a larger set of training data and new NLP tools, have on
parsing the IT-TB.
      </p>
      <p>The paper is organised as follows. Section 2 presents an overview of relevant related studies.
In Section 3 the data are presented. Section 4 focuses on the re-evaluation of DeSR
performances on the new (larger) training set of the IT-TB. Section 5 explores the impact of using
more training data on the accuracy rates of two combinations of parsers. Section 6 reports
the performances of two neural parsers (namely, TurkuNLP and ICS-PAS). In Section 7, we
present the results provided by three combinations of DeSR with the two neural parsers. In
Section 8, an in-depth evaluation of the results is performed and discussed. Finally, Section 9
concludes the paper.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        As mentioned, five treebanks are currently available for Latin. Beside the IT-TB, the other
Latin treebanks (all dependency-based) are the following: the PROIEL treebank [
        <xref ref-type="bibr" rid="ref16">15</xref>
        ], the Latin
Dependency Treebank (part of the Ancient Greek and Latin Treebank) [
        <xref ref-type="bibr" rid="ref4">2</xref>
        ], the Late Latin
Charter Treebank [
        <xref ref-type="bibr" rid="ref11">10</xref>
        ] and the UDante treebank [
        <xref ref-type="bibr" rid="ref10">9</xref>
        ]. All the Latin treebanks are annotated
both according to their native scheme and to the Universal Dependencies one (UD) [
        <xref ref-type="bibr" rid="ref20">19</xref>
        ], except
for the UDante treebank, which is available only in the UD scheme.
      </p>
      <p>
        With respect to parsing the IT-TB, the above-mentioned study by Ponti and Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ],
which we here take as a benchmark, is preceded by other relevant works in the field. In 2010
Passarotti and Rufolo [
        <xref ref-type="bibr" rid="ref23">22</xref>
        ] trained and tested a number of probabilistic dependency parsers,
by using data from both the IT-TB and the Latin Dependency Treebank (LDT). In the same
year, Passarotti and Dell’Orletta [
        <xref ref-type="bibr" rid="ref22">21</xref>
        ] employed DeSR to parse the IT-TB. They delineated an
ad-hoc configuration of DeSR features so as to adapt the parser to the specific processing of
Medieval Latin and improve accuracy rates. They also defined a revision parsing method and
combined the outputs of diferent algorithms.
      </p>
      <p>
        However, the most recent study on parsing the IT-TB is the one carried out by Ponti and
Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ]. In particular, for what concerns DeSR, the best results are achieved when
the tool exploits a multilayer perceptron (MLP) algorithm, a reversed direction of the parsing
transition (right-to-left) and a specifically-tuned settings: the best Labeled Attachment Score
reported (LAS) is 83.14 and the highest Unlabeled Attachment Score (UAS) is 88.46 [
        <xref ref-type="bibr" rid="ref9">7</xref>
        ].
Regarding the best performing combination of parsers, referred to as C4 in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ], the best
results are 86.5 in LAS and 90.97 in UAS. The results obtained through combination already
represent an improvement with respect to [
        <xref ref-type="bibr" rid="ref22">21</xref>
        ], which were the state of the art in parsing
Medieval Latin before [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ].
      </p>
      <p>
        Thanks to the availability of the UD treebanks for Latin, the CoNLL Shared Task 2018 on
Multilingual Parsing from Raw Text to Universal Dependencies [
        <xref ref-type="bibr" rid="ref29">28</xref>
        ] provided results also for
Latin. The tool that proved to perform best on Latin is HIT-SCIR [
        <xref ref-type="bibr" rid="ref12">11</xref>
        ], which ranked highest
among all the participants, both in terms of LAS and UAS, for all the Latin treebanks. In
particular, it obtained a 87.08 LAS and a 89.31 UAS on the IT-TB, a 73.61 LAS and a 77.62
UAS on the PROIEL treebank, and a 72.63 LAS and a 80.47 UAS on the Latin Dependency
Treebank.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Data</title>
      <p>
        The data used in the experiments consist in the latest release of the IT-TB in its native
annotation style [
        <xref ref-type="bibr" rid="ref5">3</xref>
        ], which resembles that of the analytical layer of the Prague Dependency
Treebank for Czech.2 This version of the treebank features the entire Summa contra Gentiles
(four books) and some excerpts from Summa theologiae and Scriptum super Sententiis Petri
Lombardi selected as part of the concordances of lemma forma ‘form’. Such release of the IT-TB
makes available more data than the versions used as data sources for previous experiments in
dependency parsing for Latin. In particular, with respect to [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ] the missing part of the third
book and the entire fourth book of Summa contra Gentiles are now included in the dataset,
corresponding to 11,881 additional sentences and 193,422 additional tokens. For practical
reasons, we define T2 the enlarged dataset and T1 the dataset used in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ].
      </p>
      <p>For evaluation purposes, the treebank is split in a training set and a test set with a ratio
of about 9:1. Table 1 illustrates the size of the T2 training and test sets resulting from such
partition. When required for the training phase, a development set with the same size of the
test set is excerpted from the training data.</p>
      <p>1The LLCT and the UDante treebanks were not used in the the CoNLL Shared Task 2018, since they
have been made available in the UD repository since release v2.6 (May 15th 2020) and v2.8 (May 15th 2021)
respectively.</p>
      <p>2https://ufal.mff.cuni.cz/pdt2.0/doc/manuals/en/a-layer/html/index.html.</p>
    </sec>
    <sec id="sec-4">
      <title>4. DeSR Evaluation</title>
      <p>
        After subsetting the dataset in training set and test set, we replicate the first part of the
experiments performed by Ponti and Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ]. We evaluate the accuracy rates of the
dependency parser DeSR when trained on the new dataset, yet preserving the algorithms and
the feature model defined in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ]. In this way, the only variable to be evaluated is the extended
size of the training and test sets, as all the others remain the same, thus allowing to assess the
impact of a larger amount of training data on the accuracy rates of the parser.
      </p>
      <p>
        DeSR [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is a shift-reduce parser which in its basic settings exploits an MLP algorithm and
performs a left-to-right transition while parsing. We make use of the same version of DeSR
used in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ] (v. 1.4.3).
      </p>
      <p>
        First, the performance of DeSR with its basic settings is evaluated. Secondly, we reverse the
direction of transition from standard left-to-right to right-to-left, keeping the same MLP
algorithm. Thirdly, the MLP algorithm is replaced by a support vector machine (SVM) algorithm,
while the transition direction is maintained right-to-left, like in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ].
      </p>
      <p>To sum up, three diferent settings of DeSR are trained and tested:
• MLP algorithm, left-to-right (MLP, l);
• MLP algorithm, right-to-left (MLP, r);
• SVM algorithm, right-to-left (SVM, r).</p>
    </sec>
    <sec id="sec-5">
      <title>5. DeSR Combination</title>
      <p>
        The following step in replicating the experiment of [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ] concerns the combination of diferent
parsers.
      </p>
      <p>
        After examining the outputs of single parsers, Ponti and Passarotti [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ] employed a
postprocessing technique that combines the outputs of diferent (types of) parsers. Such technique
exploits the benefits of combination, following the assumption that the mutual diference
between parsers promises to improve the final accuracy rates. For the purposes of combination,
an algorithm based on unweighted voting was used [
        <xref ref-type="bibr" rid="ref28">27</xref>
        ]. In our work, we replicate the
experiments run on the combinations named respectively C3 and C4 in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ], chosen as those that
reach the highest accuracy rates among the ones concerned. Both C3 and C4 combine outputs
produced by diferent settings of DeSR with outputs from other types of parsers, namely:
• MTGB: a graph-based parser from the MATE-tools collection [
        <xref ref-type="bibr" rid="ref6">4</xref>
        ], in its latest version
(anna-3.61);
• Joint: a shift-reduce parser, part of the MATE-tools collection and developed by Bohnet
et al. [
        <xref ref-type="bibr" rid="ref7">5</xref>
        ].
      </p>
      <sec id="sec-5-1">
        <title>The structure of the combinations C3 and C4 is the following:</title>
        <p>• C3: DeSR (MLP, r) + DeSR (MLP, l) + Joint + MTGB;
• C4: DeSR (MLP, r) + DeSR (SVM, r) + DeSR (MLP, l) + Joint + MTGB.</p>
        <p>The results of the application of the post-processing combination technique on both T1 and
T2 are shown in Table 3. Both for C3 and for C4, the larger dataset proves to lead to higher
accuracy rates. In particular, the C4 combination results in the highest gap between T1 and
T2 both for LAS (T1: 86.50, T2: 87.37, improvement: +0.87) and UAS (T1: 90.97, T2:
91.56, improvement: +0.59). These results will serve as a baseline for the next combination
experiments.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. New Tools</title>
      <p>
        After examining the impact that a larger training/test set has on the accuracy rates of parsing
the IT-TB, we focus on the second variable taken into account in our work, namely training and
testing NLP tools of diferent (and more recent) type than those used by Ponti and Passarotti
[
        <xref ref-type="bibr" rid="ref24">23</xref>
        ]. To determine which parsers to consider, we refer to the CoNLL Shared Task 2018 on
Multilingual Parsing from Raw Text to Universal Dependencies [
        <xref ref-type="bibr" rid="ref29">28</xref>
        ]. Among the systems that
took part in the Shared Task, we select the two (both neural) parsers that ranked highest with
respect to Latin, and in particular to the IT-TB:
• TurkuNLP: end-to-end full neural parsing pipeline, developed by Kanerva et al. [
        <xref ref-type="bibr" rid="ref17">16</xref>
        ];
• ICS-PAS: a semi-supervised neural system developed in Warsaw by Rybak and Wróblewska
[
        <xref ref-type="bibr" rid="ref25">24</xref>
        ].
      </p>
      <p>
        The two neural parsers are run on the same dataset on which DeSR was trained and tested
in Sections 3 and 4, in order to evaluate the specific impact that neural parsing has on the
accuracy rates.
6.1. TurkuNLP
TurkuNLP [
        <xref ref-type="bibr" rid="ref17">16</xref>
        ] is a neural pipeline that performs four tasks: segmentation, morphological
tagging, parsing and lemmatisation.
      </p>
      <p>
        Lemmatisation is carried out thanks to a novel approach that exploits the OpenNMT neural
machine toolkit [
        <xref ref-type="bibr" rid="ref18">17</xref>
        ]. As for parsing, the tool is based on Stanford’s parser by Dozat, Qi, and
Manning [
        <xref ref-type="bibr" rid="ref14">13</xref>
        ], which ranked highest in the CoNLL Shared Task 2017 on Multilingual
Parsing from Raw Text to Universal Dependencies [
        <xref ref-type="bibr" rid="ref30">29</xref>
        ]. First, a word encoder embeds tokens by
summing together a set of learned token embeddings, pretrained token embeddings, and token
embeddings encoded from the sequence of its characters by using unidirectional LSTM. Then,
token embeddings are embedded with Part-of-Speech embeddings as well. Afterwards,
representations of tokens in context are created, building relations and attachments in dependency
trees. See [
        <xref ref-type="bibr" rid="ref13">12</xref>
        ] for further details.
      </p>
      <p>
        We begin by training a new model of TurkuNLP on the IT-TB extended dataset (T2). To
this end, we first employ the pre-trained word embeddings for Latin, published by Facebook
and developed with the fastText tool [
        <xref ref-type="bibr" rid="ref8">6</xref>
        ]. We both test the embeddings trained on Wikipedia3
[
        <xref ref-type="bibr" rid="ref8">6</xref>
        ] and their newer version [
        <xref ref-type="bibr" rid="ref15">14</xref>
        ], trained on Wikipedia and Common Crawl.4 Afterwards,
we develop another model on T2 by using our own embeddings for the Index Thomisticus
(IT), which we create with fastText (default settings) [
        <xref ref-type="bibr" rid="ref8">6</xref>
        ] from the opera omnia of Thomas
Aquinas provided by the IT corpus [8]. We build two kinds of IT embeddings: (1) token-based
embeddings (stored in a one-token-per-line format); (2) sentence-based embeddings (stored in
a one-sentence-per-line format).
      </p>
      <p>All the trained models are then evaluated with respect to the test set described in Section
3. Table 4 shows the results (LAS, UAS) obtained by TurkuNLP in comparison to the best
performing settings of DeSR and to the best performing combination pipeline (C4).</p>
      <p>
        The models that exploit the Facebook embeddings and the IT sentence-based embeddings
prove to perform best, with the IT sentence-based embeddings (LAS: 82.7, UAS: 85.9)
outperforming their token-based counterpart (LAS: 82.1, UAS: 85.5). However, as it clearly emerges
from Table 4, TurkuNLP proves to obtain significantly lower accuracy rates than both DeSR
(MLP, r and SVM, r) and C4, especially in terms of UAS.
ICS-PAS [
        <xref ref-type="bibr" rid="ref25">24</xref>
        ] is a neural system consisting of a jointly trained tagger, lemmatiser, and
dependency parser. A cross-entropy loss function predicts the output dependency tree. To avoid
3https://github.com/facebookresearch/fastText/blob/master/docs/pretrained-vectors.md.
4https://fasttext.cc/docs/en/crawl-vectors.html.
cycles in predictions, a ‘cycle-penalty’ loss function is used. During both phases of arc
prediction and label prediction, head and dependent are represented as vectors. See [
        <xref ref-type="bibr" rid="ref25">24</xref>
        ] for further
details.
      </p>
      <p>
        We evaluate ICS-PAS in the same manner as TurkuNLP. We thus begin by training a model
on the extended IT-TB dataset (T2), employing the pre-trained fastText word embeddings for
Latin [
        <xref ref-type="bibr" rid="ref8">6</xref>
        ] and their newer version [
        <xref ref-type="bibr" rid="ref15">14</xref>
        ]. Two further models are then trained on T2, by using
respectively the token- and sentence-based embeddings of the IT-TB described in Section 6.1.
      </p>
      <p>Table 5 shows the results (LAS, UAS) obtained by testing ICS-PAS with our trained models.
ICS-PAS accuracy rates are displayed together with the accuracy rates provided by the best
performing settings of DeSR and the best performing combination pipeline.</p>
      <p>As illustrated in Table 5, ICS-PAS and TurkuNLP obtain extremely similar accuracy rates,
with TurkuNLP slightly outperforming ICS-PAS by some tenths of percent. The best settings
of DeSR and the C4 combination still provide better parsing performances.</p>
    </sec>
    <sec id="sec-7">
      <title>7. A New Combination</title>
      <p>
        As mentioned in Section 5, mutual diference between parsers can represent a concrete way to
improve their performances. The two parsers we selected from the CoNLL Shared Task 2018
[
        <xref ref-type="bibr" rid="ref29">28</xref>
        ] difer substantially from DeSR, particularly in their choice to employ embeddings and
implement neural systems. Such a sizeable diference between the parsers raises a question
about the performances they could reach if combined together. To answer such question,
we evaluate three diferent combinations of DeSR together with TurkuNLP and ICS-PAS, by
applying the same algorithm for unweighted voting used in the experiment described in Section
5:
• CombA: DeSR (MLP, r) + DeSR (SVM, r) + DeSR (MLP, l) + ICS-PAS;
• CombB: DeSR (MLP, r) +
      </p>
      <p>NLP;</p>
      <sec id="sec-7-1">
        <title>DeSR (SVM, r) +</title>
      </sec>
      <sec id="sec-7-2">
        <title>DeSR (MLP, l) +</title>
      </sec>
      <sec id="sec-7-3">
        <title>Turku</title>
        <p>• CombC: DeSR (MLP, r) + DeSR (SVM, r) + DeSR (MLP, l) + ICS-PAS + TurkuNLP.</p>
        <p>As for TurkuNLP and ICS-PAS, we include in the combinations the outputs obtained with
the IT sentence-based embeddings, as they proved to achieve better performances than the
token-based ones.</p>
        <p>
          Table 6 reports the accuracy rates, in terms of LAS and UAS, provided by the three
combinations. Results obtained by C4, the best performing combination among the ones proposed
in [
          <xref ref-type="bibr" rid="ref24">23</xref>
          ], are displayed as well for comparison purposes.
        </p>
        <p>The results in Table 6 show how the combinations that include DeSR and, respectively,
ICS-PAS (CombA) and TurkuNLP (CombB) outperform the performances provided by the
two tools alone (see Subsections 6.1 and 6.2). In particular, CombA reaches 87.65 of LAS and
91.66 of UAS, while plain ICS-PAS exploiting IT sentence-based embeddings obtains 82.2 of
LAS and 85.7 of UAS. As for CombB, it obtains a LAS of 87.72 and a UAS of 91.72, while plain
TurkuNLP, using IT sentence-based embeddings, reaches a LAS of 82.7 and a UAS of 85.9.
The gap is very remarkable, being around +5 in terms of LAS and around +6 in terms of UAS.
Yet, a further, even more remarkable improvement is provided by CombC, that combines the
three diferent DeSR settings with both ICS-PAS and TurkuNLP. While CombA and CombB
achieve accuracy rates similar, although slightly higher, to the ones of C4, CombC outperforms
CombA and CombB by almost 2 points in LAS (89.44) and by more than 1 in UAS (92.85).
These results outperform of more than 2 points also those provided by the HIT-SCIR parser
in the 2018 CoNLL Shared Task, reported here in Section 2 (LAS: 87.08, UAS: 89.31).</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>8. In-depth Evaluation</title>
      <p>
        Given the remarkable improvement of the quality of parsing obtained by combining diferent
parsers, we examine the specific contribution that they provide in the combination. We present
here an in-depth evaluation of the results achieved on the T2 test set, by focusing on a number
of relevant dependency relations and examining the parser-specific performances on them. 5
5The in-depth evaluation is performed by using the MaltEval evaluation tool for dependency parsers [
        <xref ref-type="bibr" rid="ref19">18</xref>
        ],
available at http://www.maltparser.org/malteval.html.
Deprel
Adv
Atr
Atr_Co
Atv
AtvV
AuxC
AuxP
AuxZ
Coord
Obj
Obj_Co
Pnom
Pred
Pred_Co
Sb
Sb_Co
      </p>
      <p>As highlighted by the results reported in Table 7, TurkuNLP turns out to perform best on
most of the selected relations. Only in few cases (attributes: Atr; verbal attributes: Atv and
AtvV; main predicates: Pred; coordinated main predicates: Pred_Co), it is outperformed by
other parsers - mostly by ICS-PAS. Such remark does not match with what observed in
Subsection 6.1, where TurkuNLP did not obtain high results with respect to the other parsers. A
deeper analysis of its performances, though, shows the main reason of such potential
discrepancy. In fact, TurkuNLP fails to handle the terminal punctuation of sentences (dependency
relation: AuxK). While the parser assigns the correct relation to terminal punctuation, it
always fails to select the right head. Specifically, TurkuNLP scores a 0.00 LAS with respect
to terminal punctuation, whereas DeSR performs excellently (LAS between 98.6 and 100),
regardless of the adopted configuration. ICS-PAS behaves similarly to TurkuNLP, obtaining
a 0.8 LAS with respect to terminal punctuation.</p>
      <p>Not surprisingly, the main predicates of sentences (Pred) are the most easily recognised
relation (also when they appear in coordinated constructions: Pred_Co), as they concern nodes
that do not depend on another node, but on the root of the tree (represented by a technical
node assigned relation AuxS). Conversely, the treatment of coordinated constructions
represents an issue, as usual in dependency parsing. All parsers included in CombC provide quite
low accuracy rates for what concerns coordinated dependency relations, namely coordinated
attributes (Atr_Co), objects (Obj_Co) and subjects (Sb_Co). Another tricky relation is
represented by verbal attributes not participating in verb government (Atv and AtvV), which
prove to be difficult for all parsers.</p>
      <p>After the main predicates, attributes (Atr) are the second best-handled relation. The LAS for
adverbials (Adv), subjects (Sb) and objects (Obj) are still high and very similar to each other
(around 90.0 in CombC). Predicate nominals (Pnom), instead, seem to be more difficult to
recognise (CombC: 87.6) and their LAS shows remarkable diferences between parsers, ranging
from 78.3 (DeSR (MLP, l)) to 86.6 (TurkuNLP).</p>
      <p>With respect to subordinating conjunctions (AuxC), prepositions (AuxP) and coordinating
nodes (Coord), no parser obtains high results, although all words that are assigned these
relations belong to closed lexical classes, which should make them easier to spot and parse.
The syntactic ambiguity of some of these words could play a role in such trend. Consider,
for instance, the following: (a) cum, which can be both a subordinating conjunction (AuxC)
and a preposition with meaning ‘with’ (AuxP); (b) nec, which can syntactically behave like a
coordination meaning ‘and not’ (Coord) or an emphasising word meaning ‘not even’ (AuxZ);
(c) et, which can have the syntactic function of a coordination meaning ‘and’ (Coord) or of an
emphasising word meaning ‘also’ (AuxZ).</p>
      <p>The parser-based in-depth evaluation above shows the added value of combining diferent
tools, as a means to efficiently exploit the parser-specific contribution to achieve a substantial
improvement of the accuracy rates. To give an example of the specific contribution provided by
the single parsers to their combination, Figure 1 shows the dependency trees produced by five
parsers and by their combination for the following sentence taken from the IT-TB: Ergo licet
aliquid de forma subtrahere ‘Therefore, it is permitted to leave something out of the form’.6</p>
      <p>From the top to the bottom, Figure 1 lists the Gold Standard and the outputs predicted
respectively by th CombC combination (as the best performing one: see Table 6) and by the
following parsers: DeSR (MLP, l), DeSR (MLP, r), DeSR (SVM, r), ICS-PAS and TurkuNLP.</p>
      <p>In Figure 1, correct dependency relations and labels are displayed in green, while in red are
the incorrect ones. Given an ordered set of parsing outputs, the combined output is built by
selecting the value proposed by the majority of parsers. For instance, ICS-PAS erroneously
attaches the token licet to ergo instead of the root, and assigns the AuxC relation to their
dependency. On its turn, TurkuNLP correctly individuates the root of the tree as the head
node for licet, but fails in labelling the relation (assigning AuxC instead of Pred). However,
CombC succeeds in predicting both the arc and the label for the dependency in question, by
choosing the output proposed by the majority of the parsers (namely, DeSR in all its three
configurations). The same can be observed with respect to the full stop at the end of the
sentence. Even though TurkuNLP and ICS-PAS fail to attach it to the correct head (the root
of the tree), in the prediction made by CombC the terminal full stop is made dependent on the
correct head node and is assigned the right relation (AuxK), thanks to the correct prediction
of DeSR.</p>
      <p>Moreover, from Figure 1 we can observe how the diferent types of parsers here concerned
(two neural ones vs DeSR) tend to make the same (or similar) mistakes. For instance,
terminal punctuation is attached to the wrong head by both the neural parsers (ICS-PAS and
TurkuNLP), which also fail in considering subtrahere an ellipsis (ExD) and licet a
subordinating conjunction (AuxC). The same errors are not made by any of the three configurations of
DeSR, which correctly analyse both licet and the terminal punctuation mark.</p>
    </sec>
    <sec id="sec-9">
      <title>9. Conclusion and Future Work</title>
      <p>
        In this paper we presented various experiments on automatic dependency parsing of the Index
Thomisticus Treebank. We began by replicating some of the experiments described in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ], in
order to evaluate how a larger dataset impacts parsing results. To this end, we first tested
diferent algorithms and settings of the parser DeSR (MLP left-to-right, MLP right-to-left,
SVM right-to-left). Then, we evaluated two combinations between the outputs of DeSR and
other parsers (C3, C4). Results show that the larger dataset improves the accuracy rates of
parsing with respect to those reported in [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ].
      </p>
      <p>We then trained and tested two recently available neural parsers, so as to assess how and
if such approach afects the accuracy rates. Although the two selected parsers (ICS-PAS and
TurkuNLP) had ranked highest in parsing the IT-TB at the CoNLL Shared Task 2018 on
Multilingual Parsing, in our experiment they provided lower accuracy rates than the most accurate
DeSR settings (MLP, right-to-left), and substantially lower rates than the C4 combination.</p>
      <p>Lastly, we applied a post-processing technique of combination to verify if and to what extent
combining together diferent types of parsers would result in higher accuracy results. The
combination that joins the outputs of three DeSR settings, ICS-PAS and TurkuNLP (CombC)
resulted in a substantial enhancement of parsing performances, in terms of both LAS (+2.07)
and UAS (+1.29).7</p>
      <p>In the near future, we plan to build and test a new set of sentence-/token-based embeddings
for the IT-TB by using specifically defined parameters, instead of the default ones. The
experiments on parsing the IT-TB described in this paper are just one piece of the much
larger picture of dependency parsing of the Latin language. Such picture features two main,
important variables.</p>
      <p>First, the high level of diversity of Latin texts, which are spread all over (what today is
called) Europe across a period of more than two millennia, heavily afects the diatopic and
diachronic portability of trained model of probabilistic NLP tools.</p>
      <p>Second, like most of the Latin treebanks, also the IT-TB is available both in its native
annotation style and in the UD one, which is nowadays a standard de facto in syntactic
(dependency) annotation.</p>
      <p>As for the former, although the results of the work presented in this paper are very promising
for the specific needs of the IT-TB project, they must be taken carefully when talking about
Latin parsing in general terms. Indeed, in the near future it will be necessary to test techniques
of domain-adaptation of the available trained models, in order to restrain the decrease of the
accuracy rates when models are applied to texts of a diferent era and/or genre than those of
their training set.</p>
      <p>
        As for the latter, there are several initiatives in support of parsing the UD treebanks, like
the UDPipe tool8 [
        <xref ref-type="bibr" rid="ref27">26</xref>
        ] and the various shared tasks on UD parsing at international conferences
like CoNLL and IWPT.9
      </p>
      <p>Finally, in the coming years, one edition of the EvaLatin evaluation campaign of the NLP
tools for Latin will include a task specifically devoted to syntactic dependency parsing. 10</p>
    </sec>
    <sec id="sec-10">
      <title>Acknowledgments</title>
      <p>This project has received funding from the European Research Council (ERC) under the
European Union’s Horizon 2020 research and innovation programme - Grant Agreement No 769994.
[8] R. Busa. “Index Thomisticus Sancti Thomae Aquinatis Operum Omnium Indices Et
Concordantiae in Quibus Verborum Omnium Et Singulorum Formae Et Lemmata Cum
Suis Frequentiis Et Contextibus Variis Modis Referuntur”. In: (1974).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>G. Attardi.</surname>
          </string-name>
          “
          <article-title>Experiments with a Multilanguage Non-Projective Dependency Parser”</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <source>In: Proceedings of the Tenth Conference on Computational Natural</source>
          Language
          <string-name>
            <surname>Learning (CoNLL-X)</surname>
          </string-name>
          .
          <year>2006</year>
          , pp.
          <fpage>166</fpage>
          -
          <lpage>170</lpage>
          .
          <article-title>7All models, datasets, outputs and scripts that either we used to perform the experiments described in this paper, or that result from them</article-title>
          , are openly available at https://github.com/CIRCSE/IT-TB_
          <article-title>Parsing</article-title>
          . 8https://ufal.mff.cuni.
          <article-title>cz/udpipe. 9See the webpage of the UD-related events</article-title>
          at https://universaldependencies.org/events.html.
          <article-title>The best performing system on the IT-TB at the CoNLL 2018 Shared Task (HIT-SCIR [11]) provided a LAS of 87</article-title>
          .08.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>The results of the competition are</article-title>
          available at http://universaldependencies.org/conll18/results-las.
          <article-title>html. 10Information on the first edition of EvaLatin, dedicated to lemmatisation and Part-of-Speech tagging</article-title>
          , can be found at https://circse.github.io/LT4HALA/EvaLatin.
          <article-title>An overview of the results of the evaluation campaign is provided by [25].</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bamman</surname>
          </string-name>
          and
          <string-name>
            <surname>G. Crane.</surname>
          </string-name>
          “
          <article-title>The Latin Dependency Treebank in a Cultural Heritage Digital Library”</article-title>
          .
          <source>In: Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH</source>
          <year>2007</year>
          ).
          <year>2007</year>
          , pp.
          <fpage>33</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bamman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Busa</surname>
          </string-name>
          , and
          <string-name>
            <surname>G. Crane.</surname>
          </string-name>
          “
          <article-title>The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin”</article-title>
          .
          <source>In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)</source>
          . Marrakech,
          <source>Morocco: European Language Resources Association (ELRA)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Bohnet</surname>
          </string-name>
          .
          <article-title>“Very High Accuracy and Fast Dependency Parsing is not a Contradiction”</article-title>
          .
          <source>In: Proceedings of the 23rd International Conference on Computational Linguistics (Coling</source>
          <year>2010</year>
          ). Beijing, China,
          <year>2010</year>
          , pp.
          <fpage>89</fpage>
          -
          <lpage>97</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Bohnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nivre</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Boguslavsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Farkas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ginter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Hajič</surname>
          </string-name>
          . “
          <article-title>Joint Morphological and Syntactic Analysis for Richly Inflected Languages”</article-title>
          .
          <source>In: Transactions of the Association for Computational Linguistics</source>
          <volume>1</volume>
          (
          <year>2013</year>
          ), pp.
          <fpage>415</fpage>
          -
          <lpage>428</lpage>
          . doi:
          <volume>10</volume>
          .1162/tacl\_a\ _
          <volume>00238</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bojanowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Grave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joulin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          . “
          <article-title>Enriching Word Vectors with Subword Information”</article-title>
          .
          <source>In: Transactions of the Association for Computational Linguistics</source>
          <volume>5</volume>
          (
          <year>2017</year>
          ), pp.
          <fpage>135</fpage>
          -
          <lpage>146</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Buchholz</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Marsi</surname>
          </string-name>
          . “
          <string-name>
            <surname>CoNLL-X Shared</surname>
          </string-name>
          <article-title>Task on Multilingual Dependency Parsing”</article-title>
          .
          <source>In: Proceedings of the Tenth Conference on Computational Natural</source>
          Language
          <string-name>
            <surname>Learning (CoNLL-X)</surname>
          </string-name>
          . New York City,
          <year>2006</year>
          , pp.
          <fpage>149</fpage>
          -
          <lpage>164</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Cecchini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          , G. Moretti, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          . “
          <article-title>UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works”</article-title>
          .
          <source>In: Seventh Italian Conference on Computational Linguistics. CEUR Workshop Proceedings</source>
          .
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Cecchini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Korkiakangas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          .
          <article-title>“A New Latin Treebank for Universal Dependencies: Charters between Ancient Latin and Romance Languages”</article-title>
          .
          <source>In: Proceedings of the 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>W.</given-names>
            <surname>Che</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zheng</surname>
          </string-name>
          , and T. Liu. “
          <article-title>Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2018</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Brussels, Belgium,
          <year>2018</year>
          , pp.
          <fpage>55</fpage>
          -
          <lpage>64</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Dozat</surname>
          </string-name>
          and
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          . “
          <article-title>Deep Biaffine Attention for Neural Dependency Parsing”</article-title>
          .
          <source>In: arXiv preprint arXiv:1611.01734</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>T.</given-names>
            <surname>Dozat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Qi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          . “
          <article-title>Stanford's Graph-based Neural Dependency Parser at the CoNLL 2017 Shared Task”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2017</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Vancouver, Canada,
          <year>2017</year>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>K17</fpage>
          -3002.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>E.</given-names>
            <surname>Grave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bojanowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joulin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          . “
          <article-title>Learning Word Vectors for 157 Languages”</article-title>
          .
          <source>In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ). Miyazaki,
          <source>Japan: European Language Resources Association (ELRA)</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>D. T.</given-names>
            <surname>Haug</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Jøhndal</surname>
          </string-name>
          . “
          <article-title>Creating a Parallel Treebank of the Old Indo-European Bible Translations”</article-title>
          .
          <source>In: Proceedings of the Second Workshop on Language Technology for Cultural Heritage Data (LaTeCH</source>
          <year>2008</year>
          ).
          <year>2008</year>
          , pp.
          <fpage>27</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kanerva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ginter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Miekka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Leino</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Salakoski</surname>
          </string-name>
          . “
          <article-title>Turku Neural Parser Pipeline: An End-to-End System for the CoNLL 2018 Shared Task”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2018</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Brussels, Belgium,
          <year>2018</year>
          , pp.
          <fpage>133</fpage>
          -
          <lpage>142</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>G.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Senellart</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Rush</surname>
          </string-name>
          . “OpenNMT:
          <article-title>Open-Source Toolkit for Neural Machine Translation”</article-title>
          .
          <source>In: Proceedings of ACL</source>
          <year>2017</year>
          ,
          <string-name>
            <given-names>System</given-names>
            <surname>Demonstrations</surname>
          </string-name>
          . Vancouver, Canada: Association for Computational Linguistics,
          <year>2017</year>
          , pp.
          <fpage>67</fpage>
          -
          <lpage>72</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Nilsson</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Nivre</surname>
          </string-name>
          . “
          <article-title>MaltEval: an Evaluation and Visualization Tool for Dependency Parsing”</article-title>
          .
          <source>In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)</source>
          . Marrakech,
          <source>Morocco: European Language Resources Association (ELRA)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Nivre</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-C. de Marnefe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Ginter</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Hajič</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Petrov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Pyysalo</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Silveira</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Tsarfaty</surname>
            , and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Zeman</surname>
          </string-name>
          . “
          <article-title>Universal Dependencies v1: A Multilingual Treebank Collection”</article-title>
          .
          <source>In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)</source>
          . Portorož,
          <source>Slovenia: European Language Resources Association (ELRA)</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1659</fpage>
          -
          <lpage>1666</lpage>
          . url: https://universaldependencies.org.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          . “
          <article-title>The Project of the Index Thomisticus Treebank”</article-title>
          .
          <source>In: Digital Classical Philology</source>
          <volume>10</volume>
          (
          <year>2019</year>
          ), pp.
          <fpage>299</fpage>
          -
          <lpage>320</lpage>
          . doi:
          <volume>10</volume>
          .1515/
          <fpage>9783110599572</fpage>
          -
          <lpage>017</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Dell</surname>
          </string-name>
          <article-title>'Orletta. “Improvements in Parsing the Index Thomisticus Treebank. Revision, Combination and a Feature Model for Medieval Latin”</article-title>
          .
          <source>In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)</source>
          . Valletta,
          <source>Malta: European Language Resources Association (ELRA)</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Rufolo</surname>
          </string-name>
          . “
          <article-title>Parsing the Index Thomisticus Treebank. Some Preliminary Results”</article-title>
          .
          <source>In: 15th International Colloquium on Latin Linguistics. Innsbrucker Beiträge zur Sprachwissenschaft</source>
          .
          <year>2010</year>
          , pp.
          <fpage>714</fpage>
          -
          <lpage>725</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Ponti</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          . “
          <article-title>Diferentia compositionem facit. A Slower-paced and Reliable Parser for Latin”</article-title>
          .
          <source>In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)</source>
          .
          <year>2016</year>
          , pp.
          <fpage>683</fpage>
          -
          <lpage>688</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>P.</given-names>
            <surname>Rybak</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Wróblewska</surname>
          </string-name>
          . “
          <article-title>Semi-Supervised Neural System for Tagging, Parsing and Lematization”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2018</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Brussels, Belgium,
          <year>2018</year>
          , pp.
          <fpage>45</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Cecchini</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Pellegrini</surname>
          </string-name>
          . “
          <article-title>Overview of the EvaLatin 2020 Evaluation Campaign”</article-title>
          .
          <source>In: Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages. Marseille, France: European Language Resources Association (ELRA)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>105</fpage>
          -
          <lpage>110</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>Straka</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Straková</surname>
          </string-name>
          . “Tokenizing, POS Tagging,
          <article-title>Lemmatizing and Parsing UD 2.0 with UDPipe”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2017</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Vancouver, Canada,
          <year>2017</year>
          , pp.
          <fpage>88</fpage>
          -
          <lpage>99</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>M.</given-names>
            <surname>Surdeanu</surname>
          </string-name>
          and
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          . “
          <article-title>Ensemble Models for Dependency Parsing: Cheap and Good?” In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics</article-title>
          . Los Angeles, California,
          <year>2010</year>
          , pp.
          <fpage>649</fpage>
          -
          <lpage>652</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zeman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hajič</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Popel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Straka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ginter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nivre</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Petrov</surname>
          </string-name>
          . “
          <article-title>CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2018</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Brussels, Belgium,
          <year>2018</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          . doi:
          <volume>10</volume>
          . 18653/v1/
          <fpage>K18</fpage>
          -2001.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zeman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Popel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Straka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hajic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nivre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ginter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Luotolahti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pyysalo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Petrov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Tyers</surname>
          </string-name>
          , E. Badmaeva,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gokirmak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nedoluzhko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cinkova</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          <article-title>Hajic jr</article-title>
          .,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hlavacova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kettnerová</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Uresova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kanerva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ojala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Missilä</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schuster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Reddy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Taji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Habash</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Leung</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-C. de Marnefe</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Sanguinetti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Simi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Kanayama</surname>
            , V. dePaiva,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Droganova</surname>
            ,
            <given-names>H.</given-names>
            Martínez Alonso, Ç. Çöltekin, U. Sulubacak, H.
          </string-name>
          <string-name>
            <surname>Uszkoreit</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Macketanz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Burchardt</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Marheinecke</surname>
            , G. Rehm,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Kayadelen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Attia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Elkahky</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Pitler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Lertpradit</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mandl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Kirchner</surname>
            ,
            <given-names>H. F.</given-names>
          </string-name>
          <string-name>
            <surname>Alcalde</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Strnadová</surname>
            , E. Banerjee,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Manurung</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Stella</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Shimada</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Kwak</surname>
            , G. Mendonca,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Lando</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Nitisaroj</surname>
            , and
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
          </string-name>
          . “
          <article-title>CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies”</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2017</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . Vancouver, Canada,
          <year>2017</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>