<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Extraction of Legal References from Court Decisions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dávid Varga</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Gojdič</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zoltán Szoplák</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Gurský</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Šimon Horvát</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stanislav Krajči</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ľubomír Antoni</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Computer Science, Faculty of Science, Pavol Jozef Šafárik University in Košice</institution>
          ,
          <addr-line>Jesenná 5, 040 01 Košice</addr-line>
          ,
          <country country="SK">Slovakia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The natural language processing of court decisions have become an interesting research issue worldwide. We present rulebased methods for automated extraction of legal references from Slovak court decisions. We focus on extracting references to laws and previous court decisions by using regular expressions and Levenshtein similarity measure. We created a dictionary of law names and their aliases, in order to be able to extract not only full names of laws from references, but also their generally used aliases. We annotated a set of court decisions for the evaluation of our proposed methods.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;extraction</kwd>
        <kwd>court decisions</kwd>
        <kwd>references</kwd>
        <kwd>legal acts</kwd>
        <kwd>hyperlinks</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>ITAT 2023 Information Technologies – Applications and Theory 2023,
S$epdteamvibde.vra2r2g–a2@6,s2t0u2d3e,nTta.utrpajsn.sskké(mD.aVtliaarrgea,)S;lovakia • extraction of references to exact cited laws and
martin.gojdic@student.upjs.sk (M. Gojdič); court decisions;
zoltan.szoplak@student.upjs.sk (Z. Szoplák); peter.gursky@upjs.sk • evaluation of results with a manually annotated
(P. Gurský); simon.horvat@upjs.sk (Š. Horvát); dataset.
stanislav.krajci@upjs.sk (S. Krajči); lubomir.antoni@upjs.sk
(Ľ. Antoni) This paper is organized into five sections, continuing
(M.0G00o0jd-0ič0)0;20-030107-60-080130-618(2D3.-V05a3r6ga(Z);.0S0z0o9p-0lá0k0)6;-02030701--02020025-4744-7390 from this point with Section 2, where we present state of
(P. Gurský); 0000-0002-3191-8469 (Š. Horvát); 0000-0001-5612-3534 the art of extracting references from legal texts. In
Sec(S. Krajči); 0000-0002-7526-8146 (Ľ. Antoni) tion 3, we describe all our rule-based methods, starting
© 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 with constructing our dictionary containing commonly
International (CC BY 4.0).</p>
      <p>CEUR Workshop Proceedings (CEUR-WS.org)
used abbreviations and aliases of laws and then describ- tent, or even making predictive tools. We also believe
ing the extraction methods. In Section 4, we evaluate our that extracting references from court decisions to other
methods on our golden annotated dataset, and we end legal documents will be more helpful than using them
this paper with a conclusion in Section 5. only as clickable hyperlinks in our future court decisions
browser.</p>
      <p>One of the most recent works [11] on legal reference
2. Related work extraction has been done on a German data set, where
the task was to identify references to law or other court
decisions within court decisions. They could not extract
specific documents that references pointed to; however,
this work can be helpful for us in the future. The authors’
highest-performing algorithm used BERT and had an
F1-score of around 98%. Our work presents rule-based
methods to extract specific references to legal acts and
other court decisions. Still, later in this work, we will
present our results where we have obtained a problem
with false positives. We see the positive results of
mentioned BERT-based model, and it is our inspiration to use
a large language model to identify the reference location
ifrst and lower the number of false positive examples.</p>
      <sec id="sec-1-1">
        <title>Extracting references from legal documents has already</title>
        <p>been studied in many works. Works stretch through
diferent legal systems, languages, and types of legal
documents. This chapter will focus on the most recent
works extracting references from legal documents.</p>
        <p>
          In 2015, a software framework called xLLx [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] was
created for court decisions from the Netherlands. This
software uses rule-based methods for extracting references
from court decisions, specifically a parsing expression
grammar (from now on PEG). They claim that a large
rule-based system has advantages over regular
expressions due to the better maintainability of PEG. In this
work, authors describe the recognition of aliases of full
names of laws within a court decision. These aliases are
called ’local aliases’ and are declared after a reference 3. Methods
with a full law name. In our work, we also focus on local
aliases and found them very helpful in identifying the This chapter will focus on our implemented algorithms,
law that the reference points to. which extract references to laws and other court
deci
        </p>
        <p>
          Authors in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] describe a NER tool, which uses enti- sions. However, we will first describe a dictionary where
ties from ontologies like LKIF [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] to describe legal con- keys are IDs of laws and values are abbreviations and
cepts. Using Word2Vec [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] as embeddings for English their short forms, which we call aliases. This dictionary
texts of court decisions from the European Court of Hu- is used to identify a specific law in a reference in the legal
man Rights and applying Support Vector Machine [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] acts extraction method.
classifier and simple neural network with one hidden
layer. They reached relatively high accuracy. However,
ifnal F-score measures were mostly below 50%, and the
discovered named entities did not reference the exact
articles of laws.
        </p>
        <p>Work [8] presents a newly created corpus of 350
annotated court decisions from the Czech Republic. The most
specific annotated type was the court decision type. This
means that annotated sub-types were the id of the court
decision, the name of the court, and the date on which
the decision was issued. Another annotated type was
a literature reference, precisely title, author, and other
possible information, such as the literature’s place, year,
and publisher. However, they intentionally omitted ref- Figure 1: Example of two keys and their multiple values from
erences to laws because they found them irrelevant to the dictionary of law IDs and aliases.
their broader inquiry.</p>
        <p>
          On the other hand, the president of Lexum company,
Ivan Mokanov, which is one of the founding members of
the Free Access to Law Movement [9] presented in his 3.1. Dictionary of laws and their aliases
web post [10] titled ’Good Old Hyperlinks,’ why
hyperlinks to specific law act and court decisions are still
relevant. Lexum provides all services for CanLII [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] website,
where citations are used for grouping similar documents,
creating citators, visualizing presentations of legal
conIn the Slovak Republic, there are currently more than 2000
laws that we have to consider and can be cited in court
decisions. The full names and their IDs are well-known
and obtainable from published laws. It is relatively easy
to identify the reference to a particular law if the law’s
full name or law ID is present in a text. We extracted the STEP 3: Law identification. First, we search for the
full names of laws and their IDs from HTML documents law ID between the alias declaration and the ’§’ sign. If
of laws from Slov-Lex [12] website, an oficial website of the law ID was not found, we tried to find one of the full
the Government of the Slovak Republic that publishes names of laws from our database in the mentioned text
laws online. of a reference.
        </p>
        <p>The problem in identifying law from a reference starts
when judges use aliases of law names in their decisions. We measure similarities between law names from
Until now, there has not been a database or a dictionary the database and the text of a reference with the
of frequently used aliases for law names, so we decided Levenshtein distance [13]. To obtain better results, we
to create our dictionary of law names and aliases used in lemmatized both full names of laws from our database
court decisions. Figure 1 shows an example of two laws, and the text from the reference because the Slovak
with law IDs of Civil court procedure and Criminal law. language uses diferent inflected forms of the exact</p>
        <p>
          The dictionary has been created by extracting local words. The lemmatization was done by a word form
aliases from Slovak court decisions similarly as described dictionary called Tvaroslovník [14], a database of word
in [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. In our dataset, local alias declarations were present forms with their lemmatized form.
after some references to laws. Typical reference to law Article numbers, sections, and letters, which specify
starts with the ’§’ sign followed by the article number the exact parts of laws, also created a problem with
matchand section number or letters. These local alias declara- ing the full names of laws. The solution was that only
tions are usually contained in parentheses where the text a chain of words from the reference that Tvaroslovnik
’ďalej len’ (meaning ’from now on’) and the text of the could lemmatize was used in the Levenshtein distance
alias are present. The critical observation is that there comparison.
is almost always a full name of the law before the local Then, the dictionary was manually cleaned from
unalias declaration, which we use to match extracted alias wanted results, e.g., when an abbreviation of some other
with the correct law ID. We show two examples of these term than a law name was extracted. After processing all
declarations of local aliases for two diferent laws in Fig- court decisions (cca 4 million) from our dataset, the final
ure 2, where full names of laws are highlighted in pink, dictionary contained 209 aliases that matched 111 laws.
local alias declarations are highlighted in lime color, and
aliases are written in bold format.
        </p>
        <sec id="sec-1-1-1">
          <title>3.2. Identifying law from a reference</title>
          <p>...Podľa §17 zákona zmenkového a šekového</p>
          <p>(ďalej len ZŠZ) ...</p>
          <p>STEP 2: Alias extraction. The alias was extracted
from text between ’(ďalej len’ and the closing parenthesis
’)’.</p>
        </sec>
      </sec>
      <sec id="sec-1-2">
        <title>As we described earlier, we created the dictionary for the</title>
        <p>method that identifies a specific law from a given text.
This method has a short text at the input, and the output
contains information about extracted law. We show the
example of the output extraction object in Figure 3, where
the judge used the alias ’O.s.p.’ for referencing Civil court
procedure.</p>
        <p>{
}
isAliasDeclaration=False
isAliasUsed=True
lawId=99/1963
lawName=’Občiansky súdny poriadok’
previousLawUsed=False
textInDecision=’O.s.p.’
start=13
end=22</p>
        <p>We describe the algorithm of identifying the law from
a single part of text following ’§’ sign in the next steps,
where from each step, if we extract a result, we compare it
to the previous best result. More specifically, we compare
the ofset (field start in the Figure 3) of the result within
a given text, and when it is closer to the ’§’ sign, i.e.,
the ofset is smaller, then we take that result as the best
current result. This set of steps runs iteratively for each
found ’§’ sign.</p>
        <p>STEP 1: Local alias. If the set of local aliases is not
empty (i.e. the set was filled with local aliases in previous
iterations), then the presence of some already extracted
local alias from the current court decision is verified. If
we do not find a local alias, we repeat this verification on
the lemmatized text.</p>
        <p>§ 172, § 234 section 1 and § 171b letter c) to e) Cr. proc.
name is used as the input for this algorithm, which is
a part before the starting ofset of the law name. This
single reference points to these laws:</p>
      </sec>
      <sec id="sec-1-3">
        <title>1. § 172 of the Criminal procedure;</title>
        <p>2. § 234 section 1 of the Criminal procedure;
3. § 171b letter c) of the Criminal procedure;
4. § 171b letter d) of the Criminal procedure;
5. § 171b letter e) of the Criminal procedure.</p>
        <p>STEP 2: Law ID and alias declaration. Law ID is
extracted by regular expression \d{1,3}/\d{4}. If law
ID is present, we check whether there is a local alias
declaration. If the local alias declaration is present, we To avoid repeating ourselves, after the first step of the
extract it in the same way described in the previous sub- algorithm, we will only show the parsing of details on
section 3.1 and insert it into a set of local aliases for the the last part of reference – ’§ 171b letter c) to e)’, and the
current decision. results from this part will be laws 3, 4, and 5. However,
the algorithm outputs all five laws from the reference.</p>
        <p>STEP 3: Dictionary of aliases. In this step, the
dictionary of laws and aliases is used to check for the exact
match of any alias from the dictionary. If we do not find
an alias from the dictionary, verification proceeds again
on the lemmatized text.</p>
        <p>STEP 4: Full name of the law. The full names of
laws are compared to lemmatized text as described in the
previous subsection 3.1.</p>
        <p>STEP 5: Previous law. This step deals with a
situation when we do not find matches in previous
steps, i.e., the text following the ’§’ sign. We found
out from our observations that most of these happen
because no law is specified after the ’§’ sign of a current
reference. Also, that judge cites the articles of law
mentioned in a previous reference. That is why we
maintain the most recently extracted law from the current
court decision in the previous context and assign this law.</p>
      </sec>
      <sec id="sec-1-4">
        <title>As seen in Figure 3 we extract the start and end ofset within the text of the current examined reference for future use in creating clickable links.</title>
        <sec id="sec-1-4-1">
          <title>3.3. Parsing details from law reference</title>
        </sec>
      </sec>
      <sec id="sec-1-5">
        <title>In this subsection, we describe parsing article numbers,</title>
        <p>section numbers, and letters from a given reference to
create a hyperlink to the referenced law. We have to count
with diferent abbreviations, multiple article numbers,
typos, ranges of letters, and section numbers.</p>
        <p>In Figure 4, we show a complex reference translated to
English where we highlighted the input for our parsing
algorithm in pink. As we can see, only a part before law</p>
      </sec>
      <sec id="sec-1-6">
        <title>STEP 1: Separate by §. In the first step, we separate the highlighted part by ’§’ signs.</title>
        <p>§ 172, § 234 section 1 and § 171b letter c) to e) Cr. proc.</p>
      </sec>
      <sec id="sec-1-7">
        <title>STEP 2: Article number. For each part of the refer</title>
        <p>ence, separated in the previous step, we extract the
article number and separate it from the rest of the part.
We achieve this separation using the following
regular expression (\d+[a-z]{0,2})((.|\s)*). The first
parentheses of the regular expression extract the article
number, which we highlighted in gray. In contrast, the
second parentheses extract the rest of the text, which we
highlighted in lime.
§ 172, § 234 section 1 and § 171b letter c) to e) Cr. proc.
STEP 3: Tokenizing article details. We tokenize
article details, which we highlighted in lime. We use a
regular expression for tokenization, which extracts only
alphanumeric parts of tokens.
§ 172, § 234 section 1 and § 171b letter c ) to e ) Cr. proc.
STEP 4: Parsing article tokens. We process and
categorize each token from the previous step. These
categories are:
• the letter or section word (lime);
• specific letter or number of the section (yellow);</p>
      </sec>
      <sec id="sec-1-8">
        <title>To identify whether one of the previously mentioned</title>
        <p>tokens is a letter word or a section word was done by
comparing Levenshtein distances to words ’písme’ (part
of word ’písmeno’, meaning letter) and ’odse’ (part of
word ’odsek’, meaning section).</p>
        <p>We discovered the use of ’písme’ and ’odse’ by
analyzing every word in extracted references longer than two
characters from a set of 320,000 court decisions. Judges
use mostly abbreviations ’pís.’, ’písm.’ or the full word
’písmeno’ to specify upcoming letters. Part of the word
’písme’ gives us the smallest average Levenshtein
distance to these abbreviations and complete words. Part of
the word ’odse’ was discovered analogically.</p>
        <sec id="sec-1-8-1">
          <title>3.4. References to other court decisions</title>
        </sec>
      </sec>
      <sec id="sec-1-9">
        <title>This subsection explains how we extracted references to</title>
        <p>other court decisions. We must note that we have focused
only on references to other Slovak court decisions. In the
future, we plan to extract references to judgments from
the European Court of Human Rights and the Court of
Justice of the European Union too.</p>
        <p>A reference to a court decision mainly consists of three
parts. In most cases, the judge specifies the name of the
court on which referenced court decision was made. After
that, the date of the procedure is stated, after which a file
number, or what we call a docket number, is specified.
We show an example of a reference to the court decision
that we translated to English in Figure 5.
. . . Regional court Bratislava from day 17.10.2011 d. num.</p>
        <p>We need to extract all three parts of the reference to
pinpoint the court decision the judge refers to because
docket numbers are not unique. A case can be processed
on the same court under the same docket number in more
than one court decision, and to distinguish them, we need
to extract the date as well. Also, diferent court cases can
be processed under the same docket number. That is why
we need to extract the name of the court as well. We
explain this extraction in the following steps.
STEP 1: Docket numbers. We extract docket
numbers with the use of a regular expression. Then, we take
90 characters long text before the docket number for
STEP 2: Court name. First, we try to find the full
name or abbreviation of special courts in the text, e.g.,
the supreme court. If the name of a special court is
absent, we proceed with identification, whether a county
or regional court is present. Then we search only for city
names with a court of the identified type. To measure
the similarity between texts and court names, we use the
ifnd_near_matches() method from the fuzzysearch [15]
library.</p>
        <p>STEP 3: Date. There can be many dates present in
court decisions, but in most cases, the dates between
the names of the courts and the dockets are the ones
that belong to the references. Therefore, we search for a
date primarily between the court name and the docket
number. The second most common way of stating the
reference date is not far after the docket number in the
text.</p>
      </sec>
      <sec id="sec-1-10">
        <title>From our observations, we have noticed that court</title>
        <p>names were always present before the docket number,
and therefore, we searched for them only in this part of
the text. We also have a collection of the most commonly
used abbreviations of court names, e.g., ’NSSR’ (short
for Najvyšší súd Slovenskej republiky) for the Slovak
Supreme Court.</p>
        <p>To extract the date, we currently use our date
extractor, which uses the regular expression
(\d{1,2}).(\d{1,2}).(\d{4}) supporting the
most widely used date format in court decisions. All
extracted parts contain the ofset of the court decision
text, which we use to create clickable hyperlinks in our
court decision browser.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>4. Evaluation</title>
      <sec id="sec-2-1">
        <title>We evaluated our methods on two datasets, each con</title>
        <p>taining 20 manually annotated court decisions. The first
dataset is annotated with references to laws, and the
second is annotated with references to other court decisions.
We should also mention that our datasets do not contain
ofsets of the references, and each court decision
contains a set of distinct references. In the future, we plan to
make a larger annotated dataset, even with ofsets and
all occurrences.</p>
        <sec id="sec-2-1-1">
          <title>4.1. Results for law references</title>
          <p>In the dataset for testing extraction of references to laws,
we created an attribute called annotated_laws, which
contained references to laws present in the court
decision. We used the same format as the format used in
the original dataset from the Ministry of Justice, which
contains the law’s ID, article number, section number,
and a letter.</p>
          <p>We show an example of annotated_laws attribute in
Figure 6 for one of the court decisions from our annotated
dataset.
annotated_laws: [
/SK/ZZ/1963/99/#paragraf-172.odsek-1
/SK/ZZ/1963/99/#paragraf-174.odsek-1
/SK/ZZ/1963/99/#paragraf-175
/SK/ZZ/2005/300/#paragraf-74.odsek-1.pismeno-b
references that we missed, which gives us an F1-score
of 92.05%. The reasons we did not manage to extract
these 24 references were very varied. For example, our
algorithm did not extract article number ranges from a
reference ’§ 243i to 243k’.</p>
          <p>The Slov-Lex dataset has been annotated with only 34
exact matches, 43 partial matches, and missed completely
203 references. However, there were only three false
positive annotations, which is better than our method.</p>
          <p>Table 2 presents results of extracting references to
other court decisions. Out of 34 annotated references, we
managed to extract 24 of them, giving us a sensitivity of
70.59%. This result is worse than the result of extracting
references to laws, but on the other hand, we got only
one false positive reference, which gives us a precision
of 96%.</p>
          <p>On the other hand, we obtained seven partial matches,
which make up 20.59% of annotated references. We
consider a partial match a match that does not contain one
of the three extracted attributes of references. In 5 cases,
we could not extract the date of the reference, and in the
rest, we did not extract the name of the court.</p>
          <p>We could not extract 3 out of 34 annotated references,
which is 8.8%. We could not extract these three references
because we did not manage to find a docket number in
the first place, and then we did not even search for a
court name and a date. Overall, for this task, we achieved
an F1-score of 81.36%.</p>
          <p>annotated references
exact matches
partial matches
false positives
not found</p>
          <p>We have also managed to extract six incomplete ref- 5. Conclusion
erences, where we have managed to identify the correct
law and article number. Still, sometimes we miss the This paper presented rule-based methods for extracting
extraction of a letter or a section. There was also one references to other court decisions and laws. We achieved
case where the annotated reference contained a sentence an F1-score of 92.05% for extracting references to laws
as a part of the article, which our algorithm cannot ex- and an F1-score of 81.36% for extracting references to
tract. These seven incomplete references make up 2.5% other court decisions. We believe there is room for
imof annotated references. provement, especially in lowering the number of false</p>
          <p>Another row of the 1 contains the number of false positives for extracting references to laws and extracting
positives, which added up to 12. Those are the references dates and court names for extracting references to other
we extracted but are not present in the court decision. court decisions.</p>
          <p>In most cases, we did not identify the correct law, and
they make up about 4.4% of all extracted references by
our algorithm, giving us a precision of 95.4%.</p>
          <p>The last row contains 24 references that we did not
manage to find, and they make up about 8.6% of actual</p>
          <p>One of the challenges that emerged during the
extraction of references to laws in court decisions is the
identiifcation of the correct version of the law. Slovak laws are
continuously modified and updated, and it is common
for a judge to refer to a version of the law that was not
up to date but was applicable when the act was com- [8] J. Harašta, J. Šavelka, F. Kasl, A. Kotková,
mitted. Therefore, incorporating domain-specific knowl- P. Loutocký, J. Míšek, D. Procházková, H.
Pulledge, such as legal systems or contextual understanding, mannová, P. Semenišín, T. Šejnová, N. Šimková,
could enhance the accuracy and comprehensiveness of M. Vosinek, L. Zavadilová, J. Zibner, Annotated
reference extraction in the Slovak legal domain. corpus of czech case law for reference recognition</p>
          <p>We consider this experiment to be a baseline for ex- tasks, in: P. Sojka, A. Horák, I. Kopeček, K. Pala
tracting references from Slovak court decisions, and we (Eds.), Text, Speech, and Dialogue, Springer
Interalso published a golden annotated dataset[16]. By ad- national Publishing, Cham, 2018, pp. 239–250.
dressing these challenges and pursuing future research [9] The Free Access to Law Movement (FALM), 2002.
directions, we aim to establish a solid foundation for au- URL: http://falm.info/.
tomated reference extraction in Slovak court decisions, [10] I. Mokanov, Good Old Hyperlinks, 2017. URL: https:
ultimately facilitating a system for searching and analyz- //www.slaw.ca/2017/10/20/good-old-hyperlinks/.
ing court decisions. [11] S. Peikert, C. Birle, J. Al Qundus, V. Le Duyen
Sandra, A. Paschke, Extracting references from german
legal texts ing named entity recognition (2022).</p>
          <p>Acknowledgments [12] Slov-Lex, 2022. URL: https://www.slov-lex.sk/
vyhladavanie-pravnych-predpisov.</p>
          <p>The Slovak Research and Development Agency supported [13] V. I. Levenshtein, et al., Binary codes capable of
this work under contract No. APVV-21-0336 Analysis correcting deletions, insertions, and reversals, in:
of court decisions by methods of artificial intelligence. Soviet physics doklady, volume 10, Soviet Union,
Pavol Jozef Šafárik University in Košice supported this 1966, pp. 707–710.
work with the internal project at vvgs-2023-2547 Legal [14] S. Krajči, R. Novotný, Tvaroslovník–databáza
text analysis using computer linguistics. This article was tvarov slov slovenského jazyka, in: Proceedings of
also supported by the Scientific Grant Agency of the Min- international conference ITAT 2012, SAIA, 2012, pp.
istry of Education, Science, Research and Sport of the 57–61.</p>
          <p>Slovak Republic under contract VEGA 1/0645/22 enti- [15] T. Einat, fuzzysearch, https://github.com/taleinat/
tled by Proposal of novel methods in the field of Formal fuzzysearch, 2022. GitHub repository.
concept analysis and their application. [16] M. Gojdič, D. Varga, Slovak court decisions,
annotated dataset, https://github.com/vargadavid304/
References slovak-court-decisions-dataset, 2023. GitHub
repository.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] Rozvoj elektronických služieb súdnictva (RESS) (</article-title>
          <year>2016</year>
          ). URL: https://obcan.justice.sk/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>The</given-names>
            <surname>Canadian Legal Information Institute (CanLII)</surname>
          </string-name>
          (
          <year>2001</year>
          ). URL: https://www.canlii.org/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>M. van Opijnen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Verwer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Meijer</surname>
          </string-name>
          ,
          <article-title>Beyond the experiment: the extendable legal link extractor</article-title>
          ,
          <source>in: Workshop on Automated Detection, Extraction and Analysis of Semantic Information in Legal Texts, held in conjunction with the 2015 International Conference on Artificial Intelligence and Law (ICAIL)</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cardellino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Teruel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Alemany</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Villata</surname>
          </string-name>
          ,
          <article-title>A low-cost, high-coverage legal named entity recognizer, classifier and linker</article-title>
          ,
          <source>in: Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>9</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoekstra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Breuker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Di</given-names>
            <surname>Bello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Boer</surname>
          </string-name>
          , et al.,
          <article-title>The lkif core ontology of basic legal concepts</article-title>
          .
          <source>, LOAIT</source>
          <volume>321</volume>
          (
          <year>2007</year>
          )
          <fpage>43</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Corrado,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          ,
          <article-title>Eficient estimation of word representations in vector space</article-title>
          ,
          <year>2013</year>
          . arXiv:
          <volume>1301</volume>
          .
          <fpage>3781</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cortes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vapnik</surname>
          </string-name>
          ,
          <article-title>Support-vector networks</article-title>
          ,
          <source>Machine learning 20</source>
          (
          <year>1995</year>
          )
          <fpage>273</fpage>
          -
          <lpage>297</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>