<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>CaseScope: An Enhanced Search Tool for European Court Cases</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alexandre Correia</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pedro Evangelista</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nádia Soares</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eugénio Rocha</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cláudio Teixeira</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center of Research and Development in Mathematics and Applications (CIDMA), University of Aveiro</institution>
          ,
          <addr-line>3810-193 Aveiro</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Physics, University of Aveiro</institution>
          ,
          <addr-line>3810-193 Aveiro</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Mindera</institution>
          ,
          <addr-line>Rua Goncalo Cristovão 347 s404, 4000-270 Porto</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Natural Language Processing (NLP) is a rapidly growing field of research, enabled by advances in computer power and deep learning models. As a subfield of Artificial Intelligence, NLP can help with tasks such as Named Entity Recognition and Sentiment Analyses by extracting meaningful connections between words from a text. New architectures for neural networks like transformers have been responsible for a great increase in performance at these tasks. These improvements motivated this work, where we look into the extraction of information from legal documents in the CURIA database to develop CaseScope, a search tool that presents users with filters that are machine-generated for court cases from the European Union. Besides enhancing CaseScope's search space with NLP techniques, we also provide a faster way to understand if a case is relevant to the user's search by presenting generated summaries of documents produced with recent models. Main diferences between CaseScope and currently available legal search tools are also compared. Features described in this work were developed with a multidisciplinary team, with expertise in many fields, including legal. Throughout our work, we present how CaseScope is built, from data collection to the search interface, to give a better insight into our approach to each step of creating CaseScope.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Natural language processing</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>legal search assistance</kwd>
        <kwd>information extraction</kwd>
        <kwd>keywords extraction</kwd>
        <kwd>summarization</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>emerge.</p>
      <p>In the legal domain, NLP has applications like
docuNatural Language Processing (NLP) is a subfield of Arti- ment analysis, contract review, and legal research. These
ifcial Intelligence (AI) focused on developing algorithms applications can help lawyers and legal professionals
and models that enable computers to interpret and gene- extract insights and information from legal documents,
rate natural language in the form of text or speech. These identify patterns in legal language, and automate routine
models can be exploited in a wide range of applications, tasks such as contract drafting and review.
such as machine translation and sentiment analysis. With Attorneys often face the time-consuming task of
recent advancements in NLP, the legal domain is well- searching and reviewing past court decisions relevant
placed to take advantage of this progress, since the vast to their cases [4]. Existing search tools, though helpful,
majority of documents produced in court and by legisla- rely on extensive human efort for the annotation and
tors are written with specific wording, and the informa- classification of documents. Moreover, they cannot fully
tion that can be extracted depends heavily on context. reflect on what the user is looking for, being only able to</p>
      <p>A deep understanding of the context contained in docu- produce results based on information curated by humans.
ments can be the diference between a simple information This dependency on human curation creates a bottleneck,
system that can only see a sequence of words, and a requiring more manpower for handling larger volumes of
knowledgeable AI, that is capable of advising about a information. To address this, we explored the use of NLP
specific legal case. methods in CaseScope, utilizing the CURIA database,</p>
      <p>With the development of Google Transformers [1] which is a collection of legal cases deliberated by the
and, more recently, OpenAI’s GPT [2], many problems Court of Justice of the European Union (CJEU) [5]. By
that were deemed unsolvable with prior methods became extracting information directly from cases, we aim to
more approachable [3], opening space for new tools to enhance the search space, present more suitable results,
improve information display, and potentially alleviate
Proceedings of the Sixth Workshop on Automated Semantic Analysis of the workload of human annotators.
Information in Legal Text (ASAIL 2023), June 23, 2023, Braga, Portugal. With these developments in mind, the present work
fo$ alexandredc@ua.pt (A. Correia); cuses on the assessment and implementation of relevant
pneaddriao..seovaarnegs@elimstian@demrain.cdoemra(.cNo.mSo(Par.eEsv);aenuggeelinsitoa@); ua.pt (E. Rocha); techniques to create a search tool for legal documents
claudio.teixeira@mindera.com (C. Teixeira) from the CJEU, using state-of-the-art NLP techniques to
© 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License address complex problems when searching for
informaCPWrEooUrckReshdoinpgs IhStpN:/c1e6u1r3-w-0s.o7r3g ACttEribUutRion W4.0oInrtekrnsahtioonpal (PCCroBYce4.0e).dings (CEUR-WS.org)
tion in large collections of domain-specific documents.</p>
    </sec>
    <sec id="sec-2">
      <title>2. State of the art</title>
      <p>In the legal domain, NLP tasks that can be considered
simple in the open domain, usually become more challenging
[6]. Reinforced with the fact that legal experts are
necessary to produce annotated datasets, makes the legal
domain a very interesting and enticing field to apply
state-of-the-art NLP methods.</p>
      <p>NLP applied to the legal domain is a subject of study
since the beginning of computer science [4]. However,
due to the complexity of this domain, it has been
dificult to make successful applications. The introduction
of transformers has altered this dynamic [7]. Since then,
many new models were developed to perform well in
this specific domain, achieving state-of-the-art results.
Notably, LegalBERT stands out as a leading example of
such models, comparing results between adapting a BERT
model with further pre-training on domain-specific
corpora or pre-training the model from scratch [8].</p>
      <p>Existing tools like Casetext [9] and Fastcase [10]
already assist lawyers and researchers in their search tasks.
Although useful, these tools mainly compute the
similarity between documents and searched keywords in order
to achieve their result-retrieving capabilities, so they do
not yet make use of recent NLP developments as taking
advantage of Large Language Models (LLMs) like GPT
to enhance their searchable database.</p>
      <p>Despite the fact that the number of annotated datasets
is slim, the availability of large-scale corpus of legal data,
like legal cases and national laws, have been increasing.
Programming packages such as the one utilized in this
study to retrieve data from the CJEU [11, 12] can simplify
the process of accessing such datasets.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>To build an eficient and modern search tool, a variety
of factors including the quality of data, storage systems,
and the eficiency of the implementation must be
considered. To develop a robust and capable tool, it is necessary
to carefully consider each of these factors and make
informed decisions about how to approach them. In this
section, the focus is on the technical details of CaseScope,
including data collection, data storage, NLP application,
and the search tool itself. A diagram describing how these
components are integrated can be analyzed in Figure 1.</p>
      <p>The subsection 3.1 covers data collection. A
description of methods used to collect the necessary data, the
challenges faced during the data collection process and
how they were overcome is presented. In 3.2, we discuss
data storage, which systems were used to store the
collected data and explain the rationale behind the choices
made. The 3.3 and 3.4 subsections focus on the NLP
application and on the search tool itself. Algorithms used
to enhance the data are described and the user interface
of CaseScope and how it was designed to facilitate the
user’s interactions with the system is explained.</p>
      <sec id="sec-3-1">
        <title>3.1. Data collection</title>
        <p>As stated in the introduction, CaseScope has the objective
of enabling enhanced search capabilities on documents
published by the CJEU. This collection of documents is
also known as CURIA. To collect all available data we
opted to utilize a pre-existing package for the
programming language R capable of executing queries to a bigger
database called Eur-Lex [13], where CURIA documents
are also present.</p>
        <p>The eurlex R-package was created with this exact
objective in mind, to give a cleaner option of data retrieval
to political scientists and legal scholars working with
data from the EU. To achieve this, an interface was
created with methods written in R that enable researchers
to retrieve data from Eur-lex, without having to learn
how to compile SPARQL queries. Through this package
is possible to request certain metadata, execute pre-made
or any other manually input query, as well as download
XML files from court cases [ 11]. Utilizing this tool, we
were able to collect more document data than just text.
Information was retrieved in three distinct ways.
3.1.1. XML file
XML files have a tree-shaped representation of data,
making them ideal for organizing and storing metadata.
Additionally, having a well-established category hierarchy
enables automation in both storage and search processes.</p>
        <p>We utilized these characteristics to establish an
automated retrieval system for relevant data, which forms the
basis for our filtering system in searches. The retrieved
data includes identifiers, dates, document type, court and
jury information, subject matters, treaties, and the
document’s link. Additionally, we collected a set of concepts
discussed in each case. Both subject matters and concepts
provide a high-level representation of the document’s
topics. Subject matters ofer insight into the main subject,
while concepts are organized in a hierarchical structure,
associating each document with one or more concepts.
3.1.2. Keywords
The document’s title, retrieved using the R-package
described before, is a header that provides the name of the
two main parties involved, the identifier of the case, and
associated keywords. These keywords are what we were
looking for, since the first two could already be collected
from the XML file.</p>
        <p>Keywords found in this section are written without
hard rules. There is no pre-determined set of keywords
that can be applied to a document like there is for subject
matters and concepts. This means more specific context
can be given to each case, but it also means it is harder to
create connections between documents based on these
keywords.
is the most important section of the document, as it is
where the court analyzes the legal and factual issues and
applies the relevant law to the facts of the case. The
decision section set out the court’s final decision and any
specific orders or directions made by the court.</p>
        <p>Legal documents are often written with highly
technical and specialized language, which can make it very
dificult to comprehend and extract the correct meaning.
They can also be extremely lengthy and numerous, so
summarized legal documents and identification of
similar subjects in diferent texts are also useful. By utilizing
NLP techniques in these texts we hoped to ease that
hardship, helping lawyers, legal researchers, and other
professionals analyze legal documents more efectively.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Data storage</title>
        <p>Since we had two diferent types of data, structured
metadata, and unstructured textual data from documents,
different methods of storage were utilized for each type of
data. For information that had a clear structure, a
relational database was designed and developed to provide
organized and searchable storage for data, including case
details, enabling eficient filtering and retrieval, with
related tables and citation tracking. For textual data, like
the full text of each document, Elasticsearch [14] was
utilized as our indexing system, enabling scalable
storage and retrieval of large volumes of text with complex
queries, suitable for searching through the full text of
Curia documents.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. NLP application</title>
        <p>By incorporating NLP capabilities into a legal search tool,
3.1.3. Legal text users can easily and quickly find the information they
The text of each document was also retrieved. Text is need, saving them time and improving the accuracy of
the primary form of data that can be processed with NLP their research.
techniques. From legal texts, it is possible to extract infor- In CaseScope, we utilized NLP models to accomplish
mation such as parties involved, keywords, and citations. two tasks. The first was enhancing the keyword field for</p>
        <p>The structure of CURIA documents is generally con- each document. This means taking advantage of NLP
sistent and follows a standard format, which includes capabilities to understand word dependencies and
imporsections such as the introduction, legal context, argu- tance in a text, and from that retrieve new keywords that
ments of the parties, assessment of these arguments, and can be associated with the document. By doing this we
decision. This structure helps to ensure that all relevant were able to expand beyond the human-generated
keyinformation is included in the document and makes it words, making it easier to find documents that are related
easier for readers to understand the legal issues at stake. to the user search. The second task where we applied</p>
        <p>The introduction section of these documents provides NLP models was document summarization. These
autoan overview of the case and the issues at stake. The legal matically generated summaries of documents can save
context section enumerates all the relevant EU law and users a significant amount of time when reviewing large
any national law or international treaties that are relevant volumes of documents. By displaying them together with
to the case. This section is critical as it helps readers to each respective document in a list of search results in
understand the legal framework within which the case CaseScope’s interface, the user can better understand
was heard. The arguments section outlines the parties’ which documents they should review first.
arguments, which is important for understanding the To accomplish these two tasks, two diferent models
legal issues being deliberated. The assessment section were tested and their results were stored. Firstly we made
use of the T5 model [15]. This is a model released in 2019, Figure 3: Mockup design for CaseScope’s interface. This
posthat can run on a local machine, which made it a great sible design for a specific version directed only at fiscal search.
choice to begin with. Besides the T5 capabilities, the On the top is the free text search, followed by three dropdown
model is somewhat outdated in the NLP ever-evolving menus with main keywords, where the user can select from
ifeld. So to try to achieve the best results possible, we the options. Next, there are other keywords, which the user
then utilized models from the GPT family [16], the "gpt- can select and unselect as they see fit. In the bottom-left
3.5-turbo" and the "gpt-4" [2], through OpenAI’s API. The corner, there is a representation of the concept tree intended
ifrst is a more afordable, equally capable version of "gpt- to present the user with a complementary way of searching
3.5" and the latter is, at the moment, the most capable tchorronuegrhretphreesreensutlatsp. oTshseibbleludeisrpelcatyanogfltehseornesthueltsbolitstto.m-right
model from the GPT family.</p>
        <p>The results produced by these models were stored in
a JSON format as shown in Figure 2. They were then
placed in the relational database, as fields in the
"document" table, one for generated keywords, and another
for generated summaries.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Search tool</title>
        <p>One of the objectives of this work was the creation of a
competent legal search tool that can distance itself from
existing tools by taking advantage of the most recent
developments in the NLP field. Legal search tools can
be used in a variety of legal contexts. For example, legal
professionals may use legal search tools to find relevant
case law when preparing legal briefs or arguing cases
in court, and researchers may use legal search tools to
identify trends in legal decisions or explore the evolution
of legal concepts over time.</p>
        <p>We developed CaseScope with a "search first - filter
after" approach. In CaseScope, users can search for specific
concepts and then apply filters to the results interactively.
Figure 3 shows an early proposed interface, initially
designed for searching fiscal documents but scalable to
other domains. The interface features a search bar for
text queries and three dropdown menus for applying
filters. Users can apply multiple filters and see the number
of cases resulting from each filter. After each search or
iflter application, results are displayed with their
corresponding summaries and main keywords.</p>
        <p>This is the primary way of searching with CaseScope
but an advanced search interface can also be present to
let the user search freely with filters without having to
go through an initial search.</p>
        <p>CaseScope, through the use of its API, may also be
integrated with any interface, and even exploited as an
extension for existing search tools.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results discussion</title>
      <p>The use of a pre-existing package for the programming
language R allowed for the execution of queries to the
Eur-Lex database, enabling the collection of more data
than just text from the documents. In addition, this
package provided a cleaner option for data retrieval, as it was
not necessary to learn how to compile SPARQL queries.</p>
      <p>One of the challenges we faced while collecting data
was keyword retrieval from the documents’ titles with
often including irrelevant information such as the
decision on costs. Moving on to "gpt-3.5-turbo," we observed
improvements with more concise phrases and relevant
information. Although artifacts like the inclusion of the
original language of the case were still present, the model
showed promise. Lastly, we tested the "gpt-4" model,
which ofers the advantage of handling larger texts
without the need for chunking. The results are promising, as
demonstrated by the example of a summary generated
using this model:
text separation techniques. Figure 4 shows an example
of a document title. From that title, our objective was
to retrieve the following keywords: "References for a "The Court of Justice of the European Union ruled that
preliminary ruling", "Direct taxation", "Freedom of estab- Article 49 TFEU does not preclude national legislation
lishment", "Corporate income tax", "Measures to prevent restricting the ground for exclusion from the scope of
tax avoidance by shell companies", "Determination of tax- measures to prevent tax avoidance by shell companies
able income on the basis of presumed minimum income", to companies whose securities are traded on national
"Exclusion from the scope of those measures of compa- regulated markets. This exclusion does not apply to other
nies and entities listed on national regulated markets". companies, whether national or foreign, whose securities</p>
      <p>Although seemingly simple, this task became challeng- are not traded on national regulated markets but are
ing due to diferent documents using diferent characters controlled by companies and entities listed on foreign
to separate keywords, as well as diferences in spacing. regulated markets."
Those challenges were eventually overcome by
understanding every diference and taking measures to correct
each of them, but this means regular reviews are still
necessary in case new documents are written diferently,
making the process not fully autonomous.</p>
      <p>Another task we aimed at completing was the
collection of the questions referred to the court. These
questions are an important section of Curia documents, and
from them, it is possible to understand what is being
argued in the case, and if the court responded or not to the
questions referred. For this task results were still
incoherent due to inconsistent document formatting, making
the retrieval of these questions directly from text a very
dificult task to automate.</p>
      <p>From the XML files, retrieval of information was
simpler. This was our main source of information to create
filters, enabling the collection of basic information on each
case, such as parties involved, identifiers, and citations.</p>
      <p>It also permitted the use of the already well-established
tree of concepts curated by the EU to categorize cases.</p>
      <p>With the application of NLP models, it was possible
to produce summaries for documents. The presentation
of accurate and reliable summaries for large documents
has great potential to improve the productivity of legal
search work. If a well-written summary of a document
is available while the user is only browsing through the
results they can rapidly understand if a given document is
relevant or not for their work, accelerating this discovery
process and making the pool of documents that have to
be fully analyzed smaller.</p>
      <p>To produce these summaries, we tested T5,
"gpt-3.5turbo," and "gpt-4" models and conducted a qualitative
evaluation. Summaries generated by the T5 model were
either lengthy or missed the main points of the case,</p>
      <p>Besides summary generation, NLP models were also
applied to enhance CaseScope’s search space. With the
help of GPT models, more keywords were collected for
each document. Taking the document with its title
presented in Figure 4 as an example, in addition to the
keywords already listed before, we were then capable of
collecting the following keywords by utilizing "gpt-4":
"Italian laws", "tax avoidance measures", "securities",
"national regulated markets", "freedom of establishment",
"discrimination", "national companies", "foreign
companies", "parent company", "shell companies", "European
Union law".</p>
      <p>As we can see the existence of overlapping keywords
indicate successful extraction of useful keywords by the
model. The generated keywords are smaller in size than
the human-labeled ones, which helps in expanding the
search space with a broader and more general group of
keywords. In some cases, the model separates a single
keyword from the human-labeled set into two distinct
keywords. For instance, the keyword "Measures to
prevent tax avoidance by shell companies" is divided into
"tax avoidance measures" and "shell companies" in the
set of generated keywords.</p>
      <p>With the objective of better understanding how
CaseScope relates to legal search tools available on the market,
we described the features of four well-established and
reliable legal search tools together with CaseScope’s
features. This description can be reviewed in Table 1.</p>
      <p>For CaseScope, we marked features that are in our
roadmap, which doesn’t exclude the addition of new
features in a later review. AI/Machine Learning will at least
be present in the form of what was discussed in this work,
the enhancement of the search field with the assistance</p>
      <p>Feature
AI/Machine Learning</p>
      <p>Annotations
Brief Analytics</p>
      <p>Case Alerts
Case Law Research
Change Tracking
Data Visualization
Query Suggestions</p>
      <p>Search History</p>
      <p>Search/Filter
Self-Service Search
Statutes Research</p>
      <p>Tax Law Research
Third-Party Analysis Integration</p>
      <p>Summarization
Keyword Extration
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
of NLP techniques. Data visualization and Search/Filter
are core concepts of a search tool, so those are mandatory.</p>
      <p>Search History is a feature we understand to be essential
to have a good workflow, enabling the user to pause a
search session and resume it at a later time.</p>
      <p>The work here described shows our focus and ability
to develop other features that are not yet available in
previously discussed tools, and that is the diferentiator
factor for CaseScope, enhancing search capabilities and
the workflow of legal practitioners and researchers.
CaseScope can be integrated with any interface through its
API, and could even be leveraged by existing legal search
tools to extend their capabilities, since it is a robust
application developed with insights from legal practitioners.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>more keywords for each document, improving
CaseScope’s search space. Overall, the results show the
potential of NLP models in legal research and document
analysis, and the benefits they can bring to legal
practitioners and researchers.</p>
      <p>Future work for CaseScope can focus on evaluating the
results of NLP models systematically using automated
metrics for summaries evaluation, enhancing confidence
and identifying areas for improvement. Additionally,
implementing a Q&amp;A system to retrieve answers to specific
legal questions and integrating them into CaseScope’s
searchable database would be valuable. Furthermore,
ensuring an extensible tool that stays updated with the
latest CURIA documents requires creating a pipeline to
automate the methods described in this work, ensuring
a stable and useful legal search tool.</p>
      <p>The main objective of this work was to review the various Acknowledgments
necessary methods to develop a search tool for CURIA
legal documents and apply them. Advanced NLP models This work is part of a joint venture project where we
were utilized to tackle the intricate challenges involved in worked with Morais Leitão, Galvão Teles, Soares da Silva
searching for information in vast collections of domain- &amp; Associados, who as lawyers and domain experts
apspecific documents. proached us with this innovative project to be developed</p>
      <p>We first looked into how data from the CURIA doc- combining legal and technical expertise. We would like to
uments could be retrieved to feed CaseScope, through express our gratitude to Cláudia Baptista, Ana Pedro
Casan R-package named "eurlex" and further preprocessing. tro, Carlos Coelho and António Queiroz Martins for their
Then we went over our approach to storing this said data, contribution to this research. The fourth author was
parby utilizing a relational database and an indexing system, tially supported by the Center for Research and
Developfor structured and unstructured data, respectively. ment in Mathematics and Applications (CIDMA), through</p>
      <p>With the application of these NLP models, we were the Portuguese Foundation for Science and Technology,
able to produce summaries for documents and collect reference UIDB/04106/2020 and UIDP/04106/2020.
[17] Casefleet, 2023. URL: https://www.casefleet.com/,
accessed on April 30, 2023.
[1] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, [18] Liquidtext, 2023. URL: https://www.liquidtext.net/,
L. Jones, A. N. Gomez, L. Kaiser, I. Polosukhin, At- accessed on April 30, 2023.</p>
      <p>tention is all you need, 2017. arXiv:1706.03762.
[2] OpenAI, Gpt-4 technical report, 2023.</p>
      <p>arXiv:2303.08774.
[3] L. Tunstall, L. Von Werra, T. Wolf, Natural language
processing with transformers, " O’Reilly Media,</p>
      <p>Inc.", 2022.
[4] H. Zhong, C. Xiao, C. Tu, T. Zhang, Z. Liu,</p>
      <p>M. Sun, How does nlp benefit legal system:
A summary of legal artificial intelligence, 2020.</p>
      <p>arXiv:2004.12158.
[5] The court of justice of the european union, 2023.</p>
      <p>URL: https://curia.europa.eu/jcms/jcms/j_6/en/,
accessed on April 17, 2023.
[6] J. Savelka, V. R. Walker, M. Grabmair, K. D. Ashley,</p>
      <p>Sentence boundary detection in adjudicatory
decisions in the united states, Traitement Automatique
Des Langues (2017) 21–45. doi:10.47164/ijngc.</p>
      <p>v14i1.1004.
[7] A. Gillioz, J. Casas, E. Mugellini, O. A. Khaled,</p>
      <p>Overview of the transformer-based models for nlp
tasks, in: 2020 15th Conference on Computer
Science and Information Systems (FedCSIS), 2020, pp.</p>
      <p>179–183. doi:10.15439/2020F20.
[8] I. Chalkidis, M. Fergadiotis, P. Malakasiotis,</p>
      <p>N. Aletras, I. Androutsopoulos, Legal-bert:
The muppets straight out of law school, 2020.</p>
      <p>arXiv:2010.02559.
[9] Casetext, 2023. URL: https://casetext.com/, accessed</p>
      <p>on April 30, 2023.
[10] Fastcase, 2023. URL: https://www.fastcase.com/,
ac</p>
      <p>cessed on April 30, 2023.
[11] The court of justice of the european union, 2023.</p>
      <p>URL: https://cran.r-project.org/web/packages/
eurlex/vignettes/eurlexpkg.html, accessed on April
17, 2023.
[12] M. Ovádek, Facilitating access to data on
european union laws, Political Research Exchange
3 (2021) 1870150. URL: https://doi.org/10.1080/
2474736X.2020.1870150. doi:10.1080/2474736X.</p>
      <p>2020.1870150.
[13] Eur-lex: Access to european union law, 2023. URL:
https://eur-lex.europa.eu/homepage.html?locale=
en, accessed on April 13, 2023.
[14] Elasticsearch, 2023. URL: https://www.elastic.co/,</p>
      <p>accessed on April 30, 2023.
[15] C. Rafel, N. Shazeer, A. Roberts, K. Lee, S. Narang,</p>
      <p>M. Matena, Y. Zhou, W. Li, P. J. Liu, Exploring the
limits of transfer learning with a unified text-to-text
transformer, 2020. arXiv:1910.10683.
[16] A. Radford, K. Narasimhan, Improving language
understanding by generative pre-training, 2018.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>