=Paper=
{{Paper
|id=Vol-2604/paper26
|storemode=property
|title=Ontologies as a Set to Describe Legal Information
|pdfUrl=https://ceur-ws.org/Vol-2604/paper26.pdf
|volume=Vol-2604
|authors=Anatolii Getman,Volodymyr Karasiuk,Yevhen Hetman
|dblpUrl=https://dblp.org/rec/conf/colins/GetmanKH20
}}
==Ontologies as a Set to Describe Legal Information==
Ontologies as a Set to Describe Legal Information
Anatolii Getman1[0000-0002-1987-2760], Volodymyr Karasiuk1[0000-0001-9092-2137]
and Yevhen Hetman2[0000-0002-1801-7252]
1
Yaroslav Mudryi National Law University, Kharkiv, Ukraine
2
National Academy of Law Science of Ukraine, Kharkiv, Ukraine
karasiuk@yahoo.com
Abstract. The article discusses the features of legal knowledge ontology crea-
tion. It is determined that ontology is the most appropriate way to describe legal
knowledge. The particular qualities of legal information and the features of the
language of a right were investigated. A review of legal knowledge ontologies
that are used in various branches of law was made. The properties of legal in-
formation and the requirements for regulatory documentation in Ukraine were
described. The formalization of the structure of the ontology database was pre-
sented, taking into account the required attributes of the concepts. The method-
ology of the work with the knowledge base was proposed to use the independ-
ent work of many users. The legal knowledge ontology at the law university
was filled by all users of the software package, but experts checked the quality
of this content. Crowdsourcing was considered as the main technique of the on-
tology filling process. Several branches of the ontology of legal knowledge
were filled. The results of the experimental operation of this ontology by uni-
versity students were analyzed.
Keywords: artificial intelligence, legal information, ontology, knowledge rep-
resentation, crowdsourcing
1 Introduction
The legal systems of continental Europe, including Ukraine, use regulatory law. That
is, they pay more attention to judicial interpretation in assessing legal facts and ac-
tions than to judicial precedents. Therefore, the regulatory framework (legislation)
first of all is to be easy of access and simply ordered. For many practical areas of ap-
plication of law information should also be available on the daily activities of legal
structures, that is, by-laws and regulations (on a second-priority basis). A modern way
of structuring legal knowledge is to describe them using ontologies. Based on legal
ontologies and inference rules, automated decision-making methods in the field of
law can be implemented. However, to create legal ontologies, it is necessary to con-
duct a legal analysis by the efforts of legal experts and knowledge specialists. Fea-
tures of the application of legal information requires the concept of legislative defini-
Copyright © 2020 for this paper by its authors.
Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
tions and attributes of regulatory documents in which they are introduced and used.
One more serious problem is the organization of the process of filling the ontology
with relevant content. In this process, enormous amounts of legislative (and regulato-
ry) information as well as the presence of collisions and confusion of terminology are
complicating conditions. With regard to legal information for application software, it
is to be noted the high variability of its content. Continuous changes in the regulatory
framework are difficult to track and model in the ontology. It is impossible to rely on
only the efforts of legal experts to fill the ontology, due to its volume. In this case,
you can count on the use of crowdsourcing and advanced rules for working with the
content of the ontology.
The ideas and approaches considered in this article were implemented in the pro-
cess of creating an ontology, which is applied in the educational process at the univer-
sity. The present article is a further development of the ideas contained in [1-4].
2 Particular Qualities of Legal Information
Legal information has a great importance for the functioning of modern society. Due
to its significance, it has significant features and otherness. Characteristics of legal
information and the legal language in Ukraine in general are given in detail in reviews
[5, 6].
With regard to legal terminology, it is special because of its versatility, since any
social activity is closely linked to law. A clear definition and unambiguous interpreta-
tion of legal terms is an extremely important condition for becoming constitutional
state, contributes to the improvement of legislative activity, the effective implementa-
tion of legislative acts, creates the conditions for their ordering and systematization of
legislation. In order to carry out its functions, the term must satisfy the requirements
that are scientifically substantiated and legally binding. Most experts consider sys-
tematic, accurate, unambiguous, stylistic neutrality (lack of emotionally expressive
marking) and motivation to be inalienable attributes of the legal term.
Regulatory and legal terminology, moreover, are characterized by multi-
component quasi-terms - long-term descriptive turnovers of different structures that
give a fairly accurate definition of the subjects and objects of specific relationships
and their interaction, thereby reflecting the essence of legal relations in each case,
which is regulated in this case regulatory act.
There are certain rules for using regulatory terminology when creating legal docu-
ments [7]. And the volume of by-laws (instructions, regulations, certificates, and oth-
er) is generally difficult to assess.
3 Formulation of the Problem
The main purpose of this research are: explore legal information to spot features that
are fundamental to the formation of the electronic structure of legal knowledge; study
the problem area of knowledge engineering to build legal information ontology, and
then provide an ontology model and technology to work with it. Features of legal in-
formation imposes their requirements on the structure of the ontology and the rela-
tionship between concepts. Attributes of concepts are very important for practical
activities in the field of law. The task is also to investigate the crowdsourcing method
for ontology filling-in.
4 The Use of Ontologies to Represent the Legal Information
Intelligent knowledge-based systems have various models of data processing and dis-
play. The most common in practice are taxonomies, semantic networks, and ontolo-
gies.
Taxonomy. Taxonomies display hierarchy of concepts associated subordination re-
lation, wherein each subsequent level taxonomy includes previous subtypes. Formally
taxonomy can generally be described: T = C, H , where C is the set of terms (con-
cepts), H - hierarchy relationships between terms (concepts).
Semantic network. Network models represent a directed graph wherein vertices -
are concepts and arcs - the relationship there between: N = C, R , where C is the set
of terms (concepts), R - set of relationships between concepts. This is the closest
view to what natural language information appears. Network model at the same time
does not give an idea of the domain hierarchy; in addition, the creation and modifica-
tion of the subject area model is difficult; moreover, for the processing of network
models a special apparatus for formal inference and planning is necessary [8].
Ontology. Ontologies are a variety of frame models that are used to describe re-
sources in WEB applications, corporate databases, document processing applications,
and the like [9-11]. Ontologies have a large number of element types and relation-
ships between them. Ontologies contain such elements: concepts (notions), classes,
relations, interpretation functions. Concepts are objects - the vertices of the ontology
graph that have their own semantic representation; classes combine many of the same
type of concepts. Relationships and interpretation functions define the relationship
between terms and their properties.
Ontology should introduce a minimum basic set of concepts, but sufficient for
modeling purposes, and given the description of complex situations, determine the
ratio between these concepts, their characteristics, effectively transmit certain value
concepts necessary to describe situations and interpretation functions.
A more relevant definition of ontology is given in the IEEE Std 1872 ™ -2015
standard: ontology is the basis that defines the main concepts, their properties, rela-
tionships between concepts and domain rules. Taxonomies provide an ordered set of
vocabulary and a single type of relationship between the terms, and an ontology pro-
vides more relationships, restrictions, and rules. Ontologies provide relevant
knowledge about the subject area explicit in a computer-interpreted format, which
allows software (SW) to reason about this knowledge to output new information. On-
tologies are an excellent tool to reduce the ambiguity in transferring knowledge be-
tween groups of people, information systems, knowledge bases, control systems and
other objects that are built on the same abstraction [12].
With regard to law, ontologies here allow us to emphasize [13, 14] the relationship
of legal norms with each other; the relevance of official legal conclusions; take into
account the degree of priority of legal norms; the identity of the diverse branches of
law and more. Ontological engineering in law therefore has its own characteristics: 1)
a large number of generally accepted concepts with their own specific application; 2)
the difference in the structures of various branches of law; 3) the presence of a general
theoretical legal level between the ontology of the upper level and the ontology of the
subject area; 4) a large number of theoretical assumptions and abstract constructions,
which depend on the specifics of legal views. Therefore, it should be considered that
the most effective approach is founded on the learning of the terminology of law and
the construction of ontologies on large text blocks.
Today, reliable methods for creating ontologies have not been created, and there is
no methodology for a comprehensive assessment of ontologies. Although general
approaches imply the idea of an assessment [15]: epistemological identity (clarity,
intuition, relevance, completeness), executive authenticity (sequence, computability),
the possibility of multiple use (task and method, subject area). These criteria are antic-
ipated to be developed for use as a standard for describing all ontologies.
In Ukraine, work on the creation of legal ontologies is also underway [16 - 18].
5 Overview of Legal Information Ontologies
Currently, a large number of software packages have been developed that implement
information systems in law based on ontologies. Consider the most interesting.
PrOnto [19], the ontology is designed to describe information about products and
their groups, and is currently effectively used to describe the GDPR (General data
protection regulation). This ontology considers the GDPR as a starting point, however
it is meant to be extended to the concepts and relative relations of other legal frame-
works.
LRI-Core - ontology, which covers all branches of law. Book [20] illustrates the
use of LRI-Core for legal ontologies in the Netherlands.
CLO – (Core Legal Ontology) SLO is an instrumental framework for building le-
gal ontologies for any branches of [21].
LKIF - is, first of all, several libraries of related concepts and their properties.
LKIF is a universal means for the legal domain [22].
LTS - Legal Taxonomy Syllabus - is an ontology-based legal curriculum [23].
Eurovoc thesaurus - is a multilingual thesaurus that forms the basis of domain
names for the European Union terminology database. Eurovoc is available in 23 offi-
cial languages.
Eunomos - is a software for managing legal documents and terminology. Eunomos
is focused on tracking changes in regulatory documents. This software supports mul-
tiple languages and focuses on many areas of application in various branches of law.
LegalRuleML - a package that uses Legal XML to represent a model of legal
knowledge and uses the rules to build a legal ontology. It includes NLP tools. Very
interesting project.
NRV - following development of LegalRuleML, which contains a set of basic
primitives for identification and classification.
ODRL - a structured language focused on legal support of open licenses in publish-
ing digital data on the network - photos, software, news and other data.
LDR - a modification of the language ODRL, which is focused on the use of model
LOD, including the multilingual data.
CC - language for describing the use of Creative Commons copyright licenses.
RDF concept used.
L4LOD - ontology for one problem area - for a universal description of licenses in
a computer network for data exchange.
ELI - ontology for a standardized description of European law through XML. This
ontology can be used online on the internet.
LOTED2 - legal ontology, which provides information on tendering in Europe. It
supports the legal grounding for purchases.
PPROC - an ontology in which a semantic description is used to support public
procurement procedures. Ontology prevents misconduct in procurement.
And other ontologies [24, 25].
6 Features of Legal Knowledge Ontology
The mathematical description of the legal ontology that we are developing is pre-
sented in detail in [3].
Features of legal information that were considered earlier should be taken into ac-
count when developing an ontology scheme:
- the presence of a synonymous description of some legal terms (nodes of ontolo-
gy);
- the presence of restrictions on the duration of wording of a legal term;
- the presence of a mandatory relationship between definitions (ontology nodes)
and strict wording (legislative definitions) in regulatory documents.
-and more others.
That is, the database implements the ontology, includes the following parts:
- concept (legal term) and connections;
- connections of groups of synonyms of legal terms;
- texts - definitions of legal terms in accordance with regulatory documents;
- lexical mapping of legal terms and relationships;
- indices the use of legal terms and relationships in the source document.
More about each part.
• concept and relations: are recorded with unique identifiers, with a name string for
output in the graphical interface.
• texts - definitions of legal terms: presented in the form of sentences, with fixation
of belonging to the section and the text, that is, among them the following entities can
be distinguished:
- text - in accordance with the structure of the normative document, this is a se-
quence of paragraphs, type, title, list of service attributes of the document;
- paragraph - description of the structure of the part of the document, which in-
cludes the title of the paragraph, the sequence of blocks and separate sentences of the
text;
- sentence - a structural unit of the source text in the form of a string and the type
of sentence (indicating membership in the source regulatory document or its para-
graph).
Fig. 1 schematically shows the hierarchy between the legal term (concept), sen-
tences, groups of synonyms.
Fig. 1. The structure of concepts and their relationships in the ontology database.
The problem of terminological confusion. The basic principle of legal technology
requires that different terminology in one text designate different legal phenomena.
Moreover, ideally, they try to extend this principle to the entire legal system as a
whole, so that only one term answers each legal phenomenon within it. This problem
is not confined to the particular case of terminological confusion in any particular
document - it is quite common, has long been known and has deep roots. One can find
a series of many terminological inconsistencies and contradictions in the Ukrainian
legislation. An analysis of the laws of Ukraine from the point of view of their termi-
nological certainty gives reason to conclude that the meaning of the term used in one
law is not always the same in content with the term contained in another law. Howev-
er, this is due to the fact that in addition to general legal terminology, which combines
the basic terms of the entire system of legislation, there is intersectoral and industry
terminology. Tautological constructions, polysemy and the like lead to confusion in
understanding the regulatory requirements of the laws of Ukraine [26].
In addition, conflicts between the provisions of regulatory documents of various
branches of law are not uncommon. In the era of the widespread dissemination of
electronic documents, digital databases and systems of related documents, the prob-
lem of adhering to terminological unity and its gradual implementation in current leg-
islation and practice of its application is of fundamental importance. The terminologi-
cal toolkit gives lawyers such an opportunity. However, on February 4, 2020 Law No.
469-IX was introduced to improve the electronic form of workflow in the Verkhovna
Rada. Previously, it was assumed that publications on the Verkhovna Rada website
are considered the official publication of laws (along with the publication in the Voice
of Ukraine newspaper and in the Verkhovna Rada Vedomosti). Now the official is
considered the publication of laws and other acts of the Verkhovna Rada in these me-
dia. That is, the electronic form of the document was made dependent on the printed
form. And the issues of streamlining the conceptual and terminological apparatus of
legal information remain relevant for modern Ukrainian jurisprudence. Naturally,
when creating an ontological structure, these circumstances should be kept in mind.
7 How to Work with This Ontology
Crowdsourcing. Only the efforts of experts are not enough to build ontology of this
scale. In addition, the legal knowledge system is very dynamic, «alive». One can re-
call Wikipedia, which was filled by everyone and a high level of reliability of the in-
formation contained in it. It means that the approach does works.
A number of projects for the creation and improvement of ontologies are known in
which the crowdsourcing method was applied [27, 28].
We managed to attract a significant group of students to fill the ontology. As part
of the training course, they analyzed the information of the subject area, identified
concepts, and created (filled) the ontology. The work was carried out under the gen-
eral guidance of a teacher, and after acquiring some experience, without assistance.
The results turned out to be rather optimistic both in relation to the initial ontology,
covering one subject area [4].
The completion of the first stage of creating an ontology of legal knowledge
showed the following results. The quality of extracting legal concepts from legislative
documents and textbooks was 90%. Students identified and contributed a total of
more than 6,000 legal terms. Experts noted some of the shortcomings identified in the
ontology. Students were not able to correctly identify some concepts that have com-
plex definitions consisting from several words. Conversely, several concepts were
defined verbose, more complex than they actually were. Also, when filling in the on-
tology, some concepts were not placed in the branches where they should have been
located according to the hierarchy of concepts. However, in general, the result is quite
good.
8 Discussion
After creating the ontology, it is necessary to evaluate the characteristics and reliabil-
ity of the obtained ontology. In accordance with [29], evaluation is required through-
out the entire ontology life cycle. However, in analyzing the different methods for
evaluating of an ontology it is to be noted that the proposed metric [30] difficult to
apply ontologies in various problem areas that have serious differences of the struc-
ture and output functions.
Ontology can be actively used if it is complete and has a convenient interface. Us-
ers got used to modern information systems with a developed interface. Shortcomings
in the interface repel potential users.
The experience of filling and training operation of an exemplary ontology has
shown that the problem of updating (supplementing) the ontology with new concepts
is acute. Users rarely refer to concepts that are needed infrequently, and these branch-
es remain unfilled. That is, to achieve perfect coverage of the subject area is very dif-
ficult. Perhaps, when scaling the system, with the access of a huge number of users
(all at once), this problem will be solved, as is now the case on Wikipedia — the clas-
sic crowdsourcing system.
Organizing the process of filling the legal ontology, there is a proposal on the fea-
sibility of forming an ontology at the time of adoption and the description of the nor-
mative act, due to a competent and proper analysis of correlation of concepts by ex-
perts in the divisions of the Verkhovna Rada are responsible for work with legal in-
formation.
In any case, ontologies are a better mechanism than using XML markup of legal
documents. Although it requires large expenditures for maintenance and development.
The lack of state support for the development and filling of legal ontology dooms
dozens of enthusiastic groups to proactive, unrelated (multidirectional) work in this
area.
9 Conclusions
As a result of the study, analysis of the advantages and disadvantages of various
knowledge representation systems, it is advisable to use an ontological representation
to describe legal knowledge.
The full implementation of the task of developing an ontological description of le-
gal information allows lawyers to provide a tool for access to relevant regulatory in-
formation in real time. And given the volume of the regulatory framework and its
variability, this will have a great practical effect.
In the framework of this work, the structure of the ontology database is formed, the
structure takes into account the attributes of concepts related to legislative definitions
and the time frame of validity of regulatory documents.
The created ontology is filled with key concepts of one branch of law (which is
considered as the core on the basis of which a full ontology will be developed). This
ontology is used in the educational process for specially prepared tasks in the study of
selected topics.
To fill in the ontology, the crowdsourcing method was used with the attracting of a
sufficient number of law students. We hope that with the accumulation of experience
with the created ontology, the quality of filling in the ontology will increase.
Based on the work performed, promising for future work, we consider the follow-
ing:
- development of automatic ontology comparison tools, which will be an important
tool for assessing the completeness, consistency of ontologies created by different
users, as well as comparing ontologies of different branches of law (or ontologies of
different countries);
- development of an interface in a natural language, to attract non-professional us-
ers of information systems to work.
References
1. Tatsyi, V., Getman, A., Ivanov, S., Karasiuk, V., Lugoviy, O., Sokolov O.: Semantic net-
work of knowledge in science of law. In: Shokin Yu., Bychkov, I., Potaturkin O. (Eds.)
Proceedings of the IASTED International Conference on Automation, Control, and Infor-
mation Technology (ACIT 2010), 218-222. ACTA Press, Anaheim, USA (2010).
2. Karasiuk, V.: Ontological paradigm of process of content for education purposes. Bulletin
of V. Karazin Kharkiv National University 19 (1015), 148-154 (2012).
3. Getman, A., Karasiuk, V., Hetman, Ye., Shynkarov, O.: Ontological Representation of Le-
gal Information and an Idea of Crowdsourcing fot Its Filling. In: O. Chertov et al. (Eds.):
ICDSIAI 2018, AISC, 836, 179-188, 2019. DOI: 10.1007/978-3-319-97885-7_18.
4. Getman, A., Karasiuk, V.: A crowdsourcing approach to building a legal ontology from
text. Artificial Intelligence and Law. 22(3) 313-335 (2014). DOI: 10.1007/s10506-014-
9159-1.
5. Law of Ukraine “On information” No. 2657-XII. In: Summaries of the Verkhovna Rada of
Ukraine (VRU), 48 (1992).
6. Besedna, L. Terminology and definition of the concepts in regulatory acts. Information and
Law. 2(5) 39-47 (2012).
7. Gladkivska, O.: Requirements for normative and legal terminology. Information and Law.
1(13) 55-62 (2015).
8. Borge-Holthoefer, J. and Arenas, A.: Semantic Networks: Structure and Dynamics. Entro-
py. 12 1264-1302 (2010). DOI: 10.3390/e12051264.
9. Gruber, T.: Toward Principles for the Design of Ontologies Used for Knowledge Sharing.
Int Journal of Human-Computer Studies. 43 (5-6) 907–928 (1995).
10. Jones, T.: Artificial Intelligence: A Systems Approach. Jones and Bartlett Publishers, Inc.
(2008).
11. Doerr M., Kritsotaki A., Christophides V., Kotzinos D. Reference Ontology for
Knowledge Creation Processes. In: Moen A., Morch A.I., Paavola S. (eds) Collaborative
Knowledge Creation. Technology Enhanced Learning, vol 7. SensePublishers, Rotterdam.
31–52 (2012). DOI: 10.1007/978-94-6209-004-0_3.
12. The National Artificial Intelligence Research And Development Strategic Plan: 2019.
https://www.nitrd.gov/pubs/National-AI-RD-Strategy-2019.pdf, last accessed 2020/02/11.
13. Noy, N., McGuiness, D.: Ontology development 101. Stanford Knowledge Systems La-
boratory Technical Report KSL-01-05 (2001).
http://www.ksl.stanford.edu/people/dlm/papers /ontology-tutorial-noy-mcguinness.pdf, last
accessed 2015/06/10.
14. Leone, V., Caro, L.D., Villata, S.: Legal Ontologies and How to Choose Them: the
InvestigatiOnt Tool. In: International Semantic Web Conference (2018). http://ceur-
ws.org/Vol-2180/paper-36.pdf, last accessed 2020/03/10.
15. Visser, P., Bench-Capon, T.: A Comparison of Four Ontologies for the Design of Legal
Knowledge Systems. Artificial Intelligence and Law. 6, 27-57 (1998).
16. Khala, K.: Development of an OWL ontology for representing conceptual knowledge in
the legal field on the example of court cases. Ukrainian Journal of Educational Studies and
Information Technology, 4(2), 51-55 (2016).
17. Kosenko, S.: The main statements of ontology theory and its implementation in the system
of legal knowledge. Information Technology and Security, 4, 2(7), 154-171 (2016).
18. Kiridon, A. (Ed.): The science of the ambush that theoretical and methodological principle
of creating basic encyclopedias. Kiyv, Encyclopedic View (2015).
19. Palmirani M., Martoni M., Rossi, A., Bartolini, C., Robaldo L.: PrOnto: Privacy Ontology
for Legal Reasonin. Springer International Publishing (2018) DOI: 10.1007/978-3-319-
98349-3_11.
20. Chen, H., Brandt, L. et al. (Eds). Digital Government: E-Government Research, Case
Studies, and Implementation. Springer US 233-261 (2008) DOI: 10.1007/978-0-387-
71611-4.
21. Gangemi, A.: Design patterns for legal ontology construction. CEUR Workshop Proceed-
ings. 321, 65-85 (2007).
22. Hoekstra, R., Breuker, J., Di Bello M., Boer, A.: The LKIF core jntology of basic legal
concepts. In: Proceedings of the CEUR Conference, 321, 43-63 (2007).
23. Schmitz, P., Francesconi, E., Batouche, B., Landercy S. P., Touly, V.: Ontological models
of legal contents and users’ activities for EU e-Participation services. EGOVIS 2017.
LNCS, 10441 99-114 (2017).
24. Peruginelli G. and Faro S. (Eds.): Knowledge of the Law in the Big Data Age. IOS Press.
Netherlands 2019. DOI: 10.3233/FAIA317.
25. Jimenez, C., Alarcon, D., Royo, C.: Comparative Analysis of Legal Ontologies, a Litera-
ture Review (TFR). University of Catalonia (2019). https://pdfs.semanticscholar.org/a8cd/
7aeb45b61d95ccd6ad71f1d585de1e93a798.pdf, last accessed 2020/02/11.
26. Klochko, M.: Legal Terminology: Concepts, Features. State Building and Local Self-
Government 18 148-154 (2009).
27. Sivula, A., Kantola, J. Ontology focused crowdsourcing management. Procedia Manufac-
turing 3 632 – 638 (2015). DOI: 10.1016/j.promfg.2015.07.286 https://www.researchgate.
net/ publication/283960344_Ontology_Focused_Crowdsourcing_Management, last ac-
cessed 2020/02/19.
28. Amini, R., Cheatham, M., Grzebala, P. and McCurdy, H. B.: Towards Best Practices for
Crowdsourcing Ontology Alignment Benchmarks. In: 15th International Semantic Web
Conference (ISWC) 2016 1-12 (2016). http://disi.unitn.it/~pavel/om2016/papers/om2016_
Tpaper1.pdf, last accessed 2020/01/19
29. Zurek, T.: Conflicts in legal knowledge base. Foundations of computing and decision sci-
ences, 37 (2) 129-145 (2012). DOI: 10.2478/v10209-011-0006-9.
30. Zhu, H. et al.: Quality Model and Metrics of Ontology for Semantic Descriptions of Web
Services. Tsinghua science and technology, 22 3 254 – 272 (2017). https://ieeexplore.
ieee.org/stamp/stamp.jsp?arnumber=7914198, last accessed 2020/02/29.