<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>When Data Stands Before the Law: an Experience Report on Representing Financial Rules in SPARQL</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marcello Ceci</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolas Sannier</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sallam Abualhaija</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Domenico Bianculli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Halling</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FDEF - University of Luxembourg</institution>
          ,
          <addr-line>6 rue Richard Coudenhove-Kalergi L-1359</addr-line>
          <country country="LU">Luxembourg</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SnT - University of Luxembourg</institution>
          ,
          <addr-line>29 Avenue John F. Kennedy, L-1855</addr-line>
          <country country="LU">Luxembourg</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Automated compliance checking of data (including financial data) against applicable law is only possible with a formal representation of complex legal rules. The literature in the fields of legal informatics and Requirements Engineering (RE) can count on decades of contributions to the representation of data models, rule languages, and reasoners for legal application. There is however a representational gap between data and legal norms, which prevents a comprehensive approach to legal knowledge representation, resulting in the lack of a standard solution for legal rules representation. This paper reports on our experience regarding the formal representation of complex financial rules, more specifically, the entirety of Article 43 of the Luxembourgish UCITS Law. We used SPARQL to specify the rules to be used for the automatic validation of a real financial dataset. We provide observations regarding (a) the complexity of the resulting SPARQL queries and how SHACL can help address some of this complexity, and (b) the alignment of the rules/queries with the knowledge expressed by the corresponding legal statements. We discuss the implications of these observations and describe the main challenges in achieving a machine-readable representation of legal norms.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Automated Compliance Checking</kwd>
        <kwd>FinTech</kwd>
        <kwd>Legal Knowledge Representation</kwd>
        <kwd>Legal Reasoning</kwd>
        <kwd>SPARQL</kwd>
        <kwd>SHACL</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Run-time monitoring of investment funds’ activities is an important challenge in the financial
industry [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This is particularly relevant for regulated funds such as UCITS (Undertakings for Collective
Investments in Transferable Securities), which are legally required, among other things, to diversify
their portfolio composition: they should not, for example, be mainly composed of investments toward
the same issuer. Compliance with these requirements can, however, be sensitive to the evolution
of markets: a fund operating close to the limits might end up infringing the law not because of the
manager’s actions, but simply because of market fluctuations afecting the value of the assets in its
portfolio. There is therefore a need to monitor the activities of investment funds and the corresponding
data as a whole, to ensure compliant operations.
      </p>
      <p>
        Automated compliance checking shows a prominent need for maintainable models of applicable
regulations and advocates for the use of open semantic standards such as RDF and OWL [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. It also
requires close alignment with data management practices [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. However, there is a certain lack of
cohesion between research in Description Logics and practical data modelling [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which carries over to
the field of legal knowledge representation where — we contend — there is a representational gap between
data and legal norms. As a result of this gap, current commercial applications provide proprietary (ad
hoc) solutions assisting asset managers and legal experts in various tasks of the compliance process
(e.g., tax and accounting management, portfolio management, risk management, reporting), which
are designed as black-box systems that do not rely on open semantics regarding data. Because of this,
and because legal experts lack technical expertise1, these systems often come with consulting services
necessary for any customization (e.g., the implementation of additional legal rules or features). The
ifeld of automated compliance checking seems thus torn between two needs: the need for legal rigor
and the need for viable implementations.
      </p>
      <p>
        This paper investigates the problem of representing the legal norms expressed by legal statements
(i.e., regulations) in a machine-readable format supported by a logical reasoner and aligned with actual
data representing the state of afairs. Specifically, the paper reports on our experience of using SPARQL
(SPARQL Protocol and RDF Query Language)2 for expressing complex norms in the financial sector. We
focus on the legal statements contained in Article 43 of the Luxembourgish UCITS Law3, enacting Article
51 of the EU UCITS Directive as it sets asset composition limits for UCITS funds. The paper presents
the methodology we followed to define the SPARQL rules, aligned to a conceptual model that we built,
as part of previous work, on the basis of a real dataset sourced from a financial information provider [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
Our experience shows that SPARQL is capable of handling complex financial rules. However, the
translation process is far from intuitive, making it challenging for legal experts to understand the rules
or its implications. Moreover, the complexity of the resulting rules might lead to unpractical execution
times [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>The paper highlights these challenges, related to the complexity of the representation and the
misalignment between the legal statements and the representation. We note that some of them were
not highlighted in the literature because they only emerge when dealing with complex legal norms
and with application to real data. We show how some of these challenges can be addressed by SHACL
(Shapes Constraint Language)4. However, not all of them can be solved in the modeling phase (e.g., by
splitting the representation of a norm among several rules), as some cases require the input of a legal
expert during the execution phase. It seems that compliance checking can only be semi-automated,
possibly relying on an argumentation framework to support alternative representations of the law.</p>
      <p>The rest of the paper is structured as follows: we define the problem of representing financial rules for
automated compliance checking in Sect. 2. In Sect. 3 we review the state of the art in the representation
of legal knowledge and reasoning. In Sect. 4 we present the methodology that we followed for building
the model and the rules, and in Sect. 5 we report on the experience of representing financial rules. In
Sect. 6 we reflect on the experience, and provide a research outlook in Sect. 7. Sect. 8 concludes the
paper.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Preliminaries</title>
      <p>In this section, we provide preliminary information regarding the representation of legal knowledge
and reasoning.</p>
      <p>
        The peculiarity of law. Legal statements do not behave as normal statements: they do not describe
an existing objective reality, but rather prescribe a desired intersubjective reality [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Any knowledge
representation efort aiming to represent the legal norms expressed in these statements for application
to data has to take into account the complexities entailed by legal norms, including the regulative [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and
constitutive [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] efects of norms, exceptions [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], regulatory change [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and legal interpretation [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        The representational gap. Especially in domains such as finance, the volume and velocity of
governing laws and governed financial transactions make run-time compliance checking a challenge.
There is a representational gap between data (logically organized and deterministic) and legal norms
(general and abstract, open-textured [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]), originating in the deductive nature of formal logic, whereas
legal reasoning relies on pragmatism, persuasion and critical thinking, involving patterns of beliefs
in a process of guesswork and deductive justification [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. For example, where legal norms would
distinguish between “transferable securities” and “money market instruments” at a high level, data
would consider several types of bonds, whose qualification to the former or the latter type is often
uk/sra/research-publications/technology-innovation-in-legal-services/
2https://www.w3.org/TR/sparql11-query/
3Law of 17 December 2010 relating to undertakings for collective investment, available in English at: https://www.cssf.lu/
wp-content/uploads/L_171210_UCI.pdf.
4https://www.w3.org/TR/shacl/
unclear [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The alignment between legal concepts and data thus requires explicit clarifications, as done
by financial regulators [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]). As a result, full automation of compliance checking is currently a chimera.
      </p>
      <p>
        From a research perspective, legal informatics would propose a highly articulated representation of
the law in order to ensure its legal soundness, which however results in constructs (e.g., exceptional
facts being subcategories of relevant facts in OWL 2 [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]) that are hard (and sometimes downright
impossible) to map to actual data. Approaches in software engineering (SE) would not adequately
implement legal considerations, with the consequent risk of overlooking aspects such as alternative
legal interpretations or inter-relations among norms during the compliance assessment.
      </p>
      <p>
        In the current state of afairs, creating formal compliance rules remains a task devoted to developers
in the lack of a standard approach that would allow legal experts to directly contribute to the process.
Large Language Models (LLMs) have the potential to help in that regard, towards the automation of
rule creation; however, they currently fall short in creating rules representing the norms expressed
by legal statements [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]: so far, successful applications of LLMs in the legal field are rather limited,
involving only document generation and analysis, case prediction, and legal research [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        The need for a model and a rule set. One essential characteristic of automated compliance
checking is explainability. This consideration is reinforced for AI-based systems automating legal
interpretation and application: since they are categorized as high risk under the AI Act, explainability is
critical for their acceptance in the EU [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. One solution to provide explainability for legal interpretation
and application is to use symbolic representations [20] to bridge the formal rules and the understanding
of the law by legal experts. Previous contributions to legal knowledge representation and modeling
provide the tools necessary for a representation that can fill the aforementioned representational
gap [21], expressing the semantics of legal norms in a format that can be mapped to the data to be
checked.
      </p>
      <p>
        This implies the need for a data model and a rule language supported by a reasoner. The data model
is used to map the legal concepts to a common, coherent representation, and to ensure alignment
with the target data [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. On this regard, automated compliance checking advocates for models as
semantic resources to allow reuse and ensure maintainability [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The rules are needed to validate (and
manipulate) data, detecting paths in the data model that allow compliance experts to identify outliers,
i.e., breaches [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. The expressiveness of the rules determines, in turn, the logical complexity to be
handled by the reasoner in the reasoning layer [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. State of the Art</title>
      <p>In this section, we review the state of the art in the representation of data models, rule languages,
and reasoners for legal applications, from the perspective of two research communities, namely legal
informatics and requirements engineering (RE). RE is a field of SE that investigates regulatory compliance,
in particular the specification of legal requirements and rules based on the analysis of legal norms [ 22].</p>
      <sec id="sec-3-1">
        <title>3.1. Models for Representing Legal Concepts</title>
        <p>
          The field of legal informatics provides approaches for legal compliance checking in the Semantic Web
based on modeling deontic norms in terms of ontology classes and ontology property restrictions [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
Peculiarities of legal knowledge such as norm defeasibility or multiple interpretations are also modeled
within the OWL 2 language [
          <xref ref-type="bibr" rid="ref16">16, 23</xref>
          ]. However, these models are considered too complex and thus
expensive (in terms of construction and execution time) in practical scenarios [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], where the choice
falls on databases and SQL-based queries, which only employ first-order logic, considered too limited
for legal reasoning [24].
        </p>
        <p>In the field of RE, models such as the taxonomy of semantic metadata of legal provisions [ 25] or
the approach for deriving business processes from the law [22] attempt to tame the complexity of the
legal domain with task-automation objectives in mind. However, those models have a high-level, often
generic representation of the provisions, and a weak relationship to the concrete state of afairs [26].</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Rule Languages for Representing Norms</title>
        <p>
          Semantic markup languages such as LegalRuleML [27], open formats [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] and representations of the
norms complexities [20] have been widely investigated in the field of legal informatics. The level of
abstraction at which these rules are formulated, however, does not allow straightforward mapping to
actual data for automated compliance checking purposes. Furthermore, not all languages are supported
by an automated reasoner, and, for those who are, execution time and scalability are recognized
challenges [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>Complex normative structures have been studied also in the field of RE to support the precise
definition of software requirements. While attempts at rule-based formalizations [ 28] lack consideration
for legal peculiarities [29], approaches such as Legal GRL [30] and Nòmos [31] account for alternative
representations of legal norms. However, these valiant attempts at bridging the representational gap
sufer from two major limitations on either end of the gap: on the legal side, the resulting artifacts
(e.g., goal models) are hard to understand for legal experts; on the SE side, since they are meant for
compliant design purposes rather than compliance checking, their output requires further processing
to be suitable for formal specification, which would further drive the software implementation.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Legal Reasoning</title>
        <p>
          Regarding the reasoning layer, we note that the simulation of legal reasoning is possible only insofar
as it is allowed by the expressiveness of the conceptual models and the rule sets. Many observations
regarding the characteristics, challenges and limitations of legal reasoning are thus inherited from
the chosen models and rule languages, especially regarding performance and scalability. A recent
contribution in the field of legal informatics [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] compares several existing reasoners in terms of their
capability to handle exceptions and contrary-to-duty obligations, two important elements of complexity
in legal reasoning. However, the rules used to test the reasoners are rather simple, as they do not
include complex relations between the involved entities, nor possible impact on other rules. They
indeed describe complex information, such as publishing comments on a product evaluation. However,
these variables are not automatically detected, as the representation is done independently of the target
data.
        </p>
        <p>
          The most comprehensive solution to representing legal reasoning while accounting for aspects such
as alternative or conflicting representations of norms or data consists in the use of argumentation
frameworks [32, 33]. However, the execution times are particularly high [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], and no complete approach
towards representing legal compliance as an argumentation framework has been attempted yet [34].
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Motivation</title>
        <p>
          As discussed in this section, there is a lack of cohesion among the approaches to legal knowledge
representation for software applications:
• the legal theoretical approach, investigating interoperable representations of legal concepts and
rules, is mostly ignored by the software engineering community due to its detachment from
industry concerns or practical data structure [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ];
• the engineering approach, investigating the automation of steps of the compliance process, is
mostly based on ad-hoc models which do not suficiently account for the complexity of the legal
domain; its contributions are thus considered of limited usefulness by the legal community [29].
Here, the field of RE has provided important attempts to bridge the gap, which however have not
been considered satisfactory.
        </p>
        <p>As a result, current simulations of legal reasoning are largely incomplete: e.g., they are either not
understandable by a legal expert, not mappable to real data, not scalable, not applicable to diferent
contexts, or not maintainable over time. Bridging the gap between raw data and conceptual/rule based
representation of a domain of interest is a long-standing problem in contexts like data integration,
data warehousing, data exchange, and ontology-based data management. Typical solutions resort to
mappings5.</p>
        <p>This paper investigates the above-presented representational gap by reporting on an attempt to
represent legal norms in a set of rules aligned to a real data set.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>
        Context. This research is part of a project investigating the automation of run-time monitoring of fund
activities [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The proposed solution includes the (manual) definition of a conceptual model and the
specification of rules to check the compliance of the portfolio composition.
      </p>
      <p>
        Conceptual model definition. The conceptual model was built, as part of previous work [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
following a methodology [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] where model elements are initially created based on domain concepts
taken from the letter of the law, and progressively refined by looking at other sources. Specifically, in
this work, we based the refinement on secondary legal sources such as guidelines or Q&amp;As issued by
regulatory authorities (e.g., [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]). We also adapted the model to the reality under legal scrutiny, i.e., data
regarding portfolio composition of a company. The data came from a proprietary dataset from a major
ifnancial information provider, covering periodical (monthly) extracts of fund asset composition over a
period of five years. The integration of these extracts into the model went a long way in ensuring that
the concepts (and ultimately the rules) were expressed at the right level of granularity.
      </p>
      <p>
        The process of building the concept model inevitably involved the interpretation of the legal text,
e.g., to interpret the concept of “issuer [of a financial instrument]” as an institution, or to correctly
represent the relationship between derivatives and their underlying financial instruments (more on
legal interpretations in Sect. 6). The model is available in our previous contribution [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and in the online
annex6.
      </p>
      <p>Compliance rules specification. We developed the rules as constraints on the model, following a
three-step process.</p>
      <p>(I) First, we identified the required actions by looking at verbs and logical operators within the legal
statements.</p>
      <p>
        (II) For each required action, we then identified the other main rule elements (namely addressee,
pre-condition(s) and constraint(s) — see [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]) and rewrote the legal text in streamlined, logical fashion
(e.g., we converted all passive verbs in their active forms) using an if-then structure whenever
pre-conditions were present.
      </p>
      <p>(III) The last step involved the actual representation of the rules in SPARQL and the alignment of
the rule variables to the conceptual model. This is also the step where any discrepancies or missing
elements in the model/data (non-observable variables) could be detected (see Sect. 6.2).</p>
    </sec>
    <sec id="sec-5">
      <title>5. Using SPARQL to Represent Financial Rules</title>
      <p>In this section, we report on the application of the process for defining the rules for Art. 43 of the
Luxembourgish UCITS law7 following the steps introduced in the previous section. The first and second
authors, with the validation of the last author, contributed to the first two steps of the translation, while
the last step was carried out by the first author only.</p>
      <p>Step I. We identified all verbs bound (even implicitly) to deontic modalities (e.g., “may invest”, “may
not exceed”) which led us to identify 11 rules, 10 of which (R1–R10) are regulative rules (shown in
Table 1 and in their full version in the online annex) and one (R11) is a constitutive rule (a definition).
This activity included the integration of regulatory changes and the resolution of cross-references. We
note that not all legal statements introduce additional rules: for example, the statement “The transferable
securities and money market instruments referred to in paragraphs 3 and 4 shall not be taken into account</p>
      <sec id="sec-5-1">
        <title>5See for instance the R2RML recommendation: https://www.w3.org/TR/r2rml/. 6The annex for this paper is available at: https://dx.doi.org/10.6084/m9.figshare.29304779. 7English version of the Luxembourgish UCITS Law available at: https://www.cssf.lu/wp-content/uploads/L_171210_UCI.pdf.</title>
        <p>for the purpose of applying the limit of 40% referred to in paragraph 2” (Art. 43(5)) only results in an
additional (negative) constraint added to the rules expressed in Art. 43(3) and 43(4) (see below for more
on inter-relations among norms).</p>
        <p>
          Step II. Successively, we identified the elements of the rules. On this regard, we note that the
addressee is always “A UCITS”, the action is always “shall not invest” and the rules do not have
preconditions (applicability conditions), but rather constraints (compliance conditions, see [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]) ranging
over the type of financial instrument targeted, the type of issuer, and the ratio set as upper limit. Table 1,
despite not being complete due to space reasons, gives an idea of the investment limits to be represented
as constraints. For instance, R1 would state that a “UCITS fund” shall not invest more than “10%” of its
“assets” in “transferable securities” (TS) and “money market instruments” (MMI) from a “single issuer”
that is not “EU-guaranteed”.
        </p>
        <p>Inter-relations among norms. We note that not all constraints of a rule are introduced in the same
legal statement. For example, the criterion of R1 on the issuer not being “EU-guaranteed” does not
appear in the initial provision of Art. 43(1) but rather as an exception implied by the norm expressed in
Article 43(3), which corresponds to R7 in the table. This is due to the fact that the norms expressed by
the legal statements often overlap, through exceptions expressed within the same paragraph (e.g., Art.
43(2),“this limitation does not apply to [. . . ]”) or through references in other paragraphs or articles (e.g.,
Art. 43(3), “the limit laid down in the first sentence of paragraph 1 [. . . ] ”). Figure 1 gives an overview
of the inter-relations among the norms expressed in Art. 43. In this work, inter-relations have been
integrated into the rules by the authors; see Sect. 6.1 for the possible solutions to represent them.</p>
        <p>Step III. Representing the rules of Art. 43 in SPARQL and mapping its variables to the conceptual
model was complex, as certain passages in the legal statements require (combinations of) functions
which appear non-intuitive. Overall, it took approximately 25 hours to specify the formal representations
in SPARQL, with the complex expressions in Table 2 accounting for most of the time. Considering the
limited experience of the authors with SPARQL, generative models (in our case, MS Copilot, based on
GPT-4) were helpful in refining the rules, even if they were not able to propose fully correct rules.</p>
        <p>Example. Due to space limitations, we illustrate our process only using rule R1, while all other
rules are available in the online annex. We started our representation of R1 by rewriting a simplified
version of the first sentence of Art. 43(1), shown in Figure 2(a). We note that R1 should also include the
exceptions coming from Art. 43(3) and (4): for clarity reasons, we did not include them in the figure,
but we discuss them below. We then mapped the entities in the rule to the model — see Figure 2(b).
Finally, we created the SPARQL query shown in Figure 2(c).</p>
        <p>Regarding inter-relations between norms, as mentioned above, R1 has exceptions explicitly introduced
(e) “Where X invests more than [ratio] the total value must Aggregate function [20], results not grouped by issuer
be [ratio]”</p>
        <p>Preceding statement’s condition becomes negative condition
of rule
Aggregate function [20], results grouped by issuer
Condition added to reference
Rule is cumulative (not alternative) to the referenced one(s)
Add negation of condition to previous statement, Add rule from
condition
(h) “[reference] shall not be taken into account [rule]”</p>
        <p>Entities in reference are excluded from rule
(i) “The limits of [reference] shall not be combined”</p>
        <p>Compliance to each referenced rule is necessary
(j) “[entity] shall be regarded as [entity]”</p>
        <p>New definition (constitutive norm)</p>
        <p>Article 43(1)
inter-relation
limits shall not (i)
be combined</p>
        <p>R4
5% OTC (a)(b)</p>
        <p>R3
10% OTC (a)</p>
        <p>R2
20% on D (a)</p>
        <p>R1
10% on TS, MMI (a)
(h)</p>
        <p>(g)</p>
        <p>R10
35% for TS, MMI, D
Article 43(5)</p>
        <p>(j)</p>
        <p>Inter-relation
43(3) and (4) excl. R5</p>
        <p>R11
Etensional definition of
"same body"</p>
        <p>(f)
(f) (c)
(f)</p>
        <p>(f)
(g)</p>
        <p>R5
40% overall (d)
(h)</p>
        <p>R7
35% for EU guaranteed</p>
        <p>R8
25% for covered B</p>
        <p>(e)</p>
        <p>R9
80% overall</p>
        <p>Metarule
the law of investment applies
(not modeled here)</p>
        <p>Article 43(2)</p>
        <p>R6
20% combined</p>
        <p>Article 43(3)
Article 43(4)
in paragraphs (3) and (4) of Art. 43; according to these norms, certain assets (e.g. the transferable
securities issued by a EU Member State) are excluded from R1, since a diferent limit applies to them.
As we will further discuss in Sect. 6, one of the possible ways to represent exceptions is to incorporate
them in the original rule. The complete (simplified) text of R1 would then be as follows:
A fund shall not invest more than 10% of its assets in ‘transferable securities’ or ‘money market
instruments’ issued by the same ‘issuer’ unless these are guaranteed by a Member State Equivalent
Institution, or are covered bonds, or are bonds issued before 8 July 2022 by a credit institution which has
its registered ofice in a Member State and is subject by law to special supervision.</p>
        <p>Similarly, the SPARQL query for R1 would require the addition of the following exclusion filters in
(a)
(b)
 A fund shall not invest more than 10% of its assets in ‘transferable securities’ or
‘money market instruments’ issued by the same ‘issuer’.</p>
        <p>UCITS
hasInvestment</p>
        <p>Investment
total_asset_value: double
Financial_Instrument
shares: int
share_value: double
isConvertible: boolean</p>
        <p>Institution
issuedBy LEI: double
globalSectorID: Global_Sector_ID
globalSector: Global_Sector
sector: SectorCode
Transferable_Security</p>
        <p>Money_Market_Instrument
(c) SELECT ?issuer (SUM(?total_asset_value) AS ?total_investment_value)
?portfolio_value ((SUM(?total_asset_value) / ?portfolio_value) * 100 AS ?percentage)
WHERE {
{ SELECT (SUM(?total_asset_value) AS ?portfolio_value)
WHERE { ?investment rumofa:investmentOf rumofa:MyUCITS ;
rumofa:total_asset_value ?total_asset_value . } }
?investment rumofa:total_asset_value ?total_asset_value ;
rdf:type ?type ;
rumofa:issuedBy ?issuer .</p>
        <p>FILTER (?type IN (rumofa:Money_Market_Instrument,
rumofa:Transferable_Security))}
GROUP BY ?issuer ?portfolio_value</p>
        <p>HAVING ((SUM(?total_asset_value) / ?portfolio_value) * 100 &gt; 10
order to implement the exceptions:</p>
        <p>FILTER NOT EXISTS {?type rdf:type rumofa:CoveredBond}
FILTER NOT EXISTS {?issuer rumofa:MS_Guaranteed true} UNION {
?issuer rdf:type rumofa:MemberStateEquivalent}
FILTER NOT EXISTS {?investment rumofa:issue_date ?issueDate .</p>
        <p>?issuer rdf:type rumofa:CreditInstitution ;
rumofa:prudential_supervision true ;
rumofa:hasRegisteredOffice ?office .
?office rumofa:locatedIn ?country .
?country rdf:type rumofa:MemberState .</p>
        <p>FILTER (?issueDate &lt; "2022-07-08"^^xsd:date)}}</p>
        <sec id="sec-5-1-1">
          <title>Observations</title>
          <p>Following the methodology presented in this paper, the resulting 10 SPARQL queries were validated
(using a Virtuoso8 endpoint) against sample data that was manually created to cover the various
situations expressed in the legal rules. SPARQL proved expressive enough to represent the norms
expressed by the legal statements of Art. 43.</p>
          <p>We noted how the semantics of certain passages of the legal statements appeared non-intuitive when
expressed in a logical format: legal experts notably find it hard to understand the equivalence between
the textual expressions and functional representations in Table 2, or between the textual norm of
Figure 2(a) and its corresponding query in Figure 2(c). The causes of this would deserve investigation, as
they seem to be related to the representational gap — see Sect. 2. As a consequence, technical expertise is
needed to fully understand and validate the SPARQL query. This technical expertise needs however to be
blended with the legal expertise needed to formally represent legal peculiarities such as non-observable
variables and alternative legal interpretations of the norm. We discuss these open challenges in the
next section.</p>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>8https://vos.openlinksw.com/owiki/wiki/VOS/VOSSPARQL</title>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion</title>
      <p>In this section, we reflect on the experience of applying the representation process to complex legal
rules.</p>
      <sec id="sec-6-1">
        <title>6.1. Logical and Structural Complexity of the Representation</title>
        <p>Our experience with Art. 43 shows that formal representations of legal norms are highly articulated,
going beyond expressing logical operators or navigating knowledge graphs, as it is the case with the
examples shown in the literature. For example, they involve aggregation [20] of the value of investments
involving the same issuers (see Table 2). Representing indirect manipulations of the original data is
not straightforward, as these operations often conceal legal interpretations (e.g., when classifying
investments).</p>
        <p>To compute the complexity of a SPARQL query, the only method found in the literature is the one
proposed by Buil-Aranda et al. [35]. The method consists in assigning a weight of 1 to each operator
and each triple pattern of the query. The resulting complexity calculation is shown in Table 3.</p>
        <p>Such complexity scores are unseen for SPARQL queries. According to Arias et al. [36]: “Most
[SPARQL] queries are simple, i.e., 66.41% of DBPedia queries and 97.25% of SWDF just contain a single
triple pattern [. . . ] and the [relationship] chains in 98% of the queries have length one, with the longest
path having a length of five”. However, we note that this analysis does not account for nested structures,
so a long yet straightforward query with many ANDs would show a complexity similar to a highly
nested (and thus less intelligible) query.</p>
        <p>
          In addition to the logical complexity described above, we note the presence of inter-relations
among norms such as exceptions, introduced in Sect. 5. We have two solutions for handling such
inter-relations in our representation:
• The first solution consists of implementing the relation directly in the afected rule . We showed
an example in Sect. 5, where we added constraints to R1 to exclude the exceptional investments.
This is the solution adopted in the literature [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and is in line with the ontological vision of
modalities, where an exceptionally allowed action is represented as a subset of a breach [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. This
solution, however, does not scale well. A rule may repeatedly introduce exceptions such as these
into several other rules, and each rule may thus have several of these conditions added, making
any human supervision impractical. Also, this part of the rule has diferent legal sources than the
main part of the rule: these additional sources should be added as metadata to the SPARQL query
for traceability and change impact management [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
• An alternative solution is to allow rules to defeat each other’s consequents using defeasible logics [37].
        </p>
        <p>
          This solution implies a hierarchical structuring of the rules to automatically determine the outcome
of conflicts, establishing which rules apply and which do not. This alternative has been suggested
in the literature [
          <xref ref-type="bibr" rid="ref12">33, 37, 32, 12</xref>
          ] and implemented in the SPINdle reasoner [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Although the
solution is useful for conflicting rules with complementary consequents, it would not be suficient
in our example of Art. 43(3), where the exceptional investments are not only assigned a diferent
threshold, but also excluded from the application of R1.
        </p>
        <p>Further research is required to establish whether these considerations can be generalized beyond
exceptions to all inter-relations among norms.</p>
        <p>
          Generally speaking, such a high complexity within and among norms is likely to impact not only
the performance of reasoners [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], but also the usability of the rules for legal experts. This makes it hard
to establish a collaboration between a legal expert and a developer in writing those rules. While an
intermediate representation would help in that regard, we note that the visualization of R1 as a graph is
not possible using current standards, as functions such as GROUP BY are not covered and accounted
by the literature [38]. The issue of query/data visualization is acknowledged in the legal informatics
literature [39] as an important challenge towards the creation of a human-friendly (ideally, visual)
intermediate language that can be aligned one-to-one to a formal language such as SPARQL.
        </p>
      </sec>
      <sec id="sec-6-2">
        <title>6.2. Misalignments Between Legal Statements and their Representation</title>
        <p>Regarding the case in which the norms expressed by the legal statements cannot be represented in the
model or in the ruleset in a way that allows automated compliance checking, we have identified two
similar yet distinct situations related to legal interpretations and data observability, respectively.</p>
        <p>
          Legal Interpretation. No efort on legal knowledge representation can ignore the subject of legal
interpretation. The law being an inter-subjective reality [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], its statements do not provide information
about factual reality, but rather describe a desired reality using generic and abstract terminology. The
application of the law to the case in point is always subject to legal interpretation, an intellectual process
which takes into account the semantics of the text(s) but also the underlying principles and assumptions
on the intentions of the regulator. The challenges posed by legal interpretation are strictly related to
the representational gap (see Sect. 2).
        </p>
        <p>
          As an example, let’s consider the following: on 3 November 2021 CSSF, the Luxembourg financial
market supervisor, published an updated version of its document providing clarifications on the holding
of assets by UCITS (the “UCITS FAQ” [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]). Regarding Article 43, the document specifies that the
20% limit on deposits applies to ancillary liquid assets and does not apply to margin accounts. This
specification impacts the definition of “deposit”; we can easily imagine how, before this clarification,
several alternative versions of the norm would have been possible, either including or excluding ancillary
liquid assets or margin accounts. Another example is in Art. 43(5), where, after stating that certain limits
cannot be combined, the successive statement introduces a prohibition with the word “thus”, which may
qualify the prohibition as either an example or a concretisation of the previous legal statement. In this
work, we opted for the second interpretation, which resulted in R10; however, diferent interpretations
might result in diferent rules.
        </p>
        <p>In order to handle legal interpretation, during the process of rule (and model) creation the legal
expert should be in control of the alternative interpretations of a legal statement, as the choice on
which version of the norm to represent in the ruleset carries important consequences in the compliance
checking.</p>
        <p>Data Observability. We note that not all elements expressed in Art. 43 were observed in our
dataset. An example is an asset being “guaranteed by a Member State” (Art. 43(3), implemented in R1
via inter-rule relationship as shown in Table 1) or a company being “included in the same group for
the purposes of consolidated accounts” (Art. 43(5)). As opposed to legal interpretation, which results in
alternative — and possibly conflicting — versions of the model and ruleset, non-observable properties
can be unambiguously represented in the ruleset; however, the related model category cannot be
automatically instantiated, i.e., the values for those properties cannot be automatically determined
based on available data, which can prevent a checker from efectively verifying the rules. Despite
missing data being a widespread problem in the financial industry [ 40], the issue of data observability is
— to the best of our knowledge — not taken into account in the literature of rule representation, possibly
because it becomes obvious only when complex models have to be implemented against real data.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Outlook</title>
      <p>In this section, we discuss possible research directions in the multidisciplinary field of legal knowledge
representation to handle the aforementioned challenges.</p>
      <sec id="sec-7-1">
        <title>7.1. Handling Representational Complexity: Calling for a Splitting of Legal Rules</title>
        <p>In Sect. 6, we introduced inter-relations among norms as elements of complexity, whose solution afect
the logical structure of the rules to be executed on the data. However, inter-relations are not the only
elements that may prejudice the self-containment of norms as individual rule artifacts. In fact, we
note that certain rules may share common constraints. Repeating the same constraint checking for
each rule (i.e., repeated subqueries) may be ineficient and may hinder the readability of the rule. An
alternative consists in introducing new concepts (as variables if the language permits, otherwise as
additional model elements). For example, identifying “certain bonds issued after 8 July 2022” can be
eased by representing these bonds as a specific subclass of “bond” in the model (“ bond issued after
8-7-22”) or as a Boolean class attribute specifying these characteristics, instead of checking the date
of issue for every rule that includes that constraint. These concepts are called intermediate legal
concepts (ILCs) in the legal literature [41].</p>
        <p>The choice of whether to represent ILCs impacts both the knowledge generation phase (i.e., when
building the model and the rules) and the checking phase (i.e., when instantiating the model and running
the rules on it):
• Ignoring ILCs in the representation implies that, at runtime, the related restrictions are explicitly
calculated for each rule. This approach has the main downside of resulting in a considerable
increase in the length of the ruleset, especially when the ILC is relevant to multiple rules.
• Alternatively, ILCs can be represented as additional properties (subclasses or attributes) in the
model, with a specific rule devoted to their instantiation. Technically this could be done either by
creating views over the underlying data or by using named subqueries (known as Common Table
Expressions in SQL). This approach is in line with the legal theory’s view of ILCs [41]; while it
has the downside of splitting the norm into several rules, these end up being shorter and simpler.</p>
        <p>Further research is required to determine the optimal balance to maximise the readability and the
usability of the ruleset for legal experts. A possible solution consists in defining a set of patterns for the
rule/query language, similar to the Ontology Design Patterns [42].</p>
        <p>Inter-relations among norms and ILCs suggest a need for a more articulated structuring of legal
rules, representing the antecedent and consequent parts [31] as two distinct rules, where the antecedent
rule describes the applicability conditions, and the consequent rule states the deontic statements that
apply to the relevant entities. Entities that fulfill the first rule would thus be reclassified as relevant to
(i.e., fulfilling the default precondition for ) the second rule(s). This structure would allow exceptions to
apply to either of these parts, while also providing a template for ILCs. It also allows for representing
definitions with a limited application context, such as R11 (see the online annex).</p>
        <p>
          Using SHACL to Address Representational Complexity
Some of the complexity issues highlighted in the previous subsection can be overcome by SHACL,
a recommendation of the W3C (World Wide Web Consortium) for describing and validating RDF
graphs [43]. In SHACL, validation is based on shapes, which define particular constraints for specific
nodes in a graph. SHACL has been considered a viable solution for representing legal rules [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]; also,
the approach of dealing with (non-)compliance as a validation shape is in line with the intrinsic nature
of compliance, as shown in ontological representations where the (non-)compliant behaviour is seen
as a subset of the possible behaviour [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Furthermore, SHACL provides a declarative way (target
declarations) to specify the focus nodes, i.e., which nodes in the RDF graph should be validated, unlike
SPARQL which requires filtering and grouping. Where SPARQL lacks modularity and reusability,
SHACL in fact provides ways to handle inter-rule relations and layered definitions: for example, it
supports the creation of so-called intermediate shapes, which allow to handle nested constraints for
a more intuitive and modular rule definition. Intermediate shapes can help in handling inter-relations
between norms, whose representation in SPARQL requires repeated subqueries. Following is an example
of intermediate shapes handling the exceptions to R1:
ex:CoveredBondExceptionShape
a sh:NodeShape ;
sh:property [
sh:path rdf:type ;
sh:hasValue rumofa:CoveredBond ;
] .
ex:IssueDateExceptionShape
a sh:NodeShape ;
sh:property [
sh:path rumofa:issue_date ;
sh:lessThan "2022-07-08"^^xsd:date ;
] ;
sh:property [
sh:path rumofa:issuedBy ;
sh:class rumofa:CreditInstitution ;
] .
        </p>
        <p>R1 can then be rewritten, referring to the nested constraints introduced above for representing the
exceptions, as follows:
ex:InvestmentLimitShape
a sh:NodeShape ;
sh:targetNode rumofa:Investment ;
sh:not ex:CoveredBondExceptionShape ;
sh:not ex:IssueDateExceptionShape ;
sh:sparql [</p>
        <p>a sh:SPARQLConstraint ;
[SPARQL query of Figure 2(c)]
]</p>
        <p>While it successfully modularizes exceptions, this SHACL shape does not get rid of SPARQL: in order
to handle aggregate functions (grouping investment value by issuer), it is in fact necessary to embed
the same SPARQL query from Figure 2(c) inside the shape, using the SHACL-SPARQL extension, since
SHACL-Core can only validate constraints on individual nodes or properties.</p>
        <p>In addition to intermediate shapes, we note that SHACL provides a mechanism to declare the severity
of a shape by attaching diferent semantic labels to its validation result (i.e., info, warning, violation),
and allows for the definition of shapes that refer to other shapes, enabling recursive validation (the
latter being particularly useful for modeling legal norms that involve hierarchical or nested conditions,
e.g., group membership). However, some of the features of SHACL (such as intermediate shapes) are
not mentioned in the formal documentation; their usage rather arises from practice. Therefore, these
features are not standardized and may difer in practical implementations. This further complicates the
understanding of the formal representation for a legal expert, prompting the need for reusable patterns
(as mentioned previously) and/or an intermediate language (see the next subsection).</p>
        <p>In conclusion, we note that SHACL seems promising to address some of the challenges related to the
complexity of the query; however it is not expressive and standardized enough to bridge the human
understanding of the law and its formal representation.</p>
      </sec>
      <sec id="sec-7-2">
        <title>7.2. Handling Representation Misalignment: the Need for Human Intervention</title>
        <p>
          Overall, the challenges posed by legal interpretation and data observability impact the representation
of norms either by making the rules more complex or by preventing the automation of rule checking.
The issue of interpretation is well known in the literature [44], being widely recognized as a reason
for the need of explainability of the reasoner’s results [
          <xref ref-type="bibr" rid="ref10 ref18">10, 18</xref>
          ] and for the need of argumentation
frameworks to handle mutually exclusive arguments [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]; however, over the last 50 years the most
common approach to automating compliance in practical scenarios has been to embed an interpretation
of the legal statements into a black-box compliance checking system [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. We recall that no legal analysis
can be considered accurate if it does not take into account alternative interpretations. In the cases where
the ambiguity does not allow for establishing concrete checks, the process can still be semi-automated
by providing the relevant information and possible alternative representations to the domain expert in
charge of the assessment (the human intervention, see below).
        </p>
        <p>Regarding data observability, compliance checking for non-observable variables can be
semiautomated: in this scenario, the legal expert or third-party data may provide the information that is
missing from the dataset to instantiate the model. For instance, whether a “[company is included in
group] in accordance with internationally recognized accounting rules” (Art. 43(5)) is typical additional
knowledge that can be instantiated by the domain expert. This can be done either before the rules
are applied (e.g., by specifying which types of accounting rules are to be considered as internationally
recognized) or as part of the check (e.g., by prompting the expert who will manually instantiate the
attribute for each company). Non-observable data could be detected in SHACL by using intermediate
shapes with specific severity.</p>
        <p>We note that both the above-presented challenges of legal interpretation and data observability imply
the need for human intervention: the check cannot be fully automated, because in certain cases the
reasoner needs external input to complete the reasoning. This implies the need to present modelling
cues (e.g., a reference to non-observable data) in a manageable way. Compliance checking is then a
process where facts are gathered, automatic (and possibly conflicting) classifications are made, and then
a legal expert provides contextualized information that could not be automatically determined. Note
that this contextualized information might also depend on the preferences or values of the human expert
among the alternative interpretations of the legal statement, and thus advocates for an argumentation
structure.</p>
        <p>The complexity of operational languages such as SPARQL and SHACL [28] prompts the need
for an intermediate language to facilitate the legal expert’s understanding of the implications of
alternative formal representations. According to state-of-the-art in legal knowledge representation, this
interaction can be ensured via an argumentation framework [32, 33], representing alternatives (e.g., a
non-observable variable, like, in our example, a ratio being reasonable or not) as arguments, which the
expert can manually accept or reject according to the foreseeable legal implications. Such a framework
has the potential to represent the application of complex rules to real data sets in a human-readable
way, especially if supported by a user interface (but unfortunately, as already noted, the state of the art
in the visualization of legal reasoning is lacking [39]).</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>8. Conclusion</title>
      <p>In this paper, we have reported on our experience on representing complex financial rules in a
machinereadable language for compliance checking. We have presented the approach, consisting of a model and
a rule set, and shown an example. We have then discussed the challenges posed by legal texts to their
abstract representation for automated application. As mentioned in Sect. 7, it seems that the best way to
address the challenges presented in this paper is to increase the number of rules, reducing their length,
and allow human intervention in the choice among alternative interpretations and for non-observable
variables. In that regard, SHACL is a promising candidate to address some of the problems related to
the sheer length of the rules and the repeated subqueries, even though it still has to rely on SPARQL
to handle complex operations such as aggregate functions. The complexity related to the increased
number of rules can be handled through the use of intermediate legal concepts and an intermediate
language, possibly representing the entire compliance process as an argumentation framework.</p>
      <p>As part of future work, we plan to define complexity measures for the resulting rules, with an
augmented dataset that would allow to measure execution times and more generally to evaluate the
performance of SPARQL and SHACL in a practical scenario. Next steps in the representation of complex
legal norms include the investigation of argumentation frameworks and of intermediate representations
that are more lawyer-friendly, in a strive to achieve a modeling of legal knowledge that allows the
integration of a legal expert into the automation of the compliance checking process.
This research was funded in whole, or in part, by the Luxembourg National Research Fund (FNR), under
grant numbers NCER22/IS/16570468/NCER-FT and C24/IS/18894115/AGLAIA.</p>
      <sec id="sec-8-1">
        <title>Declaration on Generative AI</title>
        <p>The authors did not use any Generative AI during the preparation of this work.
G. Mazzini, I. Sanchez, J. Soler Garrido, E. Gomez, The role of explainable AI in the context of the
AI Act, in: FAccT 2023, 2023, p. 1139–1150.
[20] J. Anim, L. Robaldo, A. Z. Wyner, A SHACL-based approach for enhancing automated compliance
checking with RDF data, Information 15 (2024).
[21] B. Fawei, A. Wyner, J. Z. Pan, M. J. Kollingbaum, Using legal ontologies with rules for legal textual
entailment, in: AI Approaches to the Complexity of Legal Systems, 2017, pp. 317–324.
[22] T. D. Breaux, A. I. Antón, J. Doyle, Semantic parameterization: A process for modeling domain
descriptions, ACM Trans. Softw. Eng. Methodol. 18 (2008) 5:1–5:27.
[23] F. Al Khalil, M. Ceci, K. Yapa Bandara, L. O’Brien, SBVR to OWL2 mapping in the domain of legal
rules, in: RuleML 2016, 2016, pp. 258–266.
[24] G. Antoniou, G. Baryannis, S. Batsakis, G. Governatori, M. B. Islam, Q. Liu, L. Robaldo, G. Siragusa,</p>
        <p>Large-scale legal reasoning with rules and databases, Journal of Applied Logic 8 (2021) 911–939.
[25] A. Sleimi, N. Sannier, M. Sabetzadeh, L. C. Briand, M. Ceci, J. Dann, An automated framework for
the extraction of semantic legal metadata from legal texts, Empir. Softw. Eng. 26 (2021) 43.
[26] O. Kosenkov, M. Unterkalmsteiner, D. Méndez, D. Fucci, T. Gorschek, J. Fischbach, On developing
an artifact-based approach to regulatory requirements engineering, in: MoDRE 2024, 2024.
[27] T. Athan, G. Governatori, M. Palmirani, A. Paschke, A. Wyner, LegalRuleML: Design principles
and foundations, Reasoning Web. Web Logic Rules 2015 (2015) 151–188.
[28] D. Merigoux, N. Chataing, J. Protzenko, Catala: a programming language for the law, Proc. ACM</p>
        <p>Program. Lang. 5 (2021) 1–29.
[29] G. Boella, L. Humphreys, R. Muthuri, P. Rossi, L. van der Torre, A critical analysis of legal
requirements engineering from the perspective of legal practice, in: RELAW 2014, 2014, pp. 14–21.
[30] S. Ghanavati, D. Amyot, A. Rifaut, Legal goal-oriented requirement language (legal GRL) for
modeling regulations, in: MiSE 2014, 2014, pp. 1–6.
[31] S. Ingolfo, A. Siena, A. Susi, A. Perini, J. Mylopoulos, Modeling laws with nomos 2, in: RELAW
2013, 2013, pp. 69–71.
[32] L. Longo, Argumentation for knowledge representation, conflict resolution, defeasible inference
and its integration with machine learning, Machine Learning for Health Informatics (2016) 183–208.
[33] M. Billi, R. Calegari, G. Contissa, F. Lagioia, G. Pisano, G. Sartor, G. Sartor, Argumentation and
defeasible reasoning in the law, J — Multidisciplinary Scientific Journal 4 (2021) 897–914.
[34] G. Pisano, Argumentation for Legal Reasoning: Meta-models, Technology and Beyond, Ph.D.</p>
        <p>thesis, UNIBO - Università di Bologna, Italy, 2024.
[35] C. Buil-Aranda, M. Ugarte, M. Arenas, M. Dumontier, A preliminary investigation into SPARQL
query complexity and federation in bio2rdf, in: AMW 2015, 2015.
[36] M. Arias, J. D. Fernández, M. A. Martínez-Prieto, P. de la Fuente, An empirical study of real-world</p>
        <p>SPARQL queries, CoRR abs/1103.5043 (2011).
[37] G. Governatori, Defeasible description logics, in: Rules and Rule Markup Languages for the</p>
        <p>Semantic Web, 2004, pp. 98–112.
[38] F. Haag, S. Lohmann, S. Siek, T. Ertl, QueryVOWL: Visual composition of SPARQL queries, in:</p>
        <p>ESWC 2015 Satellite Events, 2015, pp. 62–66.
[39] S. McLachlan, L. C. Webley, Visualisation of law and legal process: An opportunity missed,</p>
        <p>Information Visualization 20 (2021) 192–204.
[40] S. Bryzgalova, S. Lerner, M. Lettau, M. Pelger, Missing financial data, The Review of Financial</p>
        <p>Studies 38 (2024) 803–882.
[41] G. Sartor, Legal concepts as inferential nodes and ontological categories, Artificial Intelligence
and Law 17 (2009) 217–251.
[42] A. Gangemi, V. Presutti, Ontology Design Patterns, Springer Berlin Heidelberg, Berlin, Heidelberg,
2009, pp. 221–243.
[43] P. Pareti, G. Konstantinidis, A Review of SHACL: From Data Validation to Schema Reasoning for</p>
        <p>RDF Graphs, Springer International Publishing, Cham, 2022, pp. 115–144.
[44] G. Boella, M. Janssen, J. Hulstijn, L. Humphreys, L. van der Torre, Managing legal interpretation
in regulatory compliance, in: ICAIL 2013, 2013, p. 23–32.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ceci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sannier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Abualhaija</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Shin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bianculli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halling</surname>
          </string-name>
          ,
          <article-title>Toward automated compliance checking of fund activities using runtime verification techniques</article-title>
          ,
          <source>in: FinanSE</source>
          <year>2024</year>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Amor</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Dimyadi,</surname>
          </string-name>
          <article-title>The promise of automated compliance checking</article-title>
          ,
          <source>Developments in the Built Environment</source>
          <volume>5</volume>
          (
          <year>2021</year>
          )
          <fpage>100039</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Chittoor</surname>
          </string-name>
          ,
          <article-title>Implementing data lineage frameworks in financial institutions: A systematic analysis of compliance, eficiency, and risk management</article-title>
          ,
          <source>Int. J. of Science and Research Archive</source>
          <volume>14</volume>
          (
          <year>2025</year>
          )
          <fpage>353</fpage>
          -
          <lpage>361</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Bogaerts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jakubowski</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Van den Bussche</surname>
          </string-name>
          ,
          <article-title>SHACL: A description logic in disguise</article-title>
          ,
          <source>in: Logic Programming and Nonmonotonic Reasoning</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>75</fpage>
          -
          <lpage>88</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Sannier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ceci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Abualhaija</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bianculli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halling</surname>
          </string-name>
          ,
          <article-title>A model toward formalizing and monitoring compliance of investment funds activities</article-title>
          ,
          <source>in: MoDRE</source>
          <year>2024</year>
          ,
          <year>2024</year>
          , pp.
          <fpage>272</fpage>
          -
          <lpage>280</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Robaldo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Batsakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Calegari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Calimeri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fujita</surname>
          </string-name>
          , G. Governatori,
          <string-name>
            <given-names>M.</given-names>
            <surname>Morelli</surname>
          </string-name>
          , G. Pisano,
          <string-name>
            <given-names>K.</given-names>
            <surname>Satoh</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Tachmazidis</surname>
          </string-name>
          ,
          <article-title>Taking stock of available technologies for compliance checking on ifrst-order knowledge</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          <volume>3204</volume>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y. N.</given-names>
            <surname>Harari</surname>
          </string-name>
          ,
          <article-title>Nexus: A Brief History of Information Networks from the Stone Age to AI</article-title>
          ,
          <string-name>
            <surname>Diversified</surname>
            <given-names>Publishing</given-names>
          </string-name>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ceci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bianculli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Briand</surname>
          </string-name>
          ,
          <article-title>Defining a model for content requirements from the law: An experience report</article-title>
          ,
          <source>in: RE</source>
          <year>2024</year>
          ,
          <year>2024</year>
          , pp.
          <fpage>18</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ceci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Al Khalil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. O</given-names>
            <surname>'Brien</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Butler</surname>
          </string-name>
          ,
          <article-title>Requirements for an intermediate language bridging legal text and rules</article-title>
          , in: MIREL@JURIX,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>L.</given-names>
            <surname>Robaldo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Batsakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Calegari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Calimeri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fujita</surname>
          </string-name>
          , G. Governatori,
          <string-name>
            <given-names>M.</given-names>
            <surname>Morelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Pacenza</surname>
          </string-name>
          , G. Pisano,
          <string-name>
            <given-names>K.</given-names>
            <surname>Satoh</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Tachmazidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zangari</surname>
          </string-name>
          ,
          <article-title>Compliance checking on first-order knowledge with conflicting and compensatory norms: a comparison among currently available technologies</article-title>
          ,
          <source>Artif. Int. and Law</source>
          <volume>32</volume>
          (
          <year>2024</year>
          )
          <fpage>505</fpage>
          -
          <lpage>555</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abualhaija</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ceci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sannier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bianculli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Briand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          , M. Bodellini,
          <article-title>AI-enabled regulatory change analysis of legal requirements</article-title>
          ,
          <source>in: RE</source>
          <year>2024</year>
          ,
          <year>2024</year>
          , pp.
          <fpage>5</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rotolo</surname>
          </string-name>
          , G. Governatori, G. Sartor,
          <article-title>Deontic defeasible reasoning in legal interpretation: two options for modelling interpretive arguments</article-title>
          ,
          <source>in: ICAIL</source>
          <year>2015</year>
          ,
          <year>2015</year>
          , pp.
          <fpage>99</fpage>
          -
          <lpage>108</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Guitton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tamò-Larrieux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mayer</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. van Dijck</surname>
          </string-name>
          ,
          <article-title>The challenge of open-texture in law</article-title>
          ,
          <source>Artificial Intelligence and Law</source>
          (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Leith</surname>
          </string-name>
          , Logic,
          <source>formal models and legal reasoning</source>
          , Jurimetrics J.
          <volume>24</volume>
          (
          <year>1983</year>
          )
          <fpage>334</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <article-title>CSSF, FAQ concerning the Law of 17 December 2010 relating to UCITS</article-title>
          , https://www.cssf.lu/ wp-content/uploads/FAQ_Law_17_December_
          <year>2010</year>
          .pdf,
          <year>2025</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-20.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>E.</given-names>
            <surname>Francesconi</surname>
          </string-name>
          , G. Governatori,
          <article-title>Patterns for legal compliance checking in a decidable framework of linked open data</article-title>
          ,
          <source>Artificial Intelligence and Law</source>
          <volume>31</volume>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abualhaija</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ceci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sannier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bianculli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lannier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Siclari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Voordeckers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tosza</surname>
          </string-name>
          ,
          <article-title>LLM-assisted elicitation of regulatory requirements: A case study on the GDPR</article-title>
          , in: RE 2025 (to appear),
          <year>2025</year>
          , pp.
          <fpage>5</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Gan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Qi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <article-title>Large language models in law: A survey</article-title>
          ,
          <source>AI</source>
          Open 5
          <article-title>(</article-title>
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>C.</given-names>
            <surname>Panigutti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hamon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Hupont</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Fernandez</given-names>
            <surname>Llorca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Fano</given-names>
            <surname>Yela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Junklewitz</surname>
          </string-name>
          , S. Scalzo,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>