<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Network Analysis to Support Navigation and Use of Ever-Changing Legislation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>GUIDO BOELLA</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>LUIGI DI CARO</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>LIVIO ROBALDO</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>ANDREA VIOLATO</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Turin</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Nomotika s.r.l</institution>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In this abstract we introduce the working context related to the understanding of an
heterogeneous network of references contained in the Italian regulatory framework. We then
present an extended analysis of a large network of laws, providing several types of
analytical evaluation that can be used within a legal management system for understanding
the data through summarization, visualization, and browsing.</p>
      <p>The importance of Information Technology tools lies on the ability of limiting the
well-known complexity of the regulatory framework. Starting from the language used to
express concepts and rules that often create serious problems of interpretation, scientia
iuris is full of innumerable aspects that make it difficult to manage by domain experts.
The use of computer systems can reduce this complexity for those who have to do with
a legal reality that results to be more and more interdisciplinary, international and
multifunctional. All this is even more true by taking into account one of the most complex
aspects of any modern legal text, which is the extensive use of references. It is therefore
the intention of this paper to first investigate the need for the development of tools capable
of assisting the employee to manage the particular regulatory complexity characterized
by cross-references, and then to simulate possible technological solutions.</p>
      <p>The presence of thousands of stratifications in the Italian legal sources over the years
has enabled the strengthening of the use of cross-references between legal texts in order
to complete the content. This scenario is even more complex when considering a
second and unique aspect of this discussion: the temporal dynamism that characterizes the
use of normative references in any legal context. In fact, it is not a rarity that, over the
years, the legislature put hand to the subjects adapting the contents to the stimuli given
by socio-economic factors and the surrounding cultural context. This natural evolution
is often mediated through a surgical use of references which permit to individuate the
precise points under modification. In the Italian scenario, this assumption is far from
being uncommon. Many, in fact, are the examples of legal decrees subsequently converted
(even with modifications) into laws. Finally, it is worth noting how the use of normative
references can also take place outside the purely legislative context. In fact, it is quite
common to find a citation of a law inside a legal text that is not used with explicit
references, but also by case law of the courts, which, in exercising a peculiar hermeneutical
activity of this rule, it implicitly complements the content, clarifying the more conceptual
and obscure aspects and thus bringing important suggestions to the reader.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] we introduced the Eunomos software, which is being developed in the context
of the ICT4LAW project1. Eunomos is an advanced legal document management
system based on legislative XML representation of laws which are retrieved automatically
from institutional legislative portals, and incorporates a tool for building legal ontologies
called Legal Taxonomy Syllabus [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Techniques and Applications</title>
      <p>In legislation, all the data can be also viewed as complex networks where nodes are laws
and links represent kinds of relationship like “modification”, “implementation”,
“substitution”, and so on. These information are often complex to treat, organize, and use since
they are many and continuosly changing over time. Manual intervention, indeed, is often
required in classical tasks because of the difficulty to get the “big picture”, that is, to have
an at-a-glance overview over the data under evaluation.</p>
      <p>
        Social Network Analysis is quite a new field that inherits methods from Phisics
(i.e., Complex Networks) and Mathematics (i.e., Theory of Graphs) to face problems
related to the huge amount of data coming from social networks like Twitter2, Facebook3,
Flickr4, and so forth. These algorithms are useful to capture statistics about the type of
the networks along their properties. More in detail, it is possible to analyze a network
in terms of its evolution over time, important nodes, implicit relationships, etc. All these
evaluations can be helpful also in a network of laws. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] presents a large overview over
the mathematical properties of graphs. In this section we present a set of analyses that
may help the jurist to deal with these networks.
      </p>
      <p>The first statistical analysis of a graph is given by the distribution of the number of
neighbours of a node, also called degree. In real networks, the degree distribution has
a tail that often follows a power law, that means that it contains many nodes with low
degree and some node with large degree. This characteristic of real networks is called
community structure. Such communities are also commonly named clusters, and they
represent nodes that play similar roles within the graph. A common and often useful
analysis of a network looks at such latent community structures, and it is based on some
clustering approach. In our test we made use of the concept of edge betweenness of an
edge, i.e., the extent to which it lies along shortest paths between all pairs of nodes in
the network. This algorithm works by iteratively following two steps: computing the
edge betweenness for all edges in the current graph and then removing the edge with the
highest betweenness value. This analysis helps finding communities, since it iteratively
removes central nodes to separate the graph in distinct subnetworks.</p>
      <p>
        Relationships between nodes of a network may have a precise direction, that needs to
be taken into account to understand the system as a whole. PageRank [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is an algorithm
1ICT4LAW: ICT Converging on Law: Next Generation Services for Citizens, Enterprises, Public
Administration and Policymakers funded by Regione Piemonte 2008-2013, call Converging Technologies 2007,
website: http://www.ict4law.org
2http://www.twitter.com/
3http://www.facebook.com/
4http://www.flickr.com/
that assigns a weight to each node of a network in the World Wide Web domain, with
the purpose of quantifying its relative importance within it. In spite of its original use
on hyperlinks, it can be useful for several other domains dealing with directed graphs.
Generally speaking, in the legal domain, if one law modifies a law B, one usually does
not find on B a link back to A. In some case, there can be few relationships that may be
reciprocal (i.e., citations). The PageRank algorithm measures the importance of a node
by considering that of the nodes that link to it. Thus, this indicates a finer node evaluation
with respect to the edge betweenness. An accurate analysis of the most important nodes
within a network of laws can be helpful to understand and use the data.
      </p>
      <p>The diameter of a graph can give useful insights. In detail, it is the largest number of
nodes which must be traversed in order to travel from one node to another. In a network of
laws it can be used to estimate the maximum complexity of modification/citation paths.
Since a network of law is often disconnected, it is constituted by many subnetworks. In
this case, the diameter considers the maximum distance found in all subnetworks.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Data and Results</title>
      <p>In this section we present an analysis of the data coming from Eunomos, i.e., a network
of around 10K laws interconnected by 7K links. The dataset contains different types of
laws, as for instance “stato:legge”, that indicates the principal type of law of Italy. The
resulting distribution of the degree levels demonstrated that most of the laws have few
connections with other laws, whereas some other laws have a large connectivity within
the graph. This is in line with the majority of the networks automatic systems have to
deal with.</p>
      <p>The diameter of the network (actually, the maximum diameter of all the disconnected
subnetworks in it) is 8. This means that, in the worst case, if a jurist has to navigate and
understand a path between two laws in the network, it could go through other eight laws.
Still, links can have different meanings, so the process may include multiple cateogories
of operations. Having a tool that can support this navigation may represent a dramatically
help in such process.</p>
      <p>Finally, the distribution of the PageRank scores over the nodes/laws of the network
gives an interesting result since it seems to perfectly split the important nodes (also called
hubs) from the others. A system that makes use of these scores can filter out unimportant
information rather than letting emerge crucial points within the regulatory framework.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Gianmaria</given-names>
            <surname>Ajani</surname>
          </string-name>
          , Guido Boella, Leonardo Lesmo, Alessandro Mazzei, Daniele P Radicioni, and
          <string-name>
            <given-names>Piercarlo</given-names>
            <surname>Rossi</surname>
          </string-name>
          .
          <article-title>Legal taxonomy syllabus: Handling multilevel legal ontologies</article-title>
          .
          <source>Proceedings of Langtech</source>
          <year>2008</year>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Boella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Humphreys</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rossi</surname>
          </string-name>
          , and
          <string-name>
            <surname>L. van der Torre. Eunomos,</surname>
          </string-name>
          <article-title>a legal document management system based on legislative XML and ontologies (position paper). Applying Human Language Technology to the Law</article-title>
          , page
          <volume>41</volume>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Be</surname>
          </string-name>
          <article-title>´la Bolloba´s. Modern graph theory</article-title>
          , volume
          <volume>184</volume>
          . Springer,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Lawrence</given-names>
            <surname>Page</surname>
          </string-name>
          , Sergey Brin, Rajeev Motwani, and
          <string-name>
            <given-names>Terry</given-names>
            <surname>Winograd</surname>
          </string-name>
          .
          <article-title>The pagerank citation ranking: bringing order to the web</article-title>
          .
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>