<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards Integrating the FLG Framework with the NLP Combinatory Framework</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Amin Rabinia</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mauro Dragoni</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sepideh Ghanavati</string-name>
        </contrib>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>BACKGROUND
Towards Integrating the FLG Framework with the NLP Combinatory</p>
      <p>Framework
1 School of Computing and Information Science, University of Maine, Orono, ME, USA</p>
      <p>{amin.rabinia, sepideh.ghanavati}@maine.edu
2 Researcher at Fondazione Bruno Kessler, Trento, Italy</p>
      <p>dragoni@fbk.eu
Automatic modeling of privacy regulations is a highly
demanded goal in requirements engineering. The FOL-based
Legal-GRL (FLG) is a semi-automated goal-oriented
modeling framework for extracting and representing the legal
requirements of IT systems. One limitation of the FLG
framework, however, is its manual requirements extraction
process. Manual extraction of legal requirements is
cumbersome, error-prone, and time-consuming. To overcome this
shortcoming, we integrate this requirements modeling
framework with another framework that combines several natural
language processing (NLP) approaches. This Combinatory
framework specifically exploits NLP techniques, such as
Part-Of-Speech tagging and syntactic parsing, along with
NLP tools, such as C&amp;C and Boxer, to propose an automated
approach for extraction of rules from legal texts. This
approach enables us to fully automate the FLG framework in
order to propose a comprehensive framework for modeling
privacy regulations and other legal requirements. This paper
outlines the two frameworks and their integration process.</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>
        Privacy regulations enforce a set of legal requirements for
any IT system. Developers need to capture and satisfy such
requirements in their products. Legal-GRL
        <xref ref-type="bibr" rid="ref4">(Ghanavati et al.
2014)</xref>
        is a modeling language for extracting and
representing privacy regulations. One drawback of Legal-GRL is its
manual modeling process. This limitation has motivated an
alternative modeling framework, the FOL-based
LegalGRL (FLG)
        <xref ref-type="bibr" rid="ref5 ref6">(Rabinia and Ghanavati 2017 and 2018)</xref>
        . The
FLG framework suggests a new approach based on
Firstorder Logic (FOL) to facilitate the automation of modeling
process. The FLG covers two main tasks of legal
requirements extraction and representation. The representation
task, is fully automated
        <xref ref-type="bibr" rid="ref6">(Rabinia and Ghanavati 2018)</xref>
        . The
1 The database is accessible at
https://github.com/PERC-Lab/FLGFramework
requirements extraction task, however, is still manual. To
automate this task, we suggest integration of FLG with the
NLP Combinatory framework
        <xref ref-type="bibr" rid="ref3">(Dragoni et al. 2016)</xref>
        . This
framework combines several natural language processing
(NLP) techniques and tools in order to extract rules from
legal texts. While the Combinatory framework performs the
extraction task, the FLG handles the representation part of
legal requirements modeling. In this paper, we aim to sketch
a brief outline of the two frameworks and the
theoretical/practical possibilities for integration of both in one
unified framework.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Background</title>
      <p>
        The FLG framework consists of three phases: A. manual
requirements extraction; B. storing the requirements in a
database; and, C. requirements models generation. The
extraction phase (A) starts with collecting and analyzing natural
language legal statements. This is a manual task that legal
experts have to accomplish using a legal ontology. The
result of this process is a set of atomized and structured
statements called Restricted Natural Language Statements
(RNLSs). In the second phase (B), the RNLSs, which are in
FOL notation format, would be stored in an SQL database.1
This database is capable of exporting the data (or the
formalized legal requirements) as a formatted XML file. In the
third phase (C), the XML file is imported into the modeling
tool support, jUCMNav
        <xref ref-type="bibr" rid="ref1">(Amyot et al. 2011)</xref>
        , to
automatically generate the requirements goal models.
      </p>
      <p>To automate the first phase of the FLG framework, we
integrate it with the NLP Combinatory framework. This
framework combines two sets of NLP techniques: first,
Stanford Parser and C&amp;C/Boxer, for grammatical analysis
of the sentences, and second, WordNet, for handling the
deontic language of the legal texts. The input data, consisting
of natural language statements from a legal text, pass
through two separate pipelines that perform the NLP
processes, mentioned above, to extract the rules from the
original statements. The result from both branches of the pipeline
will be combined to confirm each other.</p>
    </sec>
    <sec id="sec-4">
      <title>Integration of the Frameworks</title>
      <p>Fig.1. shows a sketch of the integration of the two
frameworks. The integrated framework would be organized in
three phases: A. Legal requirements extraction, where the
legal documents are imported into the NLP Combinatory
pipeline and the RNLSs would be exported. The process
inside the pipeline entails application of several ontologies
along with the NLP techniques upon the input statements.
B. Storing the requirements, where the RNLSs would be
stored and finally retrieved as an XML/GRL file. C. Goal
model generation, where the XML file is imported into the
modeling tool support, jUCMNav, to generate the legal
requirements goal models. Within the integrated framework,
the entire process of modeling would be automated. This
possibility, however, demands further
modifications/improvements of the two frameworks, as they are still in
progress. The integration process takes place in three layers:
1) A unified format of output/input needs to be set for a
proper dataflow between the two frameworks. Since the
FLG framework has an SQL Server database, it can import
data from a variety of file formats including a plain text file.
The RNLSs, then, can be stored in the NLP Combinatory
framework as a text file and imported in the FLG database.</p>
      <p>2) The FLG framework requires a finer-grained analysis
of the rules than what the NLP Combinatory framework
provides as its output. The general format of such output is:
Term1 =&gt; [O] Term2, where terms are parts of a sentence or
statement. However, the final goal models of the FLG need
further details of the requirements, e.g. actors. These
elements of the statements are accessible within the NLP
Combinatory framework, as it exploits the NLP techniques..</p>
      <p>3) The FLG framework has also a legal ontology that
helps simplifying the legal requirements models. During the
integration process, the ontologies of the two frameworks
should be integrated in order to handle syntactical
modifications of the statements.</p>
      <p>We perform the integration procedure in an iterative,
nonsequential fashion. The integration of ontologies (3) should
be the first priority, since it contributes to the second layer.
Running the dataflow (1) between the two frameworks also
can be done afterwards, when the data format is matched
during 2 and 3.</p>
      <p>The outcome of this integration is a fully automated
framework for modeling of privacy requirements.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Amyot</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mussbacher</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghanavati</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kealey</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>GRL</given-names>
            <surname>Modeling</surname>
          </string-name>
          and
          <article-title>Analysis with jUCMNav</article-title>
          .
          <source>iStar, 766</source>
          , pp.
          <fpage>160</fpage>
          -
          <lpage>162</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Dragoni</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villata</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rizzi</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Governatori</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <year>2016</year>
          ,
          <string-name>
            <surname>December.</surname>
          </string-name>
          <article-title>Combining NLP approaches for rule extraction from legal documents</article-title>
          .
          <source>In 1st Workshop on MIning and REasoning with Legal texts (MIREL</source>
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Ghanavati</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amyot</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rifaut</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <year>2014</year>
          , June.
          <article-title>Legal goaloriented requirement language (legal GRL) for modeling regulations</article-title>
          .
          <source>In Proceedings of the 6th international workshop on modeling in software engineering</source>
          (pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Rabinia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Ghanavati</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <year>2017</year>
          ,
          <string-name>
            <surname>September.</surname>
          </string-name>
          <article-title>FOL-Based Approach for Improving Legal-GRL Modeling Framework: A Case for Requirements Engineering of Legal Regulations of Social Media</article-title>
          .
          <source>In 2017 IEEE 25th International Requirements Engineering Conference Workshops (REW)</source>
          (pp.
          <fpage>213</fpage>
          -
          <lpage>218</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Rabinia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Ghanavati</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <year>2018</year>
          ,
          <string-name>
            <surname>August.</surname>
          </string-name>
          <article-title>The FOL-Based Legal-GRL (FLG) Framework: Towards an Automated Goal Modeling Approach for Regulations</article-title>
          . In 2018 IEEE 8th International
          <string-name>
            <surname>Model-Driven Requirements</surname>
          </string-name>
          Engineering Workshop (MoDRE) (pp.
          <fpage>58</fpage>
          -
          <lpage>67</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>